Skip wrote:
I only use two commands to process my mail:
dspam --debug --stdout --deliver=innocent,spam --mode=teft --feature=noise,whitelist --user skip
for all incoming email

dspam --mode=teft --source=error --class=spam --user skip
to fix the missed spammies

Of course, I did feed a bunch of spam and ham corpus when I first set everything up, but why would I still have a few new spam corpusfed each day?

This table shows the output of dspam_stats at the end of each day since I set this up.
DP=Daily positives
DN=Daily Negatives
DFP=Daily False Positives
DFN=Daily False Negatives
DSC=Daily Spam Corpusfed

and those are just the change from the previous day's value.

Date TP TN FP FN SC NC SHR HSR OSA DP DN DFP DFN DSC 3/04/2008 24 23 0 2 210 3194 0.92 0 0.96 24 23 0 2 210 3/05/2008 41 36 0 3 214 3194 0.93 0 0.96 17 13 0 1 4 3/06/2008 75 60 0 5 219 3194 0.94 0 0.96 34 24 0 2 5 3/07/2008 92 79 0 7 221 3194 0.93 0 0.96 17 19 0 2 2 3/08/2008 115 94 0 14 228 3194 0.89 0 0.94 23 15 0 7 7 3/09/2008 167 103 0 29 255 3194 0.85 0 0.90 52 9 0 15 27 3/10/2008 191 110 0 40 264 3194 0.83 0 0.88 24 7 0 11 9 3/11/2008 254 122 0 46 270 3194 0.85 0 0.89 63 12 0 6 6 3/12/2008 288 136 0 53 283 3194 0.84 0 0.89 34 14 0 7 13 3/13/2008 328 148 0 63 290 3194 0.84 0 0.88 40 12 0 10 7 3/14/2008 374 163 0 66 291 3194 0.85 0 0.89 46 15 0 3 1 3/15/2008 431 180 0 74 303 3194 0.85 0 0.89 57 17 0 8 12 3/16/2008 463 194 0 79 309 3194 0.85 0 0.89 32 14 0 5 6 3/17/2008 511 204 0 88 314 3194 0.85 0 0.89 48 10 0 9 5

As you can see, I do get a few new corpusfed spams each day, but I know I am not using the source=corpus option. Could this be affecting my performance?

Regards,
Skip

Skip,

If you have the TestConditionalTraining option on, then dspam will retrain (corpus-feed) the message a few times, which can make that counter go up. Also, you do not need to specify --mode or --feature on the command line, as dspam will automatically pick them up from either dspam.conf, or from the preferences file. ALSO, if you are only using dspam --mode=teft --source=error --class=spam --user skip for retraining, then how do you fix wrongly classified ham?

--Kyle Johnson!

Reply via email to