I try to understand how dspam is trained. My setup is IMAP + Maildir, thus, I 
have created a Junk and a NoSpam directory at which users add the spams and 
the false positive mails respectively. 
I run a script every hour in order to collect data. The script trains dspam 
as:
dspam --user $USER --class=spam --source=error < $j
where $USER is the username (not the mail address, but the username) and $j is 
the file that is spam but is classified as Innocent. 

Now, what I cannot understand is this:
I have a lot of emails with the same subject and almost identical body. I had 
trained dspam to handle those emails as errors. However, I still receive 
those emails! 
Since I do have the emails, I run dspam from command line in order to see the 
classification result as:
dspam --mode=notrain --user username --classify --stdout<mail_file
The result was:
X-DSPAM-Result: username; result="Innocent"; class="Innocent"; 
probability=0.0000; confidence=1.00; signature=489787f2131472612618147

So, why? The message was feed to dspam just a couple of minutes ago, with the 
same command as above (source=error).

Shouldn't dspam report the file as spam?
Thank you 
Peter


!DSPAM:1011,4897fde0150921570549289!


Reply via email to