On 27.03.2015 19:54, Matus UHLAR - fantomas wrote:
> On 27.03.15 15:16, Michael wrote:
>> I would like to automatically train each user's Bayes database in the
>> following way:
>>
>> Do the following once a day for each user:
>> 1.) sa-learn -u username --ham ../maildir/cur
>> 2.) sa-learn -u username --spam ../maildir/.Spam/cur
> 
>> What do you think about this strategy?
> 
> The easiest way is to train on false positives and false negatives.
> dovecot imapd has a plugin to train when mail is moved to or from spam.

My concerns are the following:
Sometimes a new kind of spam appears. It often gets low scores, just
0.1 to 0.5 points above the spam threshold, which is too low to
trigger the auto-learner.
If the same spam then comes from another sending server, the score is
just below the threshold, and I get a false negative.
If the first spam had already been learned, the second mail would
have been scored as spam.
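
For reference, a minimal sketch of the daily per-user run proposed above.
The sa-learn options (-u, --ham, --spam) and the folder names are taken
from the original proposal. MAIL_ROOT, SA_LEARN and the
/home/<user>/maildir layout are assumptions for illustration only and
would need adjusting to the real setup.

```shell
#!/bin/sh
# Sketch only: the directory layout is hypothetical.
# Assumes each user's Maildir lives at $MAIL_ROOT/<user>/maildir.
MAIL_ROOT="${MAIL_ROOT:-/home}"
# Overridable so the loop can be dry-run, e.g. with SA_LEARN=echo.
SA_LEARN="${SA_LEARN:-sa-learn}"

# Train one user's Bayes database from their inbox and Spam folder.
train_user() {
    user="$1"
    maildir="$MAIL_ROOT/$user/maildir"
    # Mail left in the inbox is treated as ham.
    if [ -d "$maildir/cur" ]; then
        $SA_LEARN -u "$user" --ham "$maildir/cur"
    fi
    # Mail filed in the Spam folder is treated as spam.
    if [ -d "$maildir/.Spam/cur" ]; then
        $SA_LEARN -u "$user" --spam "$maildir/.Spam/cur"
    fi
    return 0
}

# Walk every user directory that contains a maildir.
train_all() {
    for dir in "$MAIL_ROOT"/*/maildir; do
        [ -d "$dir" ] || continue
        train_user "$(basename "$(dirname "$dir")")"
    done
}
```

Calling train_all once a day from cron would cover all users. Setting
SA_LEARN=echo first prints the commands instead of running them. Note
that this treats everything left in the inbox as ham, so a false
negative that the user never moved would be learned as ham.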

> 
> If you use something else, you should create a pair of special folders
> for users to train both ham and spam.
> 
> 
