Re: bayes learn best practice

Arthur Kerpician Thu, 09 Apr 2009 10:26:14 -0700

Kai Schaetzl wrote:

Arthur Kerpician wrote on Thu, 09 Apr 2009 09:41:22 +0300:
The docs mention that after 5000 spam and ham learned,spamassassin doesn't improve spam detection much.
do they? What is meant is that once you reach some threshold the detectionrate doesn't improve as good as before. You can't get any better as"nearly everything". But it will drop if no new tokens get added.
What is the best
practice to optimize the bayes detection? Should I stop auto-learningafter reaching the 5000 mark and than re-train from time to time fromscratch?
No, keep the automatic training (unless there are too many FPs in theautotrained messages). Do a regular manual expire, so old tokens arepurged out.

I don't get many FPs or FNs after upgrading to 3.2.5 and retrainingbayes. But, if I keep auto-learning enabled, I should monitor thetrained spam and ham levels and manual train ham when the spam exceedsit (as it will always exceed ham level). So from time to time I shouldfeed ham manually to sa-learn, until it reaches the spam level again. Isthis correct? If it is, I think it's rather time-consuming to alwayscheck the trained ham/spam and level them.

I was thinking to increase bayes_auto_learn_threshold_spam to a highernumber, so less spam is auto-learned. Is this ok?

Re: bayes learn best practice

Reply via email to