Re: Bayes resolution gettin weaker

Jim Maul Mon, 12 Feb 2007 10:36:18 -0800

Jack Gostl wrote:

Well... I'm convinced. I turned off autolearn a week ago, and thingshave never been smoother. Its a shame really, that's a nice feature, butfor some reason it waters down the Bayes resolution until its almostuseless.

Most likely because the autolearn thresholds are too generous. Thepossibility to autolearn spam as ham and/or ham as spam is too great. Ihave been running with autolearn enabled, my thresholds set to:


bayes_auto_learn_threshold_nonspam -0.1
bayes_auto_learn_threshold_spam 12.0

without any problems for almost 3 years now. My bayes database hasnever been better. I think too many people have problems with itbecause of the defaults and instead of trying to figure out how to makeit work better, they just turn it off and call it "broken".


-Jim

----- Original Message ----- From: "Jack Gostl" <[EMAIL PROTECTED]>
To: "Anthony Peacock" <[EMAIL PROTECTED]>; "SpamAssassin"<users@spamassassin.apache.org>
Sent: Monday, February 05, 2007 7:06 AM
Subject: Re: Bayes resolution gettin weaker
----- Original Message ----- From: "Anthony Peacock"<[EMAIL PROTECTED]>
To: "SpamAssassin" <users@spamassassin.apache.org>
Sent: Monday, February 05, 2007 3:56 AM
Subject: Re: Bayes resolution gettin weaker
Hi,

Jack Gostl wrote:
I've been watching this for awhile, and there is now a pattern towhat I'm seeing.
I'm running a configuration with multiple users sharing a bayesfiles. This is an interim move to facilitate the spamassassinupgrades, and like many interim moves its been going on for a longtime.
When I first build the bayes files from my personal folders and myspam archives, things were great. 99.8% of the spam caught orbetter. Then, usually after a week or so, the number starts todrop. Right now, its down to 97%, in another day or two it will bedown below 95%. With the amount of spam we receive, that is a lotof missed junk mail.
So I blow away my bayes* files, rebuild, and I'm back up to darnnear 100% caught. For about a week. Then the deterioration beginsagain.
Has anyone else encountered this? Is this an artifact of too manyusers sharing a spam file?
Also.... I retrain each night, feeding any missed spams plus anynew hams received back through sa-learn. I can't see how thatmakes it worse, but who knows.
Do you have autolearn enabled?
Uh... yes? You are suggesting that I turn it off? I had alwaysassumed that if the Bayes learned something as ham that itshouldn't, sa-learn was smart enough to undo it.
Change the thresholds for auto learning.  Mine are:

bayes_auto_learn_threshold_nonspam -0.1
bayes_auto_learn_threshold_spam 12.0
I'm willing to try. I made the change in my user_prefs and we'll seewhat the next week brings.
Thanks

Re: Bayes resolution gettin weaker

Reply via email to