Re: Very spammy messages yield BAYES_00 (-1.9)

Ben Johnson Wed, 15 Aug 2012 13:56:58 -0700

On 8/15/2012 4:19 PM, Kris Deugau wrote:
> John Hardin wrote:
>> I wasn't aware that autolearning could do a cold-start of Bayes, can
>> anyone confirm whether this is the case?
> 
> If you let it run long enough to pass the 200/200 ham/spam thresholds,
> yes;  there's no distinction I've ever met about where the learning came
> from.
> 
> That said, I wouldn't trust a pure autolearn setup with stock autolearn
> thresholds - all too much spam will get learned scoring under 0.1.  :(
> 
> -kgd
>


It's a bit disappointing to learn this (pardon the pun), given:

a.) This exchange between John Hardin and I, which occurred previously
in this thread:

---8<--------------------------

Me:

> Most of the list is probably laughing, but given the complexity of Spam
> Assassin, this crucial requirement was lost on me, amidst the sea of
> information and instructions. For example, there is no mention of the
> fact that SA is essentially useless without Bayesian training on
> http://wiki.apache.org/spamassassin/StartUsing .

John:

That's because that shouldn't be the case. The base ruleset + URIBL
should be very effective pretty much out-of-the-box.

---8<--------------------------

b.) The default value for bayes_auto_learn is 1 (on). (At least in my
particular distribution.)

Correct me if I'm wrong, but this issue's root cause seems to be that
bayes_auto_learn was on, out-of-the-box, yet I was not complementing its
efficacy via sa-learn.

Is this an accurate summary? Because if so, it seems prudent to change
the default bayes_auto_learn value to zero, and scorn any package
maintainer or developer who modifies it, or, alternatively, put a
banner, at font-size 100em, on the SpamAssassin homepage that issues an
unmistakable warning about Bayesian training's importance.

(John, I'll respond to your most recent message tomorrow most likely;
had enough for one day!)

Thank you,

-Ben

Re: Very spammy messages yield BAYES_00 (-1.9)

Reply via email to