It looks like the process is getting killed from an external signal.
Maybe this is the Linux OOM killer in action? What is the memory/swap
status of this machine? Have you tried running sa-learn with the -D option?

Leander


On 29-jul-2006, at 0:31, Steven Scotten wrote:

The bayesian filter seems super-delicate. If I run sa-learn on a
mailbox with more than about 200 messages in it, it gets killed, I'm
not sure why:

$ sa-learn --spam --dir Maildir/.spam/cur/
Killed
$

If sa-learn gets killed in the middle, it leaves a database that it
thinks is empty.

Before a killed process:

debug: bayes: found bayes db version 3
debug: bayes corpus size: nspam = 592, nham = 562

After a killed process:

debug: bayes: found bayes db version 3
debug: bayes: Not available for scanning, only 0 spam(s) in Bayes DB < 200

rescanning doesn't do any good, because sa-learn still knows about the
messages it's already looked at. I have to start training all over by
deleting bayes_seen and bayes_toks. Furthermore, this kills my
bayesian filter and Spamassassin lets through about 75% of my incoming
spam without it.

I've got thousands of spams and hams ready to feed to sa-learn, but
having to feed them 100 at a time is cumbersome and starting over
again a dozen times in the last few days

Other than backing up my .spamassassin directory before I run sa-learn
each time, are there any suggestions? I'm running 3.0.3, but it's a
hosted box so upgrading isn't my call.

Thanks,


Steve
--
Steven M. Scotten
<[EMAIL PROTECTED]>
The future will blow your mind


Reply via email to