This doesn't prove anything. sa-learn --dump magic shows you what's inside.
Also, Bayes is not a checksum system like Razor, that's its strength. If you
learn something to it that means that it extracts tokens (short pieces) from
the message and adjusts its internal probability for them being ham or spam by
a certain factor. Or if it doesn't know that token yet it adds it.
That the size doesn't grow can have several reasons, f.i. expiry or the fact
that the db format seems to have some "air" in it, so that it grows in jumps
and not continually.

Perhaps I have not been clear enough. It's not only that the files' size is constant. I am pasting the output of dump magic, and I have to explain that the nham and nspam values are the same for many days now. This is not normal, since we are talking about a very busy server (more than 4,000 messages per day). This behaviour has not always been the case, it used to work fine. If I send to myself a message from Yahoo, with subject 'Viagra sex teen ........" and other nice words, I certainly do not want it to pass. Bayes classifies it as 50% spam. I tried to sa-learn --forget, and then re-learn, still is BAYES_50. The nham and nspam values used to increase very rapidly (sometimes by a value of 200-300 per day). No errors are produced. I wouldn't have noticed the particular problem, but fortunately during the last days we started having more spam than usual to be passing. Also, I tried to force an expiration many times, but as you can see the expiration did not take place. Its definitely not a file permission issue.


Thanks

Number of Spam Messages:        49,740
Number of Ham Messages: 47,167
Number of Tokens:       123,325
Oldest Token:   Wed, 2 Feb 2005 06:37:53 +0200
Newest Token:   Sat, 12 Mar 2005 16:07:30 +0200
Last Journal Sync:      Fri, 11 Feb 2005 18:03:10 +0200
Last Expiry:    Fri, 11 Feb 2005 15:45:34 +0200
Last Expiry Reduction Count:    3,475 tokens

_________________________________________________________________
FREE pop-up blocking with the new MSN Toolbar - get it now! http://toolbar.msn.click-url.com/go/onm00200415ave/direct/01/




Reply via email to