GRP Productions wrote on Sun, 13 Mar 2005 22:54:22 +0200:

> Perhaps I have not been clear enough. It's not only that the files' size is 
> constant. I am pasting the output of dump magic,

That is the output of --dump magic? I have never seen it formatted that 
nicely. I assume you skipped the first line, but the expire atime delta is 
also missing. So where did you get this from? Not directly from sa-learn 
--dump magic, I'd say. Are you running SA through some interface? You should 
have said something about how your installation is set up.

> and I have to explain that 
> the nham and nspam values are the same for many days now.

Ok. Get the values. Then have it learn one message. Make sure it reports that 
it actually learned the message, then check the values again. Did either the 
spam or the ham count increase by one, or not?
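
If you want to make that comparison less error-prone, here is a minimal 
Python sketch. It assumes you saved the raw output of sa-learn --dump magic 
to text before and after learning, and that each line looks like 
"<prob> <spam> <value> <atime>  non-token data: <name>". The function names 
parse_magic and learned_delta are made up for this example, they are not part 
of SA.

```python
import re

# Assumed line shape of `sa-learn --dump magic` output:
#   0.000          0      49740          0  non-token data: nspam
MAGIC_LINE = re.compile(
    r"\s*\S+\s+\S+\s+(\d+)\s+\S+\s+non-token data:\s*(.+?)\s*$"
)

def parse_magic(dump_text):
    """Return a dict such as {'nspam': 49740, 'nham': 47167, ...}."""
    values = {}
    for line in dump_text.splitlines():
        match = MAGIC_LINE.match(line)
        if match:
            values[match.group(2)] = int(match.group(1))
    return values

def learned_delta(before_text, after_text):
    """Return how much nspam and nham changed between two dumps."""
    before = parse_magic(before_text)
    after = parse_magic(after_text)
    return {
        key: after.get(key, 0) - before.get(key, 0)
        for key in ("nspam", "nham")
    }
```

A delta of 1 for nspam (or nham) after learning a single message means the 
database did take it. A delta of 0 for both means nothing was learned.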

> work fine. If I send to myself a message from Yahoo, with subject 'Viagra 
> sex teen ........" and other nice words, I certainly do not want it to pass. 
> Bayes classifies it as 50% spam.  I tried to sa-learn --forget, and then 
> re-learn, still is BAYES_50.

Again, this is NOT how Bayes works. You can't have it learn one message and 
then expect it to flag that message as spam next time. Bayes does not work 
like this!
And when it classifies that message as 50%, it means it cannot determine 
whether the message is ham or spam. That only says the tokens in the db are 
not good enough for that message. Or maybe the message contains enough hammy 
tokens to balance the spammy ones, whatever.

> Number of Spam Messages: 49,740 
> Number of Ham Messages: 47,167 
> Number of Tokens: 123,325 
> Oldest Token: Wed, 2 Feb 2005 06:37:53 +0200 
> Newest Token: Sat, 12 Mar 2005 16:07:30 +0200 

That says it added or changed a token yesterday.

> Last Journal Sync: Fri, 11 Feb 2005 18:03:10 +0200 
> Last Expiry: Fri, 11 Feb 2005 15:45:34 +0200 
> Last Expiry Reduction Count: 3,475 tokens

Ok, this finally looks a bit suspicious. No sync and no expire for a month. If 
it doesn't sync, you don't get new tokens. Check in your bayes directory how 
big your bayes_journal is. I'd think it's quite big. Do a sync now. (Please 
don't do it via an interface, do it on the command line.) What's the output? 
Is the journal gone and has the number of tokens increased now? If so, you 
need to investigate why it doesn't sync anymore. Then run an expire as well.
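
The steps above could be scripted roughly like this. It is only a sketch under 
some assumptions: that you use the default DBM storage, where pending updates 
sit in a file named bayes_journal inside the bayes directory, and that sa-learn 
is in the PATH of the user who owns the database. The helper names 
journal_size and run_steps are invented for this example.

```python
import os
import subprocess

def journal_size(bayes_dir):
    """Return the size of bayes_journal in bytes, or 0 if it does not exist."""
    # Assumption: DBM storage with the journal file named "bayes_journal".
    path = os.path.join(os.path.expanduser(bayes_dir), "bayes_journal")
    return os.path.getsize(path) if os.path.exists(path) else 0

# Sync the journal, show the counters, then force an expiry run.
STEPS = [
    ["sa-learn", "--sync"],
    ["sa-learn", "--dump", "magic"],
    ["sa-learn", "--force-expire"],
]

def run_steps(steps=STEPS):
    """Run each sa-learn command in order and echo it first."""
    for cmd in steps:
        print("$", " ".join(cmd))
        # check=False: keep going so the output of every step can be inspected.
        subprocess.run(cmd, check=False)
```

Call journal_size() on your bayes directory before and after run_steps(). If 
the journal shrinks or disappears and ntokens goes up, the sync itself works 
and the question is why it no longer runs on its own.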


Kai

-- 
Kai Schätzl, Berlin, Germany
Get your web at Conactive Internet Services: http://www.conactive.com
IE-Center: http://ie5.de & http://msie.winware.org


