Sorry if this is long... 

***Config*** 
I have spamassassin 2.64 running under Amavisd-new on Postfix set to tag and 
then relay all mail to another mail server. I have 2500 users and process about 
55,000 inbound messages daily (85+% spam). No mail goes out through this box. 

***Problem*** 
Recently I've noticed that my bayes database seems to be working against me - 
*many* clearly spammy messages are getting bayes_0 hits and having negative 
points assigned. When I first set this system up I had never even touched linux 
before, so I just kinda threw it together with whatever FAQ I could find. I 
know this is wrong now, but I did absolutely no manual bayes training - I let 
it auto learn everything. You can see that my spam count is way higher than ham 
(bottom). 

***Questions*** 
Am I better off deleting my database and starting over? 
Or should I just start doing some manual training to try to correct the 
database? 
Lastly, how do I get even spam and ham counts when autolearning and my incoming 
mail consists of 85% spam? 

P.S. - If my setup is lame-a$$ and I should do it another way, please tell me 
(but it seems to be working). 

***Magic Numbers*** 
0.000          0          2          0  non-token data: bayes db version 
0.000          0     592146          0  non-token data: nspam 
0.000          0     201142          0  non-token data: nham 
0.000          0     221687          0  non-token data: ntokens 
0.000          0 1093068678          0  non-token data: oldest atime 
0.000          0 1093369880          0  non-token data: newest atime 
0.000          0 1093369889          0  non-token data: last journal sync atime 
0.000          0 1093328411          0  non-token data: last expiry atime 
0.000          0      43200          0  non-token data: last expire atime delta 
0.000          0      67895          0  non-token data: last expire reduction 
count 

Reply via email to