https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6149





--- Comment #14 from Warren Togami <[email protected]>  2009-08-03 05:22:33 PST ---
I split my two Japanese users into their own masscheck corpus named
wt-japanese.  Unfortunately I have only ~1000 messages in this corpus, but it
already shows definite signs that some rules are bad.  These rules also seem
to have very low Correctly Spam ratios across all corpora.

http://ruleqa.spamassassin.org/20090801-r799815-n/TVD_SPACE_RATIO/detail
26.7% FP rate
http://ruleqa.spamassassin.org/20090801-r799815-n/PLING_QUERY/detail
11.5% FP rate
http://ruleqa.spamassassin.org/20090802-r800007-n/OBSCURED_EMAIL/detail
6.5% FP rate
http://ruleqa.spamassassin.org/20090801-r799815-n/WEIRD_QUOTING/detail
4.8% FP rate
http://ruleqa.spamassassin.org/20090801-r799815-n/GAPPY_SUBJECT/detail
4.7% FP rate

These users insist that they have confirmed their Ham boxes manually.  I would
like to split out folders containing specific rule hits and ask them to choose
a few to submit as samples, but mboxget is misbehaving.  It would be nice if
mboxget could output in mbox format.
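
In the meantime, the per-rule split could be sketched outside of mboxget.  This
is only a rough sketch under a few assumptions: the ham is stored as a single
mbox file, each message still carries an X-Spam-Status header with a "tests="
list from an earlier SpamAssassin scan, and the helper names (rules_hit,
split_by_rule) and output file names are made up for illustration.  It is not
how mboxget or mass-check work.

```python
import mailbox
import os
import re


def rules_hit(message):
    """Return the rule names listed in the X-Spam-Status 'tests=' field."""
    status = str(message.get("X-Spam-Status", ""))
    # The tests list is usually folded after a comma; rejoin those pieces.
    status = re.sub(r",\s+", ",", status)
    match = re.search(r"tests=(\S*)", status)
    if not match:
        return set()
    names = {name for name in match.group(1).split(",") if name}
    names.discard("none")  # "tests=none" means no rule fired
    return names


def split_by_rule(src_path, rules, out_dir):
    """Copy each message into <out_dir>/<RULE>.mbox for every listed rule it hit.

    Returns a dict mapping rule name to the number of messages copied.
    """
    counts = {rule: 0 for rule in rules}
    outputs = {
        rule: mailbox.mbox(os.path.join(out_dir, rule + ".mbox"))
        for rule in rules
    }
    source = mailbox.mbox(src_path, create=False)
    try:
        for message in source:
            hit = rules_hit(message)
            for rule in rules:
                if rule in hit:
                    outputs[rule].add(message)
                    counts[rule] += 1
    finally:
        source.close()
        for box in outputs.values():
            box.flush()
            box.close()
    return counts
```

Calling split_by_rule("ham.mbox", ["TVD_SPACE_RATIO", "PLING_QUERY"], "out")
(paths hypothetical, "out" must already exist) would leave one mbox per rule
that the users can open in a mail client to pick samples from.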

