https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6149





--- Comment #16 from Karsten Bräckelmann <[email protected]>  2009-08-03 
08:14:30 PST ---
At least the TVD_SPACE_RATIO and GAPPY_SUBJECT are known to FP on Japanese (and
Chinese IIRC), bugs filed.

(In reply to comment #15)
> Is there any way to easily detect if other wt-japanese rule FP ratios are
> differing substantially from the other corpora?

You're using mbox format, so a hack like this should get you some nice
hit-rates at least. Anything close to the top with a positive score is a
candidate.

formail -c -x X-Spam-Status -s < MBOX | \
  sed -re 's/^.+tests=(.+) autolearn.+/\1/' -e 's/[, \t]+/\n/g' | \
  sort | uniq -c | sort -r -n

Hope this help.

-- 
Configure bugmail: 
https://issues.apache.org/SpamAssassin/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

Reply via email to