https://issues.apache.org/SpamAssassin/show_bug.cgi?id=5736





--- Comment #13 from Karsten Bräckelmann <[EMAIL PROTECTED]>  2008-10-30 10:13:47 PST ---
The ruleqa results are heavily biased anyway. The only ham hits are in
Michael's corpus, which is quite small compared to Daryl's and Justin's ham
corpora. Extrapolating the number of hams to align the corpora paints an even
worse picture and makes the S/O ratio drop significantly -- below the
already *poor* 0.5 it shows today (which is without Theo's massive corpus,
granted). Most of the English-centric ham corpora are much less likely to
contain German company domains.
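
To make the extrapolation argument concrete, here is a rough sketch. It assumes S/O is simply spam hits divided by all hits, and every count and corpus size in it is a made-up placeholder, not a figure from ruleqa. The helper names are hypothetical as well.

```python
# Sketch only: all numbers below are hypothetical placeholders,
# not actual ruleqa results.

def so_ratio(spam_hits, ham_hits):
    """S/O as assumed here: share of a rule's hits that are spam."""
    total = spam_hits + ham_hits
    if total == 0:
        raise ValueError("rule has no hits")
    return spam_hits / total

def extrapolate_hits(hits, corpus_size, target_size):
    """Scale a hit count linearly from one corpus size to another."""
    return hits * target_size / corpus_size

# Hypothetical: the rule hits 50 spam and 50 ham, and all ham hits
# come from one small ham corpus of 1000 messages.
spam_hits = 50
ham_hits = 50
small_ham_corpus = 1000
large_ham_corpus = 10000  # hypothetical size of a larger ham corpus

raw = so_ratio(spam_hits, ham_hits)

# If the small corpus were as large as the big one and hit at the
# same rate, the ham hit count would grow accordingly.
scaled_ham = extrapolate_hits(ham_hits, small_ham_corpus, large_ham_corpus)
adjusted = so_ratio(spam_hits, scaled_ham)

print(f"raw S/O:      {raw:.3f}")
print(f"adjusted S/O: {adjusted:.3f}")
```

With these placeholder inputs the adjusted ratio falls well below the raw one, which is the direction of the effect described above; the real magnitude depends on the actual corpus sizes.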

I kind of wonder if From headers are a good indicator today anyway. Most of my
spam shows a forged sender. The increasing problem of backscatter supports
this.

+1 for seriously down-scoring FROM_DOMAIN_NOVOWEL, if we keep it at all.


Let's just hope GMX uses sa-update. Ironically, a German company. If they
don't, I'm afraid it'll take quite a few GMX users complaining to gently
massage the message from front-line support down to the tech staff.

(GMX themselves evaded this rule, FWIW, using gmx-gmbh.de with a hyphen. Doh!)
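
A small sketch of why a hyphen could matter here. It assumes the rule fires on a long unbroken run of consonants in the From domain; the pattern and the run length of 7 are guesses for illustration, not the actual FROM_DOMAIN_NOVOWEL definition, and the unhyphenated domain is a hypothetical spelling.

```python
import re

# ASSUMPTION: a stand-in pattern, not the real rule. It looks for 7 or
# more consecutive consonant letters ("y" is treated as a vowel here).
ASSUMED_RUN = re.compile(r"[bcdfghjklmnpqrstvwxz]{7,}", re.IGNORECASE)

def domain_of(address):
    """Return the part after the last '@' of an address."""
    return address.rsplit("@", 1)[-1]

def has_long_consonant_run(address):
    """True if the domain contains an unbroken consonant run."""
    return ASSUMED_RUN.search(domain_of(address)) is not None

# The hyphen splits the letters into "gmx" and "gmbh", so neither
# run reaches the assumed length.
hyphenated = has_long_consonant_run("someone@gmx-gmbh.de")

# Hypothetical unhyphenated spelling: "gmxgmbh" is one 7-letter run.
unhyphenated = has_long_consonant_run("someone@gmxgmbh.de")

print(hyphenated, unhyphenated)
```

Under this assumed pattern only the unhyphenated form matches, even though neither domain name contains a vowel.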

