http://issues.apache.org/SpamAssassin/show_bug.cgi?id=5110





------- Additional Comments From [EMAIL PROTECTED]  2006-12-05 07:59 -------
Hrm.  Interesting results:

 22.807  25.2886   0.1464    0.994   1.00    1.00  EXTRA_MPART_TYPE2
 26.188  29.0237   0.2928    0.990   0.67    1.00  EXTRA_MPART_TYPE3
 19.882  22.0423   0.1464    0.993   0.00    0.85  EXTRA_MPART_TYPE

In order:
- content-type includes /\btype=/i
- content-type starts with multipart/related
- original

So the diff between the original and #2 is probably that the original looks for
" type=" and #2 will accept ";type=", so that's an easy win.

I just threw in #3 because I was curious.  Essentially what this means is that
for my corpus, at least, multipart/related is very likely to be spam (and all of
the #2 hits also hit #3).  Interestingly, the difference between 2 and 3 are
people ignoring RFC 2387. :(

So part of me wants to just use #3.  While the 2x ham rate looks daunting, it
really means that instead of 2 ham hits, it was 4 ham hits.

Thoughts?



------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

Reply via email to