On Wed, 20 Jan 2016 22:21:49 -0800
Marc Perkel <supp...@junkemailfilter.com> wrote:

> Here is a list of 5505874 words and phrases used in the subject line
> of HAM and never seen in the subject line of SPAM

> Here is a list of 3494938 words and phrases used in the subject line
> of SPAM and never seen in the subject line of HAM

[snip]

And what, exactly, is your point?  Bayes would handle that just fine.
Tokens in your first list would score 0.00 for spam probability and
tokens in your second list would score 1.00 and Bayes would be great.

Regards,

Dianne.

Reply via email to