Re: Bayes DB's

Daryl C. W. O'Shea 6 Dec 2004 17:16:05 -0000

Gray, Richard wrote:

Surely that would only happen if there were equal amounts of Spam and
ham passing through. Otherwise the token will have a tendency toward
whichever the program has seen more of.


________________________________

From: Loren Wilton [mailto:[EMAIL PROTECTED]

>

Assuming that the same header values appear in both spam and ham, I'd expect that Bayes would conclude the token was useless for classification and ignore it. Loren

Is there a reason why the trusted received headers would make good bayes tokens? I can't think of any.

I can see the value in the first untrusted received header (which would be YOUR first server that the message hits), since you could tokenize data such as the received 'Date:' (and time of day) along with which MX the message came in on (most mail to a lower preference MX will be spam... but that could cause problems if your preferred MXes go down).


Daryl

Re: Bayes DB's

Reply via email to