On 7/26/2010 2:46 PM, andrij wrote: > Bowie Bailey wrote: >>>>> 3) Evaluating whether an email is spam or not. Does the bayes >>>>> classifier >>>>> analyze headers if I have, for example, the following rule: "body >>>>> BAYES_05 >>>>> eval:check_bayes('0.00', '0.05')". According to the >>>>> http://wiki.apache.org/spamassassin/WritingRules : "Body rules also >>>>> include >>>>> the Subject as the first line of the body content". So, any headers >>>>> that >>>>> precede subject header are not considered by the bayes classifier? >>>> I don't have an answer for you here, but just another question. Why do >>>> you want to mess with the bayes rules? >>> Maybe I am mistaken, but what is the sense to train the bayes classifier >>> on >>> headers if headers (at least those that precede a subject header) are not >>> considered during the spam detection phase? >> Bayes learns based on the entire message -- headers and all. >> (Otherwise, what would be the point of the bayes_ignore_header option?) >> >> I can see where you might get that impression by looking at the rule, >> but if I understand it correctly, Bayes has already been run and the >> rule is just checking the result. > Thank you for the clarifying. The word "body" at the begining of the rule > confused me. So, in general it does not matter what word ("body" or > "header") is put there -- the Bayes clasifier analyzes both headers (except > those introduced by bayes_ignore_header) and body during both learning and > scoring phases. Right?
Right. -- Bowie