On 7/26/2010 2:46 PM, andrij wrote:
> Bowie Bailey wrote:
>>>>> 3) Evaluating whether an email is spam or not. Does the bayes
>>>>> classifier
>>>>> analyze headers if I have, for example, the following rule: "body
>>>>> BAYES_05
>>>>> eval:check_bayes('0.00', '0.05')". According to the
>>>>> http://wiki.apache.org/spamassassin/WritingRules : "Body rules also
>>>>> include
>>>>> the Subject as the first line of the body content". So, any headers
>>>>> that
>>>>> precede subject header are not considered by the bayes classifier?
>>>> I don't have an answer for you here, but just another question.  Why do
>>>> you want to mess with the bayes rules?
>>> Maybe I am mistaken, but what is the sense to train the bayes classifier
>>> on
>>> headers if headers (at least those that precede a subject header) are not
>>> considered during the spam detection phase?
>> Bayes learns based on the entire message -- headers and all. 
>> (Otherwise, what would be the point of the bayes_ignore_header option?)
>>
>> I can see where you might get that impression by looking at the rule,
>> but if I understand it correctly, Bayes has already been run and the
>> rule is just checking the result.
> Thank you for the clarifying. The word "body" at the begining of the rule
> confused me. So, in general it does not matter what word ("body" or
> "header") is put there -- the Bayes clasifier analyzes both headers (except
> those introduced by bayes_ignore_header) and body during both learning and
> scoring phases. Right?

Right.

-- 
Bowie

Reply via email to