DB tokens expiration

2010-08-02 Thread andrij
Hi all, after I trained the bayes classifier with several thousands of e-mails I run sa-learn --dump magic and I got the following: 0.000 0 3 0 non-token data: bayes db version 0.000 0 5367 0 non-token data: nspam 0.000 0 3792

Bayes scoring

2010-08-02 Thread andrij
Hi all, I run the bayes classifier on more than 4500 e-mails. All (except of cca 100 e-mails) contained test=BAYES_*. Does anybody have any idea why these 100 e-mails were not scored by the bayes classifier? At http://www.paulgraham.com/spam.html, it is written that When new mail arrives, it is

Re: Bayes scoring

2010-08-02 Thread andrij
Daniel Lemke wrote: andrij wrote: I run the bayes classifier on more than 4500 e-mails. All (except of cca 100 e-mails) contained test=BAYES_*. Does anybody have any idea why these 100 e-mails were not scored by the bayes classifier? Do you have any shortcircuit enabled? No. I

Re: RelayCountry plugin

2010-07-29 Thread andrij
RW-15 wrote: Does Bayes learn the tokens from the X-Spam-Relay-Country header? Contrary to popular belief, the country codes are not used by Bayes. I think that it does not, because all headers X-Spam- are removed before learning, right? That's not the reason. The plugin does make

RelayCountry plugin

2010-07-28 Thread andrij
Hi all, I am playing with RelayCountry plugin. I have a small database of e-mails. I processed these emails with RelayCountry plugin, so every email contains X-Spam-Relay-country header (and corresponding countries). Now I want to train Bayes with these emails. Does Bayes learn the tokens

Bayes classifier

2010-07-26 Thread andrij
Hi all, I am new to spamassassin and bayes classifier. I have several questions and I will greatly appreciate your help with that. 1) Training of the bayes classifier with _multipart_ e-mails (e.g., an e-mail contains other e-mails within its body). If I set bayes_ignore_header Some-header,

Re: Bayes classifier

2010-07-26 Thread andrij
2) Evaluating whether an email is spam or not. Again, if I set bayes_ignore_header Some-header, will the bayes classifier ignore the header while evaluating an e-mail? Yes. That's what it's for. So, the bayes clasifier will ignore Some-header in both learning and spam detection

Re: Bayes classifier

2010-07-26 Thread andrij
Bowie Bailey wrote: 3) Evaluating whether an email is spam or not. Does the bayes classifier analyze headers if I have, for example, the following rule: body BAYES_05 eval:check_bayes('0.00', '0.05'). According to the http://wiki.apache.org/spamassassin/WritingRules : Body rules also