From: Arvid Ephraim Picciani <a...@exys.org>
   Date: Wed, 25 Mar 2009 16:59:58 +0100
   
   http://codepad.org/W53onqK9
   
   i gave on this kind of spam.  its impossible to train bayes and changing 
   to fast to make custom rules. ...
   
What do you mean "its impossible to train bayes"?
Bayes really can be trained to deal with this message.
For example, I get BAYES_95:

  wget -O - -q http://codepad.org/W53onqK9/raw.txt | spamc | /bin/fgrep --text 
X-Spam-Bayes
  X-Spam-Bayes: bayes=0.9679, N=50(29-2+11), ham=(sort, doing), 
spam=(UD:spaces.live.com, UD:live.com, UD:entry, dawn, 
HX-Mozilla-Status2:00000000)

After I learn this message the probability increases to BAYES_99

  % wget -O - -q http://codepad.org/W53onqK9/raw.txt | sa-learn --spam
  Learned tokens from 1 message(s) (1 message(s) examined)
  % sa-learn --sync
  % wget -O - -q http://codepad.org/W53onqK9/raw.txt | spamc | /bin/fgrep 
--text X-Spam-Bayes
  X-Spam-Bayes: bayes=1.0000, N=50(47-2+29), ham=(sort, doing), 
spam=(UD:spaces.live.com, UD:live.com, UD:entry, dawn, 
HX-Mozilla-Status2:00000000)

Note that Bayes has determined that UD:spaces.live.com is a spam sign.

The X-Spam-Bayes header is added with
  add_header all Bayes bayes=_BAYES_, 
N=_BAYESTC_(_BAYESTCLEARNED_-_BAYESTCHAMMY_+_BAYESTCSPAMMY_), 
ham=(_HAMMYTOKENS(5,short)_), spam=(_SPAMMYTOKENS(5,short)_)

-jeff

Reply via email to