Hello Aleksander,

Tuesday, February 10, 2004, 3:57:16 AM, you wrote:

AA> I've just wanted to share my rule which is quite effective against spam
AA> that contains phrases in the type of
AA> "100 purcant M0ney Back G.u.a.r.a.n.t.e.e" or "|F|R|E|E| B0ttle W0rth
AA> Alm0st 60 d0llars!":

AA> rawbody PIPE_DOT_PUNCTUATION            /([a-z][|.]+[a-z].{1,2}){2}/i
AA> describe PIPE_DOT_PUNCTUATION           Body contains punctuation-obfuscated text
AA> lang pl describe PIPE_DOT_PUNCTUATION   Tresc zawiera tekst znieksztalcony znakami interpunkcyjnymi
AA> score PIPE_DOT_PUNCTUATION              2.5 2.2 2.0 2.0

AA> It catches text where individual letters or pairs of letters are 
AA> separated with a pipe or dot character.
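
To see what the expression itself matches, here is a minimal sketch in Python. It only exercises the regex from the quoted rule with the /i flag mapped to re.IGNORECASE; it does not reproduce how SpamAssassin feeds rawbody text to a rule. The two spam strings come from the original post. The "hypothetical ham" string is my own made-up example, not a message from any corpus, and the names PIPE_DOT, samples and hits are just for illustration.

```python
import re

# Same expression as the quoted rawbody rule; /i becomes re.IGNORECASE.
PIPE_DOT = re.compile(r"([a-z][|.]+[a-z].{1,2}){2}", re.IGNORECASE)

samples = {
    # Both spam phrases are taken from the original post.
    "spam 1": "100 purcant M0ney Back G.u.a.r.a.n.t.e.e",
    "spam 2": "|F|R|E|E| B0ttle W0rth Alm0st 60 d0llars!",
    # Assumption: an invented ham-like sentence with two dotted abbreviations.
    "hypothetical ham": "We have offices in the U.S. U.K. and Canada.",
    # Ordinary text with no letter-dot-letter sequence at all.
    "plain text": "Nothing obfuscated in this sentence.",
}


def hits(text):
    """Return True if the pattern matches anywhere in the text."""
    return PIPE_DOT.search(text) is not None


for label, text in samples.items():
    print(f"{label:18} {'HIT' if hits(text) else 'miss'}")
```

Under these assumptions the two spam phrases and the abbreviation sentence all match, while the plain sentence does not, so adjacent dotted abbreviations are one way ordinary text can trigger the pattern.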

Results here:

Section 3 -- Frequencies Log
(First numeric frequencies, followed by percentage frequencies)

OVERALL     SPAM      HAM     S/O    RANK   SCORE  NAME
  91185    73148    18037    0.802   0.00    0.00  (all messages)
  12152     8907     3245    0.404   0.00   2.50  PIPE_DOT_PUNCTUATION

OVERALL%   SPAM%     HAM%     S/O    RANK   SCORE  NAME
  91185    73148    18037    0.802   0.00    0.00  (all messages)
100.000  80.2193  19.7807    0.802   0.00    0.00  (all messages as %)
 13.327  12.1767  17.9908    0.404   0.00    2.50  PIPE_DOT_PUNCTUATION

 In other words, the rule hits a higher percentage of the ham in my corpus
 (about 18.0%) than of the spam (about 12.2%), which is why its S/O is only
 0.404.
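
The percentages and the S/O figure can be reproduced from the raw counts in the log. The sketch below assumes S/O is computed as spam% / (spam% + ham%), which agrees with the 0.404 shown for this rule; the function names pct and s_o are mine, not part of any tool.

```python
# Raw counts copied from the frequencies log above.
SPAM_TOTAL, HAM_TOTAL = 73148, 18037
SPAM_HITS, HAM_HITS = 8907, 3245


def pct(hits, total):
    """Percentage of messages in one class that the rule hit."""
    return 100.0 * hits / total


def s_o(spam_pct, ham_pct):
    """Assumed S/O definition: spam% / (spam% + ham%)."""
    return spam_pct / (spam_pct + ham_pct)


spam_pct = pct(SPAM_HITS, SPAM_TOTAL)
ham_pct = pct(HAM_HITS, HAM_TOTAL)
ratio = s_o(spam_pct, ham_pct)

print(f"SPAM% = {spam_pct:.4f}")
print(f"HAM%  = {ham_pct:.4f}")
print(f"S/O   = {ratio:.3f}")
```

An S/O below 0.5 means that, relative to the size of each class, the rule fires more often on ham than on spam.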

