> Following discussions on this list about obfuscating words to avoid spam
> detection, and not being a ninja, I'd like some feedback about the
> possible efficacy or pitfalls on rules like the following.
[snip]

In general, there are three main ways of dealing with these obfuscations:
1.  Hand-crafted rules looking for the generally expected variants; usually
on a phrase rather than a word.
2. Chris's Obfu generator that generates and exhaustive (and exhausing, if
you try to read the result ;-) regex to catch just about any variation on
obfuscation on a word or phrase, and
3. Tripwire and related rules that will very often end up triggering pretty
heavily on the more creative obfuscations.
4. And then there is SURBL, that renders all the previous pretty moot after
the first hour or two of a new spam target domain.

As a slight subject change, there is another form of obfuscation in the wild
that tends to escape all except the SURBL test.  I'm actually rather fond of
these spams, since I can always get a good belly laugh from whatever the
spam generator managed to come up with.  I'm guessing that these are
generated by a tool that takes a phrase and then does a thesaurus lookup on
each word, with a *very* creative thesarus.  Below is an edited sample of
one such.  The message appears twice, with modifications.  Once in the text
part of the spam, once in the html part:

------ begin spam --------
These pills are only similar normal lozenges but they
are specially formulated to be soft and dissolvable
under the glossa. The tablets is sorbed at the oral fissure
and gets in the bloodstream directly alternatively of rising
through with the tummytum. This effects in a quicker much more
powerful outcome which run up to 35 hours!

Our tablets are simply equal usual lozenges but they <BR>
are specially formulated to be pliant and soluble<BR>
below the clapper. The tablets is sorbed at the oral cavity<BR>
and gets into the bloodstream straight alternatively of progressing<BR>
through with the tummytum. This results in a faster more<BR>
strong result which yet up to 39 hours!<BR>
------------ end spam ------------

        Loren

Reply via email to