On 5/10/2005 12:31, Martin Blapp wrote:

> We use a specific ruleset against those 'ASCII artists', the
> 
> rawbody               __SMALL_FONT    /font-size:[\s\t 
> ]{1,3}(?:1|2)(?:px|pt|;)/i

Looks quite good  :)

> We also look for different gaps between chars

I'm interested if things like source code might trigger some of these
unnecessarily?  Also the amount of processing time might be worrisome?

> http://antispam.imp.ch/rules/asciispam.cf
> 
> Maybe it is useful for you.

Definitely gives me some interesting ideas in other areas.  In particular,
the unusual char combinations seem quite useful (maybe it's high time to
find a statistical analysis on an english dictionary).  :)  Interestingly
enough I don't really get any ascii art spams here but a lot of your
techniques look promising.  I guess every site has their own most common
"types" of ham and spam so the ability to customize is really important.

Cheers,

~Jason

-- 
_______________________________________________
Visit http://www.mimedefang.org and http://www.canit.ca
MIMEDefang mailing list
MIMEDefang@lists.roaringpenguin.com
http://lists.roaringpenguin.com/mailman/listinfo/mimedefang

Reply via email to