On Thursday, August 19, 2004, 10:45:37 PM, Jorge wrote:

JA> Could a filter be created that will tag as spam any messages that
JA> contaning NON-ascii characters? I mean allow only CHRS 1 through 255.

JA> I believe this fill filter out all these foreign character sets, and let
JA> through regular old and plain messages through...

JA> Of course such a rule will only apply for most of us on the western
JA> hemisphere...

In theory this could be done, but it would be a tricky gadget -
probably best done as something programatic... There are a lot of
opportunities for false positives.

I will think about this...

Then again - why not simply block on anything that says charset= ? If
it's plain old ascii, then there's no need for charset. (Lots of FPs
with this, but then I would never use a filter like that... It might
be very close to what you are looking for.

The other way to do it would be to build patterns that match all of
the known character sets -- or at least the majority. That would be a
chunk of work but doable - especially with a few well placed
wildcards and a good comprehensive list.

_M



This E-Mail came from the Message Sniffer mailing list. For information and 
(un)subscription instructions go to 
http://www.sortmonster.com/MessageSniffer/Help/Help.html

Reply via email to