On Sun, 5 Nov 2006, Robert Nicholson wrote: > I'm looking to run SA on some mailing list mail that's constantly > getting bombarded by asian spam. > > So I would like the check to be as efficient as possible such that > as soon as I know the mail has asian character sets or unwanted > language I don't want it to check any more. Any chance I can short > circuit the additional checking once I know the mail is likely to > be asian or unwanted language?
Don't limit yourself to options within SA if you can do MTA checks. As a slash-and-burn tactic you can RBL large regions of asia (using e.g. cn.countries.nerd.dk, tw.countries.nerd.dk) Of course, that won't catch asian-language messages originating elsewhere, and isn't language-sensitive. You might also want to take a look at milter-regex. Here's what I use on my MTA: reject "Sorry - only English spoken here" header /Subject/i /=[?](KOI8-[RU]|GB2312|GB2312_CHARSET|ISO-2022-JP|SHIFT[-_]JIS|BIG5|WINDOWS-125[15])[?][QB][?]/ie header /Subject/i /charset=(3D)?"?(KOI8-[RU]|GB2312|GB2312_CHARSET|ISO-2022-JP|SHIFT[-_]JIS|BIG5)/ie header /Subject/i /[?-?]{6}/e header /Content-Type/i ,text/(plain|html); *charset="?(KOI8-[RU]|GB2312(_CHARSET)?|ISO-2022-JP|SHIFT[-_]JIS|BIG5),ie body ,Content-Type(: |" content=")text/(plain|html); charset="?(KOI8-[RU]|GB2312(_CHARSET)?|ISO-2022-JP|SHIFT[-_]JIS|BIG5),ie body ,http-equiv=3D"Content-Type" content=3D"text/(plain|html); charset=3D(KOI8-[RU]|GB2312|ISO-2|SHIFT|BIG5),ie -- John Hardin KA7OHZ ICQ#15735746 http://www.impsec.org/~jhardin/ [EMAIL PROTECTED] FALaholic #11174 pgpk -a [EMAIL PROTECTED] key: 0xB8732E79 - 2D8C 34F4 6411 F507 136C AF76 D822 E6E6 B873 2E79 ----------------------------------------------------------------------- If someone has a gun and is trying to kill you, it would be reasonable to shoot back with your own gun. -- the Dalai Lama, May 15, 2001 ----------------------------------------------------------------------- 2 days until the campaign ads stop