[SAtalk] Umlaut on subjects

2003-11-11 Thread Leonardo H. Machado

Hi,

is there a way to block subjects with Umlauted Letters like:

ö ü ä (sorry if it's messed up after mailing)

The spammers are Umlauting words like Viägra. Those Umlauts are quite
difficult to detect. They get encoded with an ISO charset, or something
like that.

My language has no Umlauts, so it's safe to block them in
subjects.

What would a rule to catch Umlauts in the subject look like?

/ISO?/




//leoh
main(){int j=1234;char t[]=":@abcdefghijklmnopqrstuvwxyz.\n"
,*i = "iqgbgxmlvivuc\n:wwnfwsdoi"; char *strchr(char *,int);
while(*i){j+=strchr(t,*i++)-t;j%=sizeof t-1;putchar(t[j]);}}



---
This SF.Net email sponsored by: ApacheCon 2003,
16-19 November in Las Vegas. Learn firsthand the latest
developments in Apache, PHP, Perl, XML, Java, MySQL,
WebDAV, and more! http://www.apachecon.com/
___
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk


Re: [SAtalk] Umlaut on subjects

2003-11-11 Thread Robert Menschel
Hello Leonardo,

Sunday, November 9, 2003, 12:38:40 PM, you wrote:

LHM> is there a way to block subjects with Umlauted Letters like:
LHM> ö ü ä (sorry if it's messed up after mailing)

LHM> What would a rule to catch Umlauts in the subject look like?
LHM> /ISO?/

Does this work?
header   RM_sl_ForeignChar  Subject =~ /\w[äëöü]\w/
describe RM_sl_ForeignChar  Subject contains foreign character apparently embedded within a word
score    RM_sl_ForeignChar  2.15   # 47s/0h of 36402 corpus; 115s/0h of 39283 corpus
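
To see why a rule like this can catch "Viägra" even though the header
travels in encoded form, here is a rough Python sketch. It is only an
illustration, not how SpamAssassin works internally. It decodes an
RFC 2047 encoded Subject with the standard email.header module and then
applies the same character class as the rule above. The names
decoded_subject and has_embedded_umlaut are made up for this example, and
Python's \w is Unicode-aware, which may not match how Perl treats the
header in a given setup.

```python
import re
from email.header import decode_header, make_header

# Same character class as the RM_sl_ForeignChar rule: an umlaut with a
# word character on both sides.
EMBEDDED_UMLAUT = re.compile(r"\w[äëöü]\w")


def decoded_subject(raw_subject):
    """Decode RFC 2047 encoded-words into a plain Unicode string."""
    return str(make_header(decode_header(raw_subject)))


def has_embedded_umlaut(raw_subject):
    """True if the decoded subject has an umlaut inside a word."""
    return EMBEDDED_UMLAUT.search(decoded_subject(raw_subject)) is not None


if __name__ == "__main__":
    samples = (
        "=?iso-8859-1?Q?Vi=E4gra?=",  # umlaut in the middle of a word
        "=?iso-8859-1?Q?=FCber?=",    # umlaut at the start of a word
        "Meeting notes",              # plain ASCII subject
    )
    for subject in samples:
        print(repr(decoded_subject(subject)), has_embedded_umlaut(subject))
```

Note that a word starting with an umlaut does not match, because the
pattern wants a word character on each side. That keeps the check
focused on umlauts dropped into the middle of a word.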

or this?
header   RM_sx_USAscii  Subject:raw =~ /us-ascii/i
describe RM_sx_USAscii  Subject specifies display in US-ascii, unnecessary unless spam hides subject
score    RM_sx_USAscii  0.90   # 27s/2h of 39283 corpus, ham = MS Passport.com
header   RM_sx_KS5601   Subject:raw =~ /\=\?ks_c_5601\-1987\?/i
describe RM_sx_KS5601   Subject specifies display in Korean?, unnecessary unless spam hides subject
score    RM_sx_KS5601   0.40   # 4s/0h of 39283 corpus
header   RM_sx_iso8859  Subject:raw =~ /iso-8859-1/i
describe RM_sx_iso8859  Subject specifies display in ISO-8859
score    RM_sx_iso8859  1.298  # 1013s/33h of 39283 corpus
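
The Subject:raw rules above look at the header before decoding, where
the charset is spelled out inside each encoded-word
(=?charset?encoding?text?=, per RFC 2047). As a rough illustration of
what those rules are keying on, here is a Python sketch that pulls the
declared charset labels out of a raw Subject. The regex and the name
declared_charsets are assumptions made for this example, not anything
SpamAssassin ships.

```python
import re
from typing import List

# An RFC 2047 encoded-word: =?charset?encoding?encoded-text?=
# where encoding is B (base64) or Q (quoted-printable-like).
ENCODED_WORD = re.compile(r"=\?([^?\s]+)\?([bBqQ])\?[^?\s]*\?=")


def declared_charsets(raw_subject: str) -> List[str]:
    """Return the lower-cased charset labels found in a raw Subject."""
    return [m.group(1).lower() for m in ENCODED_WORD.finditer(raw_subject)]


if __name__ == "__main__":
    for subject in (
        "=?iso-8859-1?Q?Vi=E4gra?=",
        "=?KS_C_5601-1987?B?x9E=?= sale",
        "Plain subject",
    ):
        print(subject, "->", declared_charsets(subject))
```

A plain ASCII subject declares no charset at all, which is why a
subject that bothers to declare one can be a usable signal for anyone
who never expects such mail.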

Bob Menschel



