Sorry for replying to my own message, but that should have read: http://sandgnat.com/cmos/cmos.jsp?sourceRules=body+CUM+/\bcum\b/
(http, not https) Chris Thielen said: > Lyle Evans said: >> At 10:52 AM 11/10/03, Jason wrote: >> >>>... >>> >>>I was thinking today wouldn't it be better to just ignore all the >>> periods, >>>commas, and what have you in the text? Inside SA we could just drop >>> those >>>and then search the message from that. >>> >>>I've had one spammer who just puts a random period in the message and it >>>doesn't get tagged. Taking out all the periods in the message and it >>>scored a 10.6 just from the body of the message. >> >> Yes I am being hit hard by he same type of spam. >> >> While I think chasing variants is in general a losing battle >> and that instead more general rules are needed such as ones to eliminate >> internal periods, I just did a quick and dirty rule: >> >> body LE_bp_Naughtydot / CU\.M/i >> describe LE_bp_Naughtydot Body Naughty word with inserted dot >> score LE_bp_Naughtydot 2.85 >> >> The score is more or less arbitrary. The logic is tackle the >> the short words that can't have as many variants. >> A N letter word can have N-1 internal single dot variants. >> Suggestions for improvement strongly encouraged. > > Try this: > > https://sandgnat.com/cmos/cmos.jsp?sourceRules=body+CUM+/\bcum\b/ > > tweak as necessary.. I'm still trying to track down some false positives I > have as the generated rules catch many permutations. ------------------------------------------------------- This SF.Net email sponsored by: ApacheCon 2003, 16-19 November in Las Vegas. Learn firsthand the latest developments in Apache, PHP, Perl, XML, Java, MySQL, WebDAV, and more! http://www.apachecon.com/ _______________________________________________ Spamassassin-talk mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/spamassassin-talk