Lyle Evans said:
> At 10:52 AM 11/10/03, Jason wrote:
>
>>...
>>
>>I was thinking today wouldn't it be better to just ignore all the
>> periods,
>>commas, and what have you in the text?  Inside SA we could just drop
>> those
>>and then search the message from that.
>>
>>I've had one spammer who just puts a random period in the message and it
>>doesn't get tagged.  Taking out all the periods in the message and it
>>scored a 10.6 just from the body of the message.
>
> Yes I am being hit hard by he same type of spam.
>
> While I think chasing variants is in general a losing battle
> and that instead more general rules are needed such as ones to eliminate
> internal periods, I just did a quick and dirty rule:
>
> body     LE_bp_Naughtydot   / CU\.M/i
> describe LE_bp_Naughtydot   Body Naughty word with inserted dot
> score    LE_bp_Naughtydot   2.85
>
> The score is more or less arbitrary. The logic is tackle the
> the short words that can't have as many variants.
> A N letter word can have N-1 internal single dot variants.
> Suggestions for improvement strongly encouraged.

Try this:

https://sandgnat.com/cmos/cmos.jsp?sourceRules=body+CUM+/\bcum\b/

tweak as necessary.. I'm still trying to track down some false positives I
have as the generated rules catch many permutations.




-------------------------------------------------------
This SF.Net email sponsored by: ApacheCon 2003,
16-19 November in Las Vegas. Learn firsthand the latest
developments in Apache, PHP, Perl, XML, Java, MySQL,
WebDAV, and more! http://www.apachecon.com/
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to