Yes image spam can be a real pain. I have just implemented a new mailserver and image spam is certainly on the increase :-
mysql> select count(*) from maillog; +----------+ | count(*) | +----------+ | 15091 | +----------+ 1 row in set (0.00 sec) mysql> select count(*) from maillog where spamreport like '%FUZZY_OCR%'; +----------+ | count(*) | +----------+ | 3438 | +----------+ 1 row in set (0.04 sec) mysql> select count(*) from maillog where spamreport like '%FUZZY_OCR_KNOWN_HASH%'; +----------+ | count(*) | +----------+ | 1070 | +----------+ 1 row in set (0.04 sec) On Fri, 23 Mar 2007 06:46:50 -0700, Marc Perkel <[EMAIL PROTECTED]> wrote: > Perhaps what I need to do is to get rid of autolearn and write my own > learning system that strips out the body of messages with images and > just learns the headers. My problem is that when users get image spam > they put it in the spam folders and they get learned. But the text in > the image spam causes ham type text to be learned as spam. That causes > ham to get higher scores. > > -- > This message has been scanned for viruses and dangerous content by > MailScanner, and is > believed to be clean. -- --[ UxBoD ]-- // PGP Key: "curl -s http://www.splatnix.net/uxbod.asc | gpg --import" // Fingerprint: 543A E778 7F2D 98F1 3E50 9C1F F190 93E0 E8E8 0CF8 // Keyserver: www.keyserver.net Key-ID: 0xE8E80CF8 // SIP Phone: [EMAIL PROTECTED] -- This message has been scanned for viruses and dangerous content by MailScanner, and is believed to be clean.