Back in April, Tony Meyer posted that he was receiving a lot of
image-based spam. I too am having nothing but trouble with embedded
images:
- Daily ads for fake Rolex watches
- Daily stock tips
- TONS of drugs for sale.
This style of Spam contains an image at the top, followed by a bunch
of totally unrelated text that has been copied from some kind of random
composition. I have very large Spam & Ham folders that I've
successfully trained SpamBayes with. It's only these image-based adverts
that sneak by EVERY DAY.
Mostly my SpamBayes catches ALL of these when anything gets this far...
Something really needs to be done about this type of Spam within
SpamBayes. Are any other Spam engines able to handle this stuff, by
scanning the image for text, or something?
Sure, there are others (as well as SpamBayes, if you just keep training it on EVERY ONE of them), but most of the others are either commercial (i.e., cost money) OR they run on the server (SpamAssassin, greylistd, and other filters).
There has been talk about filters which would explicitly do OCR or some other type of image content detection but I don't (personally) know of any that are working/available/effective right now.
Such would also likely be "resource (CPU) intensive".
FWIW, greylisting on the server knocks down practically all of this junk and SpamAssassin catches the rest.
The VERY occasional item that slips through our server is caught by SpamBayes. (Defense in depth is our key to ZERO spam -- with practically everything REJECTED, not bounced, at the server during SMTP connect time.)
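For anyone who has not run into greylisting, the idea can be sketched roughly as below. This is only an illustration of the general technique, not how greylistd or any particular server implements it: the `Greylist` class, the `min_delay` parameter, and the choice of reply codes 451 and 250 are assumptions made for the example. The first time a (client IP, sender, recipient) triplet shows up, the server answers with a temporary failure; a real mail server retries later and gets through, while much spamware never retries.

```python
import time


class Greylist:
    """Toy greylisting table keyed on (client IP, sender, recipient).

    Hypothetical sketch: names and defaults are illustrative only.
    """

    def __init__(self, min_delay=300, clock=time.time):
        self.min_delay = min_delay  # seconds a sender must wait before a retry counts
        self.clock = clock          # injectable clock, so the logic can be tested
        self.first_seen = {}        # triplet -> time of first delivery attempt

    def check(self, client_ip, sender, recipient):
        """Return an SMTP-style reply code: 451 (try again later) or 250 (accept)."""
        key = (client_ip, sender.lower(), recipient.lower())
        now = self.clock()
        first = self.first_seen.get(key)
        if first is None:
            # Unknown triplet: remember it and ask the sender to retry.
            self.first_seen[key] = now
            return 451
        if now - first < self.min_delay:
            # Retried too quickly: keep deferring.
            return 451
        # Retried after the minimum delay: behaves like a real mail server.
        return 250
```

The decision is made while the SMTP conversation is still open, which is why the junk gets REJECTED rather than bounced.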
And some of us DO WISH to get graphical email
-- pictures of my grandkid(s) frequently arrive this
way.
--
Herb Martin
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of [EMAIL PROTECTED]
Sent: Sunday, October 02, 2005 1:53 PM
To: [email protected]
Subject: [Spambayes] SpamBayes to Handle Embedded Images
_______________________________________________ [email protected] http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html
