My thoughts on image spam strategies

John Rudd Tue, 01 Aug 2006 18:08:33 -0700


1) Use Martin Blapp's OCR plugin/patch for SA, and feed the extracted text to Bayes.
  http://antispam.imp.ch/patches/patch-ocrtext

2) To combat the "images with subtle differences", develop a checksum method that ignores the lower bits (3 or 4 out of 8?) of each color channel. That way you get what is essentially a very high contrast image, washing out the subtle variations. Crop that down to remove all white border area, checksum it, and compare the checksum to a database of known spam images that have been similarly altered. (Which would then suggest: someone developing a razor-like database of image checksums; it'd be nice if the return was a confidence percentage.)

(If the alteration leaves an image that is 0x0 pixels (because it became all white) or all one color, then it might be worth flagging it with a decent confidence percentage, as it was composed entirely of subtle variations from a base color, which I would find suspicious.)
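
To make idea 2 a bit more concrete, here is a rough Python sketch of the quantize, crop, and checksum steps. It is only an illustration under my own assumptions, not an existing SA plugin: the function names (quantize, crop_border, fuzzy_checksum), the choice of SHA-1, the default of 4 dropped bits, and the in-memory image format (a list of rows of (r, g, b) tuples, 8 bits per channel) are all hypothetical. It crops before hashing so that the border does not affect the checksum.

```python
import hashlib

def quantize(pixels, drop_bits=4):
    """Zero the low `drop_bits` bits of every 8-bit channel value."""
    mask = 0xFF & ~((1 << drop_bits) - 1)
    return [[tuple(c & mask for c in px) for px in row] for row in pixels]

def crop_border(pixels, border):
    """Remove outer rows and columns made up only of the `border` color."""
    rows = [y for y, row in enumerate(pixels) if any(px != border for px in row)]
    if not rows:
        return []  # nothing but border color: a 0x0 result
    width = len(pixels[0])
    cols = [x for x in range(width) if any(row[x] != border for row in pixels)]
    top, bottom = rows[0], rows[-1]
    left, right = cols[0], cols[-1]
    return [row[left:right + 1] for row in pixels[top:bottom + 1]]

def fuzzy_checksum(pixels, drop_bits=4):
    """Return (hex digest, "ok"), or (None, reason) for suspicious images."""
    mask = 0xFF & ~((1 << drop_bits) - 1)
    # White (255, 255, 255) becomes (mask, mask, mask) after quantizing.
    cropped = crop_border(quantize(pixels, drop_bits), (mask, mask, mask))
    if not cropped:
        return None, "empty"          # the whole image washed out to white
    if len({px for row in cropped for px in row}) == 1:
        return None, "single-color"   # only subtle variations of one color
    digest = hashlib.sha1()
    # Include the dimensions so different shapes never hash the same bytes.
    digest.update(f"{len(cropped[0])}x{len(cropped)}".encode())
    for row in cropped:
        for px in row:
            digest.update(bytes(px))
    return digest.hexdigest(), "ok"
```

Two copies of an image that differ only in the low bits should get the same digest, which could then be looked up in the shared database. One caveat with plain bit masking: channel values that sit right at a bucket boundary (say 15 versus 16) still land in different buckets, so some variations would slip through.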
