>
> On Aug 2, 2006, at 3:03 AM, Matthias Keller wrote:
>
>>>> will it not be much faster just to make a md5 sum on the image file
>>>> without
>>>> thinking if it a appel or orange ? :-)
>>> Yes, but just taking a straight sum will be sensitive to all of those
>>> small pixels which are changed by the spammers so that they have
>>> different sums, but the differences aren't visible to the human eye.
>>>
>>> That's my point.  If you drop out the lower bits of the colors, then
>>> you mostly retain what is perceptible (in color ranges) to the human,
>>> while losing those parts that a) the human wouldn't have noticed
>>> anyway, and b) throw off your sum of the image for comparison to
>>> known spam images.
>> Hi
>>
>> You're idea is kinda interesting, but what would you do about a pic
>> with white background, black font and some random black noise on it ?
>
>
> Yeah, my strategy fights hidden pixel variations, but not overt ones.
> making the image actually appear grainy/noisy to the human eye, with
> different grain/noise for each spam, still gets past my strategy.
>
>
Maybe I'm not getting the obvious, but what about using something like
Perl::Magick to convert a given image into B/W? I mean, ImageMagick is
made for things like that...
Shrinking it to, say, a quarter of it's original size would take care of
at least many random noise pixels.

Dirk

Reply via email to