Re: [Mimedefang] Image validator/OCR SA plugin

2006-04-23 Thread David F. Skoll
Martin Blapp wrote: > Hmm, the SA and rules du jour stock and obfu rules suck ;-) Beside that, > I also match some words which are 100% legitimate. And the OCR words > are often truncated so one must match those too. But the real key is Bayes. Adding the OCR words to Bayes will be a real advanta

RE: [Mimedefang] Image validator/OCR SA plugin

2006-04-22 Thread Martin Blapp
Hi, be something to be gained by running the OCR scan from mimdefang? The idea would be to run the scan, and if sufficient text results (I'd hesitate to suggest that a quick spelling scan would be run on the result, but that is a possibility) that this text is written by MdF into a new text att

RE: [Mimedefang] Image validator/OCR SA plugin

2006-04-22 Thread Gary Funck
Late follow-up to this thread ... was wondering if there might be something to be gained by running the OCR scan from mimdefang? The idea would be to run the scan, and if sufficient text results (I'd hesitate to suggest that a quick spelling scan would be run on the result, but that is a possibili

Re: [Mimedefang] Image validator/OCR SA plugin

2006-04-19 Thread David F. Skoll
Nels Lindquist wrote: > As far as spammers obfuscating their images, couldn't that be worked > around by tying OCR into the bayesian system? I think the original idea was to obfuscate the images so people could read the text, but OCR tools wouldn't be able to. > Then obfuscation wouldn't matter

Re: [Mimedefang] Image validator/OCR SA plugin

2006-04-19 Thread Nels Lindquist
On 14 Apr 2006 at 18:42, Martin Blapp wrote: > This is just a little advertisement for my plugin which is now > in a usable state and works very well. > > Anyone interested should keep an eye on it - it really helps > with the image only spam we get today. But problably the spammers > will soon c

RE: [Mimedefang] Image validator/OCR SA plugin

2006-04-19 Thread Cormack, Ken
So far in my tests, this OCR plugin looks like it's working ok. I rounded up the needed prereqs (that was a bit of a chore, but everything compiled cleanly), and changed the package definition as indicated in Martin's post (be sure to run "spamassassin -D --lint"). So far I've seen several hits f

Re: [Mimedefang] Image validator/OCR SA plugin

2006-04-18 Thread Philip Prindeville
Dave Williss wrote: >- Original Message - >From: "Gary Funck" <[EMAIL PROTECTED]> >To: >Sent: Sunday, April 16, 2006 6:34 PM >Subject: RE: [Mimedefang] Image validator/OCR SA plugin > > > > >>Martin wrote: >> >> >

Re: [Mimedefang] Image validator/OCR SA plugin

2006-04-17 Thread Dave Williss
- Original Message - From: "Gary Funck" <[EMAIL PROTECTED]> To: Sent: Sunday, April 16, 2006 6:34 PM Subject: RE: [Mimedefang] Image validator/OCR SA plugin Martin wrote: But problably the spammers will soon change their tricks to different images which are more d

RE: [Mimedefang] Image validator/OCR SA plugin

2006-04-16 Thread Gary Funck
Martin wrote: > But problably the spammers > will soon change their tricks to different images which are more > difficult to read :-( > > http://antispam.imp.ch/patches/patch-ocrtext On this topic, Nick FitzGerald mentioned this article, http://www.jgc.org/blog/2006/01/do-spammers-fear-ocr.html S

RE: [Mimedefang] Image validator/OCR SA plugin

2006-04-15 Thread Gary Funck
> From: Martin Blapp > Sent: Friday, April 14, 2006 9:43 AM > > This is just a little advertisement for my plugin which is now > in a usable state and works very well. > > > http://antispam.imp.ch/patches/patch-ocrtext > Martin, this is a Very Good Thing, and has been needed for quite some t

RE: [Mimedefang] Image validator/OCR SA plugin

2006-04-15 Thread Paul Murphy
Martin, I installed your plugin for testing, but found that it would not load correctly on my system, giving the error: [5631] dbg: plugin: loading Mail::SpamAssassin::Plugin::ocrtext from @INC [5631] warn: plugin: failed to create instance of plugin Mail::SpamAssassin::Pl ugin::ocrtext: Can't lo

Re: [Mimedefang] Image validator/OCR SA plugin

2006-04-14 Thread Martin Blapp
# grep HTML_IMAGE_ONLY /var/log/maillog | wc -l 35834 This is wrong. It should have been # grep "HTML_IMAGE_ONLY.*hits=" /var/log/maillog | wc -l 17917 But almost 45% of all mails match HTML_IMAGE_ONLY, so it's unusable at all. I even use lower scores for those rules now - which gives m

Re: [Mimedefang] Image validator/OCR SA plugin

2006-04-14 Thread Martin Blapp
Interesting... What's the performance like with this? How many messages do you scan per day with it? It is rather fast. On a Pentium IV 3Ghz I can scan a average jpg/gif picture in 0,2 - 0,3 seconds. I've limited the scantime to 5 seconds per image, and I allow only three images to be scanne

Re: [Mimedefang] Image validator/OCR SA plugin

2006-04-14 Thread John Rudd
On Apr 14, 2006, at 9:42 AM, Martin Blapp wrote: Anyone interested should keep an eye on it - it really helps with the image only spam we get today. But problably the spammers will soon change their tricks to different images which are more difficult to read :-( I can see it now ... pretty s

Re: [Mimedefang] Image validator/OCR SA plugin

2006-04-14 Thread Richard Laager
On Fri, 2006-04-14 at 18:42 +0200, Martin Blapp wrote: > Anyone interested should keep an eye on it - it really helps > with the image only spam we get today. But problably the spammers > will soon change their tricks to different images which are more > difficult to read :-( Interesting... What's

RE: [Mimedefang] Image validator/OCR SA plugin

2006-04-14 Thread Matthew.van.Eerde
Martin Blapp wrote: > http://antispam.imp.ch/patches/patch-ocrtext That is unbelievably sweet. I remember a couple of years ago there was a virus that sent itself in a password-protected .zip file, with an image containing the password. OCR would have been useful... I could easily see MIMEDefa