Re: Is fuzzyocr i.e. Image scanning

2018-10-17 Thread Matus UHLAR - fantomas
On 16.10.18 18:42, RW wrote: Bayes might work, but I wouldn't like to see it added to body text because corrupted text could look like obfuscation. On Wed, 17 Oct 2018, Matus UHLAR - fantomas wrote: it should be pushed back to body text just for filters like bayes. The same could/should be do

Re: Is fuzzyocr i.e. Image scanning

2018-10-17 Thread David B Funk
On Wed, 17 Oct 2018, Rupert Gallagher wrote: IC is an effort to dig a hole in the water, because the problem of image spam with obfuscated text cannot be solved by ocr.  My approach is a "better safe than sorry" best practice that anyone can implement with existing software:  1. do not displ

Re: Is fuzzyocr i.e. Image scanning

2018-10-17 Thread John Hardin
On Wed, 17 Oct 2018, Matus UHLAR - fantomas wrote: On 16.10.18 18:42, RW wrote: Bayes might work, but I wouldn't like to see it added to body text because corrupted text could look like obfuscation. it should be pushed back to body text just for filters like bayes. The same could/should be do

Re: Is fuzzyocr i.e. Image scanning

2018-10-17 Thread Matus UHLAR - fantomas
>On Tue, 16 Oct 2018 11:49:54 +0700 Olivier wrote: >> One of my holdbacks with FuzzyOCR is that you have to provide an >> independent word list, while we have a very good tool to analyze >> text contents: SpamAssassin itself. So I would much prefer >> FuzzyOCR to feed the OCR'ed text back to SA for

Re: Is fuzzyocr i.e. Image scanning

2018-10-17 Thread Henrik K
On Wed, Oct 17, 2018 at 09:21:33AM +0700, Olivier wrote: > > That is the way I meant it, it's an AND, not an OR. I see FuzzyOCR as > just one more tool that can be added to SA. The problem is it's so inefficient. I've never seen image spam as a problem, mostly it hits other rules and MTA blocks

Re: Is fuzzyocr i.e. Image scanning

2018-10-16 Thread Rupert Gallagher
I see a vps and an ".expert" tld sender domain. My servers handle those with a REJECT rule. On Fri, Oct 12, 2018 at 15:11, Brent Clark wrote: > Good day Guys > > I am getting quite a bit of image spam, and googling put me in the > direction of a tool called FuzzyOCR. > > What I did was configur

Re: Is fuzzyocr i.e. Image scanning

2018-10-16 Thread Rupert Gallagher
My comments on http://pralab.diee.unica.it/en/ImageCerberus IC is an effort to dig a hole in the water, because the problem of image spam with obfuscated text cannot be solved by ocr. My approach is a "better safe than sorry" best practice that anyone can implement with existing software: 1.

Re: Is fuzzyocr i.e. Image scanning

2018-10-16 Thread Olivier
Hi, > > > One of my holdbacks with FuzzyOCR is that you have to provide an > > independent word list, while we have a very good tool to analyze text > > contents: SpamAssassin itself. So I would much prefer FuzzyOCR to feed > > the OCR'ed text back to SA for further analysis (the way pdfAssassin >

Re: Is fuzzyocr i.e. Image scanning

2018-10-16 Thread RW
On Tue, 16 Oct 2018 15:48:34 +0200 Matus UHLAR - fantomas wrote: > >On Tue, 16 Oct 2018 11:49:54 +0700 Olivier wrote: > >> One of my holdbacks with FuzzyOCR is that you have to provide an > >> independent word list, while we have a very good tool to analyze > >> text contents: SpamAssassin itself

Re: Is fuzzyocr i.e. Image scanning

2018-10-16 Thread Matus UHLAR - fantomas
On Tue, 16 Oct 2018 11:49:54 +0700 Olivier wrote: One of my holdbacks with FuzzyOCR is that you have to provide an independent word list, while we have a very good tool to analyze text contents: SpamAssassin itself. So I would much prefer FuzzyOCR to feed the OCR'ed text back to SA for further ana

Re: Is fuzzyocr i.e. Image scanning

2018-10-16 Thread RW
On Tue, 16 Oct 2018 11:49:54 +0700 Olivier wrote: > One of my holdbacks with FuzzyOCR is that you have to provide an > independent word list, while we have a very good tool to analyze text > contents: SpamAssassin itself. So I would much prefer FuzzyOCR to feed > the OCR'ed text back to SA for fur

Re: Is fuzzyocr i.e. Image scanning

2018-10-15 Thread Brent Clark
Olivier, Thank you *ever* so much for replying. Regards Brent On 2018/10/16 06:49, Olivier wrote: Brent, I have FuzzyOCR installed and running, but the only rule that was triggered 22 times during the past 40 days was FUZZY_OCR_WRONG_CTYPE, meaning that the image type does not match the conten

Re: Is fuzzyocr i.e. Image scanning

2018-10-15 Thread Olivier
Brent, I have FuzzyOCR installed and running, but the only rule that was triggered 22 times during the past 40 days was FUZZY_OCR_WRONG_CTYPE, meaning that the image type does not match the content-type set for MIME. That is still a valid catch, but not based on the OCR'ed text. One of my holdb

Re: Is fuzzyocr i.e. Image scanning

2018-10-15 Thread John Hardin
On Mon, 15 Oct 2018, Brent Clark wrote: Good day Guys I was fortunate that someone privately emailed me, but is there no one else who has anything they can share (it's not only for me, but for the community as a whole)? I'm sure there are others out there whose users are dealing with this nonsense.

Re: Is fuzzyocr i.e. Image scanning

2018-10-15 Thread Brent Clark
Good day Guys I was fortunate that someone privately emailed me, but is there no one else who has anything they can share (it's not only for me, but for the community as a whole)? I'm sure there are others out there whose users are dealing with this nonsense. Please share. Regards Brent On 2018/10

Re: Is fuzzyocr i.e. Image scanning

2018-10-12 Thread Brent Clark
Apologies for the subject. It was meant to read "Is fuzzyocr i.e. Image scanning, warranted in 2018" Regards Brent On 2018/10/12 15:11, Brent Clark wrote: Good day Guys I am getting quite a bit of image spam, and googling put me in the direction of a tool called FuzzyOCR. What

Is fuzzyocr i.e. Image scanning

2018-10-12 Thread Brent Clark
Good day Guys I am getting quite a bit of image spam, and googling put me in the direction of a tool called FuzzyOCR. What I did was configure Vagrant to install SpamAssassin and FuzzyOCR, and FuzzyOCR does not appear to be catching my spam (the provided example works). Before I go down the