RE: FuzzyOCR matches word "" -BROKE AGAIN - But reproducible

2007-01-03 Thread Nigel Kendrick
 
-----Original Message-----
From: Nigel Kendrick [mailto:[EMAIL PROTECTED]]
Sent: Wednesday, January 03, 2007 9:11 AM
To: users@spamassassin.apache.org
Subject: FuzzyOCR matches word ""

Hi,

I have just upgraded from FuzzyOCR 2.3b to the 3.4.2 devel by copying over
the .cf and .pm files, re-making my tweaks to the .cf file and
compiling/installing gifsicle. Following a restart of spamassassin,
everything is kinda working, but the debug log shows that FuzzyOCR is
finding matches for "":

[SNIP]



OK, so I added a couple of debug lines to check the parsing of the words
list, saved the original .pm file and put mine in place and everything
checked out OK. I put back the original .pm file and everything is still
working OK.

Not planning to take this much further - just happy it's working - but will
keep an eye on it.

Hmmm





Broke again!

Only happens when I turn on the hash database by setting
"focr_enable_image_hashing 2"

The db files are present and world-writable (for testing).

Double Hmmm




RE: FuzzyOCR matches word "" - FIXED!

2007-01-03 Thread Nigel Kendrick


-----Original Message-----
From: Nigel Kendrick [mailto:[EMAIL PROTECTED] 
Sent: Wednesday, January 03, 2007 9:11 AM
To: users@spamassassin.apache.org
Subject: FuzzyOCR matches word ""

Hi,

I have just upgraded from FuzzyOCR 2.3b to the 3.4.2 devel by copying over
the .cf and .pm files, re-making my tweaks to the .cf file and
compiling/installing gifsicle. Following a restart of spamassassin,
everything is kinda working, but the debug log shows that FuzzyOCR is
finding matches for "":

[SNIP]



OK, so I added a couple of debug lines to check the parsing of the words
list, saved the original .pm file and put mine in place and everything
checked out OK. I put back the original .pm file and everything is still
working OK.

Not planning to take this much further - just happy it's working - but will
keep an eye on it.

Hmmm





FuzzyOCR matches word ""

2007-01-03 Thread Nigel Kendrick
Hi,

I have just upgraded from FuzzyOCR 2.3b to the 3.4.2 devel by copying over
the .cf and .pm files, re-making my tweaks to the .cf file and
compiling/installing gifsicle. Following a restart of spamassassin,
everything is kinda working, but the debug log shows that FuzzyOCR is
finding matches for "":

[SNIP]
[8695] dbg: FuzzyOcr: Found word "target" in line
[8695] dbg: FuzzyOcr: "shotermprcetargetoo"
[8695] dbg: FuzzyOcr: with fuzz of 0. scanned with scanset $gocr -i $pfile
[8695] dbg: FuzzyOcr: Found word "target" in line
[8695] dbg: FuzzyOcr: "shotermprlcetargetoo"
[8695] dbg: FuzzyOcr: with fuzz of 0. scanned with scanset $gocr -l 180 -d 2 -i $pfile
[8695] warn: Use of uninitialized value in abs at /etc/mail/spamassassin/FuzzyOcr.pm line 1146.
[8695] dbg: FuzzyOcr: Found word "" in line
[8695] dbg: FuzzyOcr: "quantumeneryrinccorporateupdateqegyob"
[8695] dbg: FuzzyOcr: with fuzz of 0. scanned with scanset $gocr -i $pfile
[8695] warn: Use of uninitialized value in abs at /etc/mail/spamassassin/FuzzyOcr.pm line 1146.
[8695] dbg: FuzzyOcr: Found word "" in line
[8695] dbg: FuzzyOcr: "hotenergystocki"
[8695] dbg: FuzzyOcr: with fuzz of 0. scanned with scanset $gocr -i $pfile
[8695] warn: Use of uninitialized value in abs at /etc/mail/spamassassin/FuzzyOcr.pm line 1146.
[8695] dbg: FuzzyOcr: Found word "" in line
[8695] dbg: FuzzyOcr: "redytoriiibigmmmeriii"
[SNIP]
[8695] dbg: FuzzyOcr: Message is spam, score = 69.000
[8695] dbg: FuzzyOcr: Words found:
[8695] dbg: FuzzyOcr: "target" in 1 lines
[8695] dbg: FuzzyOcr: "" in 15 lines
[8695] dbg: FuzzyOcr: "stock" in 2 lines
[8695] dbg: FuzzyOcr: "rating" in 1 lines
[8695] dbg: FuzzyOcr: "company" in 1 lines
[8695] dbg: FuzzyOcr: "recommendation" in 1 lines
[8695] dbg: FuzzyOcr: "trading" in 1 lines
[8695] dbg: FuzzyOcr: "" in 15 lines
[8695] dbg: FuzzyOcr: "" in 15 lines
[8695] dbg: FuzzyOcr: "" in 15 lines
[8695] dbg: FuzzyOcr: (67 word occurrences found)
[8695] dbg: FuzzyOcr: Remove DIR: /tmp/.spamassassin8695S9N6D5tmp
[8695] dbg: FuzzyOcr: FuzzyOcr ending successfully...
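
A note on why an empty word can match at all: in approximate substring matching, an empty pattern is within distance 0 of every line, which would be consistent with the "fuzz of 0" reported for "" in the log above. The Python sketch below illustrates that, along with one way a word-list loader could skip blank entries. It is not FuzzyOcr's code (the plugin is Perl), and it is not a diagnosis of the warning at line 1146; `min_substring_distance` and `load_words` are hypothetical names, and the `#` comment syntax for the list is an assumption.

```python
from typing import Iterable, List


def min_substring_distance(word: str, line: str) -> int:
    """Smallest edit distance between `word` and any substring of `line`."""
    # Approximate substring matching: row 0 is all zeros, so a match
    # may start at any position in `line` at no cost.
    prev = [0] * (len(line) + 1)
    for i, wc in enumerate(word, start=1):
        cur = [i]  # matching i pattern chars against nothing costs i
        for j, lc in enumerate(line, start=1):
            cost = 0 if wc == lc else 1
            cur.append(min(prev[j] + 1,          # drop a pattern char
                           cur[j - 1] + 1,       # skip a line char
                           prev[j - 1] + cost))  # match or substitute
        prev = cur
    # The match may also end anywhere, so take the best column.
    return min(prev)


def load_words(raw_lines: Iterable[str]) -> List[str]:
    """Parse a hypothetical word list, dropping blanks and comments."""
    words = []
    for raw in raw_lines:
        entry = raw.split("#", 1)[0].strip()  # assumed comment syntax
        if entry:  # an empty entry would match every line
            words.append(entry.lower())
    return words


if __name__ == "__main__":
    # OCR lines taken from the debug log in this thread
    for w in ["target", ""]:
        print(repr(w), min_substring_distance(w, "shotermprcetargetoo"))
    print(load_words(["target", "", "   ", "# note", "stock  # x"]))
```

If the word list ever yields an empty or undefined entry, a guard like the `if entry:` check would keep it from being scored; whether that is what happens here when image hashing is enabled is an open question.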


Any ideas?

Thanks

Nigel Kendrick