Re: tmp files being left over from FuzzyOCR?

2006-10-20 Thread Nick Leverton
On Thursday 19 October 2006 16:00, George R. Kasica wrote:
 Its not a formal released version from Chris/decoder. I'm running b
 here as it seems the most stable.

 If you want J is at:

Decoder said some time ago that J was an early development version and 
recommended people go with b.  He's issued one patch since, in Message-ID 
[EMAIL PROTECTED]

FuzzyOCR.pm line 492:
  print DB $score::$digest\n;
Should be:
  print DB ${score}::${digest}\n;

HTH etc,

Nick


fuzzyocr-23b-hashdb-poison.patch.gz
Description: GNU Zip compressed data


Re: tmp files being left over from FuzzyOCR?

2006-10-20 Thread Nick Leverton
On Friday 20 October 2006 15:44, Nick Leverton wrote:

 HTH etc,
 Nick
 fuzzyocr-23b-hashdb-poison.patch.gz

Ignore that second patch which wasn't from decoder.  I must have 
accidentally dropped it into the mail somehow (have I mentioned recently I 
hate GUI mailers ?)

Nick


tmp files being left over from FuzzyOCR?

2006-10-19 Thread Bill
Since I installed FuzzyOCR I've noticed I'm having a lot of files named
similar to  .spamassassin8932mZBFrtmp  left in my /tmp folder. These are
from FuzzyOCR, correct? The content of these files has lots of spaces,
hyphens, commas with a few readable words and the word picture a few
times.

Is there something I need to do to ensure these files are removed? After
I manually remove them I see new tmp files being created and removed but
sometimes a file is NOT removed.

  Bill



Re: tmp files being left over from FuzzyOCR?

2006-10-19 Thread Chris Lear

* Bill wrote (19/10/06 14:03):

Since I installed FuzzyOCR I've noticed I'm having a lot of files named
similar to  .spamassassin8932mZBFrtmp  left in my /tmp folder. These are
from FuzzyOCR, correct? The content of these files has lots of spaces,
hyphens, commas with a few readable words and the word picture a few
times.

Is there something I need to do to ensure these files are removed? After
I manually remove them I see new tmp files being created and removed but
sometimes a file is NOT removed.


I suspect that if you look in your FuzzyOCR log, you will find errors 
that match the unremoved temp files.


Eg from my FuzzyOCR.log:

[2006-10-18 10:10:47] Unexpected error in pipe to external programs.
  Please check that all helper programs are 
installed and in the correct path.
  (Pipe Command /usr/bin/gifasm -d 
/tmp/.spamassassin2591CHsvrEtmp/out, Pipe exit code 1 (), Temporary 
file: /tmp/.spamassassin2591dNqOn7tmp)


I see that /tmp/.spamassassin2591CHsvrEtmp/ is still there, but 
/tmp/.spamassassin2591dNqOn7tmp isn't.


And another example:

[2006-10-18 09:34:24] FuzzyOcr received timeout after running 10 seconds.

#ls -l /tmp/.spamassassin* | grep 09:34
-rw---  1 spamd users 0 Oct 18 09:34 /tmp/.spamassassin2589Wc3z7Gtmp
-rw---  1 spamd users 23579 Oct 18 09:34 /tmp/.spamassassin2589yvpP1Htmp


Looks like when gifasm fails, you get a dir left over. If there's a 
timeout, you get a file left over.


Chris


RE: tmp files being left over from FuzzyOCR?

2006-10-19 Thread Duncan, Brian M.

I just looked and have tmp dirs being created by FuzzyOCR - with what
looks like tmp files in those dirs.  No tmp files in the root of /tmp

It looks like certain images are causing FuzzyOCR to quit proccessing
messages in my case based on what I see in these dead tmp dirs left
behind.  It's only happening on certain images it looks like. (only 3
tmp dirs left from all of today so far)

For the heck of it I took one of the graphic files in one of those tmp
dirs and put it inline and sent it in as a test.  It created a tmp dir
did not remove it, and the message that came in had a Spam score of 0
and it looks like Spam Assassin died because of FuzzyOCR having an issue
with the image. (I have the patches applied to the couple of apps that
supposedly cause segfaults - so I am not sure what caused the issue)

X-MailScanner-SpamCheck: not spam, SpamAssassin (not cached,
score=0, required 6.5, autolearn=)

From the header it looks like SpamAssassin did not finish processing the
message because of FuzzyOCR.  I better look into that now.


 -Original Message-
 From: Bill [mailto:[EMAIL PROTECTED]
 Sent: Thursday, October 19, 2006 8:03 AM
 To: users@spamassassin.apache.org
 Subject: tmp files being left over from FuzzyOCR?

 Since I installed FuzzyOCR I've noticed I'm having a lot
 of files named similar to  .spamassassin8932mZBFrtmp  left in
 my /tmp folder. These are from FuzzyOCR, correct? The content
 of these files has lots of spaces, hyphens, commas with a few
 readable words and the word picture a few times.

 Is there something I need to do to ensure these files are
 removed? After I manually remove them I see new tmp files
 being created and removed but sometimes a file is NOT removed.

   Bill



===
CIRCULAR 230 DISCLOSURE: Pursuant to Regulations Governing Practice Before the 
Internal Revenue Service, any tax advice contained herein is not intended or 
written to be used and cannot be used by a taxpayer for the purpose of avoiding 
tax penalties that may be imposed on the taxpayer.
===
CONFIDENTIALITY NOTICE:
This electronic mail message and any attached files contain information 
intended for the exclusive use of the individual or entity to whom it is 
addressed and may contain information that is proprietary, privileged, 
confidential and/or exempt from disclosure under applicable law.  If you are 
not the intended recipient, you are hereby notified that any viewing, copying, 
disclosure or distribution of this information may be subject to legal 
restriction or sanction.  Please notify the sender, by electronic mail or 
telephone, of any unintended recipients and delete the original message without 
making any copies.
===
NOTIFICATION:  Katten Muchin Rosenman LLP is an Illinois limited liability 
partnership that has elected to be governed by the Illinois Uniform Partnership 
Act (1997).
===


RE: tmp files being left over from FuzzyOCR?

2006-10-19 Thread Duncan, Brian M.

I noticed that there is this directive in the fuzzyocr.cf:

#  0 = always cleanup
#  1 = keep only if error
#  2 = always keep
focr_keep_bad_images 0

Mine was set to 1 by default, to keep bad images.  I set it to 0 but it
still is keeping bad images. (If what is in the dirs is bad images, when
I send a GIF I created no tmp dirs are left, if I send one of the images
left from one of the dirs from before it leaves it)



 -Original Message-
 From: Duncan, Brian M.
 Sent: Thursday, October 19, 2006 8:27 AM
 To: users@spamassassin.apache.org
 Subject: RE: tmp files being left over from FuzzyOCR?

 I just looked and have tmp dirs being created by FuzzyOCR -
 with what looks like tmp files in those dirs.  No tmp files
 in the root of /tmp

 It looks like certain images are causing FuzzyOCR to quit
 proccessing messages in my case based on what I see in these
 dead tmp dirs left behind.  It's only happening on certain
 images it looks like. (only 3 tmp dirs left from all of today so far)

 For the heck of it I took one of the graphic files in one of
 those tmp dirs and put it inline and sent it in as a test. 
 It created a tmp dir did not remove it, and the message that
 came in had a Spam score of 0 and it looks like Spam Assassin
 died because of FuzzyOCR having an issue with the image. (I
 have the patches applied to the couple of apps that
 supposedly cause segfaults - so I am not sure what caused the issue)

 X-MailScanner-SpamCheck: not spam, SpamAssassin (not cached,
   score=0, required 6.5, autolearn=)

 From the header it looks like SpamAssassin did not finish
 processing the message because of FuzzyOCR.  I better look
 into that now.


  -Original Message-
  From: Bill [mailto:[EMAIL PROTECTED]
  Sent: Thursday, October 19, 2006 8:03 AM
  To: users@spamassassin.apache.org
  Subject: tmp files being left over from FuzzyOCR?
 
  Since I installed FuzzyOCR I've noticed I'm having a
 lot of files
  named similar to  .spamassassin8932mZBFrtmp  left in my
 /tmp folder.
  These are from FuzzyOCR, correct? The content of these
 files has lots
  of spaces, hyphens, commas with a few readable words and the word
  picture a few times.
 
  Is there something I need to do to ensure these files
 are removed?
  After I manually remove them I see new tmp files being created and
  removed but sometimes a file is NOT removed.
 
Bill
 
 

===
CIRCULAR 230 DISCLOSURE: Pursuant to Regulations Governing Practice Before the 
Internal Revenue Service, any tax advice contained herein is not intended or 
written to be used and cannot be used by a taxpayer for the purpose of avoiding 
tax penalties that may be imposed on the taxpayer.
===
CONFIDENTIALITY NOTICE:
This electronic mail message and any attached files contain information 
intended for the exclusive use of the individual or entity to whom it is 
addressed and may contain information that is proprietary, privileged, 
confidential and/or exempt from disclosure under applicable law.  If you are 
not the intended recipient, you are hereby notified that any viewing, copying, 
disclosure or distribution of this information may be subject to legal 
restriction or sanction.  Please notify the sender, by electronic mail or 
telephone, of any unintended recipients and delete the original message without 
making any copies.
===
NOTIFICATION:  Katten Muchin Rosenman LLP is an Illinois limited liability 
partnership that has elected to be governed by the Illinois Uniform Partnership 
Act (1997).
===


Re: tmp files being left over from FuzzyOCR?

2006-10-19 Thread Bill
I'm using FuzzyOcr-2.3b and I can't find any reference to this option in
any of the FuzzyOCR software I downloaded.

focr_keep_bad_images 0

Here's a sample of the items in my /tmp folder. You said your's were
folders, mine's not. All of these files are left behind as at the time I
made this sample it was 9:25.

-rw---   1 mail mail 0 Oct 19 08:25 .spamassassin2053SgXN1Ktmp
-rw---   1 mail mail 0 Oct 19 08:31 .spamassassin2053uswH8Ttmp
-rw---   1 mail mail52 Oct 19 08:31 .spamassassin2053Wpxhuftmp
-rw---   1 mail mail  6549 Oct 19 08:31 .spamassassin26901cFX9cetmp
-rw---   1 mail mail 0 Oct 19 08:31 .spamassassin26901JsLOMYtmp
-rw---   1 mail mail  4267 Oct 19 08:36 .spamassassin300465gddzGtmp
-rw---   1 mail mail 0 Oct 19 08:31 .spamassassin30046lKDkjjtmp
-rw---   1 mail mail  3150 Oct 19 08:31 .spamassassin30046ptOrdstmp
-rw---   1 mail mail 0 Oct 19 08:36 .spamassassin30046xUScTotmp
-rw---   1 mail mail  4112 Oct 19 08:24 .spamassassin9067MfgpH3tmp


Here's a sample of a file.

  _(PICTURE)_   _  _   _  _ _   _ _ _   _ (PICTURE)_ _ _   _ _ _
__   _  _   _  _  (PICTURE) _  (PICTURE)__ _  _
_ _  __  _  _   _ _ _ _
An _nves_or A_ER_!!!   pE_RoSU_ DR_rr__C (pSUD )
 '_ UU_ _ L Y_L LdUY_ _L __ _ L_ __ UU_L_
_Ld! L WdLL_ ___ _ LUUdY _  ' ' ' '' ' ' '
 ' ' ' ' ' ' ' '' ' ' ''' ' ' ' ' ' ' '' '' ' ' ' '' ' ' ' '' ' ' '' ' ' ' '
' '' ' '' ' '_'
  \  petro5un, Incorporeted (p5UD _ _ews) ennounced thet
rU_ L_ _d__ _Y_ ___ _ ___ _L W_L_ _ ___LL_ d
_ _ __! _ _ __ _ _ _ _ __
,



  Bill






- Original Message - 
From: Duncan, Brian M.
To: users@spamassassin.apache.org
Sent: Thursday, October 19, 2006 8:42 AM
Subject: RE: tmp files being left over from FuzzyOCR?



I noticed that there is this directive in the fuzzyocr.cf:

#  0 = always cleanup
#  1 = keep only if error
#  2 = always keep
focr_keep_bad_images 0

Mine was set to 1 by default, to keep bad images.  I set it to 0 but it
still is keeping bad images. (If what is in the dirs is bad images, when
I send a GIF I created no tmp dirs are left, if I send one of the images
left from one of the dirs from before it leaves it)



 -Original Message-
 From: Duncan, Brian M.
 Sent: Thursday, October 19, 2006 8:27 AM
 To: users@spamassassin.apache.org
 Subject: RE: tmp files being left over from FuzzyOCR?

 I just looked and have tmp dirs being created by FuzzyOCR -
 with what looks like tmp files in those dirs.  No tmp files
 in the root of /tmp

 It looks like certain images are causing FuzzyOCR to quit
 proccessing messages in my case based on what I see in these
 dead tmp dirs left behind.  It's only happening on certain
 images it looks like. (only 3 tmp dirs left from all of today so far)

 For the heck of it I took one of the graphic files in one of
 those tmp dirs and put it inline and sent it in as a test.
 It created a tmp dir did not remove it, and the message that
 came in had a Spam score of 0 and it looks like Spam Assassin
 died because of FuzzyOCR having an issue with the image. (I
 have the patches applied to the couple of apps that
 supposedly cause segfaults - so I am not sure what caused the issue)

 X-MailScanner-SpamCheck: not spam, SpamAssassin (not cached,
 score=0, required 6.5, autolearn=)

 From the header it looks like SpamAssassin did not finish
 processing the message because of FuzzyOCR.  I better look
 into that now.


  -Original Message-
  From: Bill [mailto:[EMAIL PROTECTED]
  Sent: Thursday, October 19, 2006 8:03 AM
  To: users@spamassassin.apache.org
  Subject: tmp files being left over from FuzzyOCR?
 
  Since I installed FuzzyOCR I've noticed I'm having a
 lot of files
  named similar to  .spamassassin8932mZBFrtmp  left in my
 /tmp folder.
  These are from FuzzyOCR, correct? The content of these
 files has lots
  of spaces, hyphens, commas with a few readable words and the word
  picture a few times.
 
  Is there something I need to do to ensure these files
 are removed?
  After I manually remove them I see new tmp files being created and
  removed but sometimes a file is NOT removed.
 
Bill
 
 

===
CIRCULAR 230 DISCLOSURE: Pursuant to Regulations Governing Practice Before
the Internal Revenue Service, any tax advice contained herein is not
intended or written to be used and cannot be used by a taxpayer for the
purpose of avoiding tax penalties that may be imposed on the taxpayer.
===
CONFIDENTIALITY NOTICE:
This electronic mail message and any attached files contain information
intended for the exclusive use of the individual or entity to whom it is
addressed and may contain information that is proprietary, privileged

RE: tmp files being left over from FuzzyOCR?

2006-10-19 Thread Duncan, Brian M.
I am using 2.3j of Fuzzy OCR according to the Perl script.

drwx--   2 mail mail  4096 Oct 19 08:29 .spamassassin17656WleDs7tmp
drwx--   2 mail mail  4096 Oct 19 09:15 .spamassassin25775kNluNhtmp 

These are two dirs in my tmp folder currently. 

In one of those dirs I have:

Line-multi-gif
Line.gif
raw.eml
raw.err

 -Original Message-
 From: Bill [mailto:[EMAIL PROTECTED] 
 Sent: Thursday, October 19, 2006 9:29 AM
 To: users@spamassassin.apache.org
 Subject: Re: tmp files being left over from FuzzyOCR?
 
 I'm using FuzzyOcr-2.3b and I can't find any reference to 
 this option in any of the FuzzyOCR software I downloaded.
 
 focr_keep_bad_images 0
 
 Here's a sample of the items in my /tmp folder. You said 
 your's were folders, mine's not. All of these files are left 
 behind as at the time I made this sample it was 9:25.
 
 -rw---   1 mail mail 0 Oct 19 08:25 .spamassassin2053SgXN1Ktmp
 -rw---   1 mail mail 0 Oct 19 08:31 .spamassassin2053uswH8Ttmp
 -rw---   1 mail mail52 Oct 19 08:31 .spamassassin2053Wpxhuftmp
 -rw---   1 mail mail  6549 Oct 19 08:31 
 .spamassassin26901cFX9cetmp
 -rw---   1 mail mail 0 Oct 19 08:31 
 .spamassassin26901JsLOMYtmp
 -rw---   1 mail mail  4267 Oct 19 08:36 
 .spamassassin300465gddzGtmp
 -rw---   1 mail mail 0 Oct 19 08:31 
 .spamassassin30046lKDkjjtmp
 -rw---   1 mail mail  3150 Oct 19 08:31 
 .spamassassin30046ptOrdstmp
 -rw---   1 mail mail 0 Oct 19 08:36 
 .spamassassin30046xUScTotmp
 -rw---   1 mail mail  4112 Oct 19 08:24 .spamassassin9067MfgpH3tmp
 
 
 Here's a sample of a file.
 
   _(PICTURE)_   _  _   _  _ _   _ _ _   _ (PICTURE)_ _
  _   _ _ _
 __   _  _   _  _  (PICTURE) _  
 (PICTURE)__ _  _
 _ _  __  _  _   _ _ _ _
 An _nves_or A_ER_!!!   pE_RoSU_ DR_rr__C (pSUD )
  '_ UU_ _ L Y_L LdUY_ _L __ _ 
 L_ __ UU_L_
 _Ld! L WdLL_ ___ _ LUUdY _  ' ' ' '' ' ' '
  ' ' ' ' ' ' ' '' ' ' ''' ' ' ' ' ' ' '' '' ' ' ' '' ' ' ' '' 
 ' ' '' ' ' ' '
 ' '' ' '' ' '_'
   \  petro5un, Incorporeted (p5UD _ _ews) 
 ennounced thet rU_ L_ _d__ _Y_ ___ _ ___ _L W_L_ _ ___LL_ d
 _ _ __! _ _ __ _ _ _ _ __
 ,
 
 
 
   Bill
 
 
 
 
 
 
 - Original Message -
 From: Duncan, Brian M.
 To: users@spamassassin.apache.org
 Sent: Thursday, October 19, 2006 8:42 AM
 Subject: RE: tmp files being left over from FuzzyOCR?
 
 
 
 I noticed that there is this directive in the fuzzyocr.cf:
 
 #  0 = always cleanup
 #  1 = keep only if error
 #  2 = always keep
 focr_keep_bad_images 0
 
 Mine was set to 1 by default, to keep bad images.  I set it 
 to 0 but it
 still is keeping bad images. (If what is in the dirs is bad 
 images, when
 I send a GIF I created no tmp dirs are left, if I send one of 
 the images
 left from one of the dirs from before it leaves it)
 
 
 
  -Original Message-
  From: Duncan, Brian M.
  Sent: Thursday, October 19, 2006 8:27 AM
  To: users@spamassassin.apache.org
  Subject: RE: tmp files being left over from FuzzyOCR?
 
  I just looked and have tmp dirs being created by FuzzyOCR -
  with what looks like tmp files in those dirs.  No tmp files
  in the root of /tmp
 
  It looks like certain images are causing FuzzyOCR to quit
  proccessing messages in my case based on what I see in these
  dead tmp dirs left behind.  It's only happening on certain
  images it looks like. (only 3 tmp dirs left from all of 
 today so far)
 
  For the heck of it I took one of the graphic files in one of
  those tmp dirs and put it inline and sent it in as a test.
  It created a tmp dir did not remove it, and the message that
  came in had a Spam score of 0 and it looks like Spam Assassin
  died because of FuzzyOCR having an issue with the image. (I
  have the patches applied to the couple of apps that
  supposedly cause segfaults - so I am not sure what caused the issue)
 
  X-MailScanner-SpamCheck: not spam, SpamAssassin (not cached,
  score=0, required 6.5, autolearn=)
 
  From the header it looks like SpamAssassin did not finish
  processing the message because of FuzzyOCR.  I better look
  into that now.
 
 
   -Original Message-
   From: Bill [mailto:[EMAIL PROTECTED]
   Sent: Thursday, October 19, 2006 8:03 AM
   To: users@spamassassin.apache.org
   Subject: tmp files being left over from FuzzyOCR?
  
   Since I installed FuzzyOCR I've noticed I'm having a
  lot of files
   named similar to  .spamassassin8932mZBFrtmp  left in my
  /tmp folder.
   These are from FuzzyOCR, correct? The content of these
  files has lots
   of spaces, hyphens, commas with a few readable words and the word
   picture a few times.
  
   Is there something I need to do to ensure these files
  are removed?
   After I manually remove them I see new tmp files being created and
   removed but sometimes a file

Re: tmp files being left over from FuzzyOCR?

2006-10-19 Thread Bill
Ok, I wasn't going to ask but I guess I'll have to.

Where do I get the j version. It's not at
http://users.own-hero.net/~decoder/fuzzyocr/

  Bill



- Original Message - 
From: Duncan, Brian M.
To: Bill ; users@spamassassin.apache.org
Sent: Thursday, October 19, 2006 9:36 AM
Subject: RE: tmp files being left over from FuzzyOCR?


I am using 2.3j of Fuzzy OCR according to the Perl script.

drwx--   2 mail mail  4096 Oct 19 08:29 .spamassassin17656WleDs7tmp
drwx--   2 mail mail  4096 Oct 19 09:15 .spamassassin25775kNluNhtmp

These are two dirs in my tmp folder currently.

In one of those dirs I have:

Line-multi-gif
Line.gif
raw.eml
raw.err

 -Original Message-
 From: Bill [mailto:[EMAIL PROTECTED]
 Sent: Thursday, October 19, 2006 9:29 AM
 To: users@spamassassin.apache.org
 Subject: Re: tmp files being left over from FuzzyOCR?

 I'm using FuzzyOcr-2.3b and I can't find any reference to
 this option in any of the FuzzyOCR software I downloaded.

 focr_keep_bad_images 0

 Here's a sample of the items in my /tmp folder. You said
 your's were folders, mine's not. All of these files are left
 behind as at the time I made this sample it was 9:25.

 -rw---   1 mail mail 0 Oct 19 08:25 .spamassassin2053SgXN1Ktmp
 -rw---   1 mail mail 0 Oct 19 08:31 .spamassassin2053uswH8Ttmp
 -rw---   1 mail mail52 Oct 19 08:31 .spamassassin2053Wpxhuftmp
 -rw---   1 mail mail  6549 Oct 19 08:31
 .spamassassin26901cFX9cetmp
 -rw---   1 mail mail 0 Oct 19 08:31
 .spamassassin26901JsLOMYtmp
 -rw---   1 mail mail  4267 Oct 19 08:36
 .spamassassin300465gddzGtmp
 -rw---   1 mail mail 0 Oct 19 08:31
 .spamassassin30046lKDkjjtmp
 -rw---   1 mail mail  3150 Oct 19 08:31
 .spamassassin30046ptOrdstmp
 -rw---   1 mail mail 0 Oct 19 08:36
 .spamassassin30046xUScTotmp
 -rw---   1 mail mail  4112 Oct 19 08:24 .spamassassin9067MfgpH3tmp


 Here's a sample of a file.

   _(PICTURE)_   _  _   _  _ _   _ _ _   _ (PICTURE)_ _
  _   _ _ _
 __   _  _   _  _  (PICTURE) _
 (PICTURE)__ _  _
 _ _  __  _  _   _ _ _ _
 An _nves_or A_ER_!!!   pE_RoSU_ DR_rr__C (pSUD )
  '_ UU_ _ L Y_L LdUY_ _L __ _
 L_ __ UU_L_
 _Ld! L WdLL_ ___ _ LUUdY _  ' ' ' '' ' ' '
  ' ' ' ' ' ' ' '' ' ' ''' ' ' ' ' ' ' '' '' ' ' ' '' ' ' ' ''
 ' ' '' ' ' ' '
 ' '' ' '' ' '_'
   \  petro5un, Incorporeted (p5UD _ _ews)
 ennounced thet rU_ L_ _d__ _Y_ ___ _ ___ _L W_L_ _ ___LL_ d
 _ _ __! _ _ __ _ _ _ _ __
 ,



   Bill






 - Original Message -
 From: Duncan, Brian M.
 To: users@spamassassin.apache.org
 Sent: Thursday, October 19, 2006 8:42 AM
 Subject: RE: tmp files being left over from FuzzyOCR?



 I noticed that there is this directive in the fuzzyocr.cf:

 #  0 = always cleanup
 #  1 = keep only if error
 #  2 = always keep
 focr_keep_bad_images 0

 Mine was set to 1 by default, to keep bad images.  I set it
 to 0 but it
 still is keeping bad images. (If what is in the dirs is bad
 images, when
 I send a GIF I created no tmp dirs are left, if I send one of
 the images
 left from one of the dirs from before it leaves it)



  -Original Message-
  From: Duncan, Brian M.
  Sent: Thursday, October 19, 2006 8:27 AM
  To: users@spamassassin.apache.org
  Subject: RE: tmp files being left over from FuzzyOCR?
 
  I just looked and have tmp dirs being created by FuzzyOCR -
  with what looks like tmp files in those dirs.  No tmp files
  in the root of /tmp
 
  It looks like certain images are causing FuzzyOCR to quit
  proccessing messages in my case based on what I see in these
  dead tmp dirs left behind.  It's only happening on certain
  images it looks like. (only 3 tmp dirs left from all of
 today so far)
 
  For the heck of it I took one of the graphic files in one of
  those tmp dirs and put it inline and sent it in as a test.
  It created a tmp dir did not remove it, and the message that
  came in had a Spam score of 0 and it looks like Spam Assassin
  died because of FuzzyOCR having an issue with the image. (I
  have the patches applied to the couple of apps that
  supposedly cause segfaults - so I am not sure what caused the issue)
 
  X-MailScanner-SpamCheck: not spam, SpamAssassin (not cached,
  score=0, required 6.5, autolearn=)
 
  From the header it looks like SpamAssassin did not finish
  processing the message because of FuzzyOCR.  I better look
  into that now.
 
 
   -Original Message-
   From: Bill [mailto:[EMAIL PROTECTED]
   Sent: Thursday, October 19, 2006 8:03 AM
   To: users@spamassassin.apache.org
   Subject: tmp files being left over from FuzzyOCR?
  
   Since I installed FuzzyOCR I've noticed I'm having a
  lot of files
   named similar to  .spamassassin8932mZBFrtmp  left in my
  /tmp folder.
   These are from FuzzyOCR, correct? The content

Re: tmp files being left over from FuzzyOCR?

2006-10-19 Thread Chris Lear

* Bill wrote (19/10/06 15:29):

I'm using FuzzyOcr-2.3b and I can't find any reference to this option in
any of the FuzzyOCR software I downloaded.

focr_keep_bad_images 0

Here's a sample of the items in my /tmp folder. You said your's were
folders, mine's not. All of these files are left behind as at the time I
made this sample it was 9:25.


Look in your FuzzyOCR log. If it's like mine, you will see timeouts like 
this:


[2006-10-18 09:49:14] FuzzyOcr received timeout after running 10 seconds.

If the times on these timeouts match the times on the temp files, then 
that's what's causing them. That logic works for what I'm seeing.




===
CIRCULAR 230 DISCLOSURE: Pursuant to Regulations Governing Practice Before
the Internal Revenue Service, any tax advice contained herein is not
intended or written to be used and cannot be used by a taxpayer for the
purpose of avoiding tax penalties that may be imposed on the taxpayer.
===


Shame. I was hoping to get out of paying some tax.


CONFIDENTIALITY NOTICE:
This electronic mail message and any attached files contain information
intended for the exclusive use of the individual or entity to whom it is
addressed and may contain information that is proprietary, privileged,
confidential and/or exempt from disclosure under applicable law.  If you are
not the intended recipient, you are hereby notified that any viewing,
copying, disclosure or distribution of this information may be subject to
legal restriction or sanction.  Please notify the sender, by electronic mail
or telephone, of any unintended recipients and delete the original message
without making any copies.


I hope I was the intended recipient, but I'm not sure how I can know.


Re: tmp files being left over from FuzzyOCR?

2006-10-19 Thread George R . Kasica
Its not a formal released version from Chris/decoder. I'm running b
here as it seems the most stable. 

If you want J is at:

To: [EMAIL PROTECTED]
Subject: [Devel-spam] [Announce] Version 2.3j
From: Jorge Valdes [EMAIL PROTECTED]
Date: Mon, 25 Sep 2006 10:49:24 -0600

Hi all,

Just wanted to let everyone now that version 2.3j available here:

http://www.joval.info/proj/FuzzyOcr.html

This version is VERY stable, and besides fixing a couple of bugs 
reported to me on this list, better traps and reports errors from 
ImageMagick, no new features have been added. Unless a serious bug is 
encountered, this should be very close to a stable release.

On my todo list is that image inside application/ms-word (.doc) 
attachment spam.

Jorge



Ok, I wasn't going to ask but I guess I'll have to.

Where do I get the j version. It's not at
http://users.own-hero.net/~decoder/fuzzyocr/

  Bill



- Original Message - 
From: Duncan, Brian M.
To: Bill ; users@spamassassin.apache.org
Sent: Thursday, October 19, 2006 9:36 AM
Subject: RE: tmp files being left over from FuzzyOCR?


I am using 2.3j of Fuzzy OCR according to the Perl script.

drwx--   2 mail mail  4096 Oct 19 08:29 .spamassassin17656WleDs7tmp
drwx--   2 mail mail  4096 Oct 19 09:15 .spamassassin25775kNluNhtmp

These are two dirs in my tmp folder currently.

In one of those dirs I have:

Line-multi-gif
Line.gif
raw.eml
raw.err

 -Original Message-
 From: Bill [mailto:[EMAIL PROTECTED]
 Sent: Thursday, October 19, 2006 9:29 AM
 To: users@spamassassin.apache.org
 Subject: Re: tmp files being left over from FuzzyOCR?

 I'm using FuzzyOcr-2.3b and I can't find any reference to
 this option in any of the FuzzyOCR software I downloaded.

 focr_keep_bad_images 0

 Here's a sample of the items in my /tmp folder. You said
 your's were folders, mine's not. All of these files are left
 behind as at the time I made this sample it was 9:25.

 -rw---   1 mail mail 0 Oct 19 08:25 .spamassassin2053SgXN1Ktmp
 -rw---   1 mail mail 0 Oct 19 08:31 .spamassassin2053uswH8Ttmp
 -rw---   1 mail mail52 Oct 19 08:31 .spamassassin2053Wpxhuftmp
 -rw---   1 mail mail  6549 Oct 19 08:31
 .spamassassin26901cFX9cetmp
 -rw---   1 mail mail 0 Oct 19 08:31
 .spamassassin26901JsLOMYtmp
 -rw---   1 mail mail  4267 Oct 19 08:36
 .spamassassin300465gddzGtmp
 -rw---   1 mail mail 0 Oct 19 08:31
 .spamassassin30046lKDkjjtmp
 -rw---   1 mail mail  3150 Oct 19 08:31
 .spamassassin30046ptOrdstmp
 -rw---   1 mail mail 0 Oct 19 08:36
 .spamassassin30046xUScTotmp
 -rw---   1 mail mail  4112 Oct 19 08:24 .spamassassin9067MfgpH3tmp


 Here's a sample of a file.

   _(PICTURE)_   _  _   _  _ _   _ _ _   _ (PICTURE)_ _
  _   _ _ _
 __   _  _   _  _  (PICTURE) _
 (PICTURE)__ _  _
 _ _  __  _  _   _ _ _ _
 An _nves_or A_ER_!!!   pE_RoSU_ DR_rr__C (pSUD )
  '_ UU_ _ L Y_L LdUY_ _L __ _
 L_ __ UU_L_
 _Ld! L WdLL_ ___ _ LUUdY _  ' ' ' '' ' ' '
  ' ' ' ' ' ' ' '' ' ' ''' ' ' ' ' ' ' '' '' ' ' ' '' ' ' ' ''
 ' ' '' ' ' ' '
 ' '' ' '' ' '_'
   \  petro5un, Incorporeted (p5UD _ _ews)
 ennounced thet rU_ L_ _d__ _Y_ ___ _ ___ _L W_L_ _ ___LL_ d
 _ _ __! _ _ __ _ _ _ _ __
 ,



   Bill






 - Original Message -
 From: Duncan, Brian M.
 To: users@spamassassin.apache.org
 Sent: Thursday, October 19, 2006 8:42 AM
 Subject: RE: tmp files being left over from FuzzyOCR?



 I noticed that there is this directive in the fuzzyocr.cf:

 #  0 = always cleanup
 #  1 = keep only if error
 #  2 = always keep
 focr_keep_bad_images 0

 Mine was set to 1 by default, to keep bad images.  I set it
 to 0 but it
 still is keeping bad images. (If what is in the dirs is bad
 images, when
 I send a GIF I created no tmp dirs are left, if I send one of
 the images
 left from one of the dirs from before it leaves it)



  -Original Message-
  From: Duncan, Brian M.
  Sent: Thursday, October 19, 2006 8:27 AM
  To: users@spamassassin.apache.org
  Subject: RE: tmp files being left over from FuzzyOCR?
 
  I just looked and have tmp dirs being created by FuzzyOCR -
  with what looks like tmp files in those dirs.  No tmp files
  in the root of /tmp
 
  It looks like certain images are causing FuzzyOCR to quit
  proccessing messages in my case based on what I see in these
  dead tmp dirs left behind.  It's only happening on certain
  images it looks like. (only 3 tmp dirs left from all of
 today so far)
 
  For the heck of it I took one of the graphic files in one of
  those tmp dirs and put it inline and sent it in as a test.
  It created a tmp dir did not remove it, and the message that
  came in had a Spam score of 0 and it looks like Spam Assassin
  died because of FuzzyOCR having an issue with the image. (I
  have the patches applied to the couple