Re: orphaned .tmp files in the submission folder

2008-07-17 Thread SM

Hi Ron,
At 05:28 17-07-2008, Ron Smith wrote:

spamassassin --lint has always returned no issues with the rules.
spamassassin -D --lint returns a 304 line log file which I can provide
if requested. Other than the failure with Net::Ident (which refuses to
install under CPAN because it fails the make test), there is nothing
there that seems to be an issue.


spamassassin --lint tests the command line spamassassin and not 
spamd.  Your problem is not with the spamd configuration or the rules.



But if what you say is true of spamc being responsible for the actual
file creation and placement in the submission folder, then it would
appear to be a spamc issue entirely. Or the interaction between the
scanspam.sh script in the CommuniGate folder which


spamc does not create files.  It takes the file as input and sends 
the contents to spamd.  According to your script, a 
"$myCgate/Submitted/$NewFile" is appended to as the output of the 
spamc command.  That must be the orphaned files you were talking about.



I'm assuming that the spamc is probably failing, sending the .tmp file
back to the Submitted folder and CommuniGate is then reprocessing the
message and sending it back to scanspam.sh and so again to spamc.


It's your script that does that and no spamc.  spamprep may be 
creating files as well.  As I don't use these scripts, I cannot tell 
what is happening.



Now to figure out why spamc is failing on these messages.


If there is any failure, it should be logged to "$myCgate/Submitted/$NewFile".


Question though: does spamc return any email back to the submitted
folder with extra .tmp suffixes at any time or for any reason?


spamc does not return any emails.  It will return the results from 
spamd.  spamc does not create any .tmp files.  You can see the actual 
output by piping an email through spamc.  It's most likely spamprep 
which is creating the orphaned .tmp files.


Regards,
-sm 



Re: orphaned .tmp files in the submission folder

2008-07-17 Thread Alex Woick

Ron Smith schrieb am 17.07.2008 14:28:

I'm assuming that the spamc is probably failing, sending the .tmp file 
back to the Submitted folder and CommuniGate is then reprocessing the 
message and sending it back to scanspam.sh and so again to spamc.


Now to figure out why spamc is failing on these messages.

Question though: does spamc return any email back to the submitted 
folder with extra .tmp suffixes at any time or for any reason?


Spamc is only a filter that gets a message from standard input and 
passes it (with SpamAssassin headers added) back to standard output. 
Additionally, it gives a scan success/failure return code.


Spamc is not reading messages from files nor bouncing messages. It is 
simply adding a few headers to the given message, so that other programs 
can detect what SpamAssassin thinks of the spamminess of a message.


The caller of spamc is responsible for reading a message from a file or 
a mail queue and for writing the updated message back to where it should 
go. So I suggest you move your question to a Communigate support list, 
since it seems that there is a problem with the calling of spamc, not 
with spamc itself.


Tschau
Alex


Re: orphaned .tmp files in the submission folder

2008-07-17 Thread Ron Smith

Thanks, SM.

spamassassin --lint has always returned no issues with the rules.  
spamassassin -D --lint returns a 304 line log file which I can provide  
if requested. Other than the failure with Net::Ident (which refuses to  
install under CPAN because it fails the make test), there is nothing  
there that seems to be an issue.


But if what you say is true of spamc being responsible for the actual  
file creation and placement in the submission folder, then it would  
appear to be a spamc issue entirely. Or the interaction between the  
scanspam.sh script in the CommuniGate folder which


I'm assuming that the spamc is probably failing, sending the .tmp file  
back to the Submitted folder and CommuniGate is then reprocessing the  
message and sending it back to scanspam.sh and so again to spamc.


Now to figure out why spamc is failing on these messages.

Question though: does spamc return any email back to the submitted  
folder with extra .tmp suffixes at any time or for any reason?


Here's the working line in the scanspam.sh script that calls spamc:

 /var/CommuniGate/spam/spamprep "$myCgate/$QueuePath" "$ReturnPath"  
"$Username" | /usr/bin/spamc -d 127.0.0.1 -t 800 -u "$Username" >>  
"$myCgate/Submitted/$NewFile"



Thanks,

Ron

On Jul 17, 2008, at 3:39 AM, SM wrote:


At 17:34 16-07-2008, Ron Smith wrote:

I'm using spamc/spamd with CommuniGate Pro. When spamd puts the file
in the submission folder it USUALLY gets a .sub extension applied
within a minute or two. However I am seeing orphaned files that are
both non-spam and spam that just get left as .tmp files. Many times
these files have multiple .tmp extensions (up to 9 or 10 of them) as
though spamd just coughed and started over.


spamd does not put files in folders.  spamc sends the contents of  
the file to spamd.  The contents are evaluated and the results are  
returned back to spamc.


You can enable debugging in spamd to see what is happening at that  
end.  Find out what process is creating those .tmp files.  The  
process should usually do the cleaning up.


The probability of getting feedback is higher if you provide  
technical information such as log entries which demonstrate what is  
going on.  If it's a spamd configuration problem, you can get the  
debug output by following http://wiki.apache.org/spamassassin/HowToDebug


Regards,
-sm




Re: orphaned .tmp files in the submission folder

2008-07-17 Thread SM

At 17:34 16-07-2008, Ron Smith wrote:

I'm using spamc/spamd with CommuniGate Pro. When spamd puts the file
in the submission folder it USUALLY gets a .sub extension applied
within a minute or two. However I am seeing orphaned files that are
both non-spam and spam that just get left as .tmp files. Many times
these files have multiple .tmp extensions (up to 9 or 10 of them) as
though spamd just coughed and started over.


spamd does not put files in folders.  spamc sends the contents of the 
file to spamd.  The contents are evaluated and the results are 
returned back to spamc.


You can enable debugging in spamd to see what is happening at that 
end.  Find out what process is creating those .tmp files.  The 
process should usually do the cleaning up.


The probability of getting feedback is higher if you provide 
technical information such as log entries which demonstrate what is 
going on.  If it's a spamd configuration problem, you can get the 
debug output by following http://wiki.apache.org/spamassassin/HowToDebug


Regards,
-sm 



orphaned .tmp files in the submission folder

2008-07-16 Thread Ron Smith
Hello again. I have not received any feedback from anyone. I would  
really appreciate any help.


I'm using spamc/spamd with CommuniGate Pro. When spamd puts the file  
in the submission folder it USUALLY gets a .sub extension applied  
within a minute or two. However I am seeing orphaned files that are  
both non-spam and spam that just get left as .tmp files. Many times  
these files have multiple .tmp extensions (up to 9 or 10 of them) as  
though spamd just coughed and started over.


Currently I'm examining these files and deleting the spam and using a  
drag and drop renaming utility to manually submit (add the .sub  
extensions) these files a couple of times a day.


Why is this happening and what settings changes will stop it? I've  
tried more spamd children and less spamd children. I've turned  
autowhitelisting off and on. I've checked all the mysql spamassassin  
database files and they are in working order.


Here's the Startup.sh script that CommuniGate pro runs with variations  
of the spamd command that I've tried:


#!/bin/sh

#SpamAssassin daemonstartup

spamd -d -x -q -u nobody -x -m 5 -r /var/Communigate/spamdprocessid
#spamd -d -x -q -u nobody -x -m 8 --ident-timeout=15  -r /var/ 
Communigate/spamdprocessid
#spamd -d -x -Q -m 8 --ident-timeout=15  -r /var/Communigate/ 
spamdprocessid



My hardware/software setup is as follow:

Mac OS X 10.5.4 client
4 gb ram
spamd with up to 5 children
CommuniGate Pro 5.2.5
SpamAssassin 3.2.5



orphaned .tmp files in the submission folder

2008-07-15 Thread Ron Smith

Hi,

I'm using spamc/spamd with CommuniGate Pro. When spamd puts the file  
in the submission folder it USUALLY gets a .sub extension applied  
within a minute or two. However I am seeing orphaned files that are  
both non-spam and spam that just get left as .tmp files. Many times  
these files have multiple .tmp extensions (up to 9 or 10 of them) as  
though spamd just coughed and started over.


Currently I'm examining these files and deleting the spam and using a  
drag and drop renaming utility to manually submit (add the .sub  
extensions) these files a couple of times a day.


Why is this happening and what settings changes will stop it? I've  
tried more spamd children and less spamd children. I've turned  
autowhitelisting off and on. I've checked all the mysql spamassassin  
database files and they are in working order.


Here's the Startup.sh script that CommuniGate pro runs with variations  
of the spamd command that I've tried:


#!/bin/sh

#SpamAssassin daemonstartup

spamd -d -x -q -u nobody -x -m 5 -r /var/Communigate/spamdprocessid
#spamd -d -x -q -u nobody -x -m 8 --ident-timeout=15  -r /var/ 
Communigate/spamd$
#spamd -d -x -Q -m 8 --ident-timeout=15  -r /var/Communigate/ 
spamdprocessid



My hardware/software setup is as follow:

Mac OS X 10.5.4 client
4 gb ram
spamd with up to 5 children
CommuniGate Pro 5.2.5
SpamAssassin 3.2.5




RE: TMP Files

2007-05-03 Thread Jason Bertoch [Electronet]
> -Original Message-
> From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] 
> Sent: Wednesday, May 02, 2007 4:25 PM
> To: Theo Van Dinter
> Cc: users@spamassassin.apache.org
> Subject: Re: TMP Files 
> 
> 
> It does sound like that, alright.  The change was to support 
> win32, which
> doesn't allow the "open then unlink" POSIX mode that  the 
> code was relying
> on.
> 
> My guess -- if the Message object is not finish()'d, it may 
> leave those
> temporary files behind, I guess.
> 
> Could someone seeing this bug please open a bug on the 
> bugzilla about it?
> 
> --j.

Bug #5444 has been opened regarding this matter.  Since opening the bug, I
completely removed my SA install and started from scratch, including updating
all local Perl modules, to no avail.  For now, I've set up a cron job to remove
old files, but it's got me concerned where if that script fails, I'll run out of
tmp space.



Jason A. Bertoch
Network Administrator
[EMAIL PROTECTED]
ElectroNet Intermedia Consulting
3411 Capital Medical Blvd.
Tallahassee, FL 32308
(V) 850.222.0229 (F) 850.222.8771 



Re: TMP Files

2007-05-02 Thread Justin Mason

Theo Van Dinter writes:
> On Wed, May 02, 2007 at 02:53:03PM -0400, Daryl C. W. O'Shea wrote:
> > >normal .spamassassinXXXtmp files in /tmp but is not always removing
> > >them.  The files that remain only ever contain garbage text, like what's 
> > >seen as
> > >a mail attachment in a plain text file.  Without usable text from these 
> 
> fwiw, it sounds like the new "put non-text parts in a temp file" code.
> my version had those files removed when they were created, but I think
> I saw a mail float by recently about that bit getting changed before
> the release.

It does sound like that, alright.  The change was to support win32, which
doesn't allow the "open then unlink" POSIX mode that  the code was relying
on.

My guess -- if the Message object is not finish()'d, it may leave those
temporary files behind, I guess.

Could someone seeing this bug please open a bug on the bugzilla about it?

--j.


Re: TMP Files

2007-05-02 Thread Theo Van Dinter
On Wed, May 02, 2007 at 02:53:03PM -0400, Daryl C. W. O'Shea wrote:
> >normal .spamassassinXXXtmp files in /tmp but is not always removing
> >them.  The files that remain only ever contain garbage text, like what's 
> >seen as
> >a mail attachment in a plain text file.  Without usable text from these 

fwiw, it sounds like the new "put non-text parts in a temp file" code.
my version had those files removed when they were created, but I think
I saw a mail float by recently about that bit getting changed before
the release.

-- 
Randomly Selected Tagline:
"Duct tape is like the force; it has a light side & a dark side, and it
 holds the universe together." - Zen Musings


pgph0oE7bJbNh.pgp
Description: PGP signature


Re: TMP Files

2007-05-02 Thread Daryl C. W. O'Shea

Jason Bertoch [Electronet] wrote:

-Original Message-
From: Daryl C. W. O'Shea [mailto:[EMAIL PROTECTED] 
Sent: Wednesday, May 02, 2007 2:53 PM

To: Jason Bertoch [Electronet]
Cc: users@spamassassin.apache.org
Subject: Re: TMP Files

Is MIMEDefang timing out the SA processes and killing them off before 
they have a chance to finish (check your maillog)?



No, neither MIMEDefang nor Sendmail are logging anything out of the ordinary
while this is happening.


Is there a temp file left for every message processed, a lot of 
messages, or only a few?


Are all of your messages being successfully scanned?


Daryl


RE: TMP Files

2007-05-02 Thread Jason Bertoch [Electronet]
> -Original Message-
> From: Daryl C. W. O'Shea [mailto:[EMAIL PROTECTED] 
> Sent: Wednesday, May 02, 2007 2:53 PM
> To: Jason Bertoch [Electronet]
> Cc: users@spamassassin.apache.org
> Subject: Re: TMP Files
> 
> Is MIMEDefang timing out the SA processes and killing them off before 
> they have a chance to finish (check your maillog)?
> 
> Daryl

No, neither MIMEDefang nor Sendmail are logging anything out of the ordinary
while this is happening.



Re: TMP Files

2007-05-02 Thread Daryl C. W. O'Shea

Jason Bertoch [Electronet] wrote:


So far I've disabled all non-stock SA rules and plugins.  SA is creating its
normal .spamassassinXXXtmp files in /tmp but is not always removing
them.  The files that remain only ever contain garbage text, like what's seen as
a mail attachment in a plain text file.  Without usable text from these files,
I'm having a difficult time tying them to any specific message in my logs.

I'm running SA 3.2.0 with Sendmail 8.14.1 and calling SA from MIMEDefang 2.62.
All packages were compiled from source and the only package changed was the
upgrade to SA 3.2.0.


Is MIMEDefang timing out the SA processes and killing them off before 
they have a chance to finish (check your maillog)?


Daryl


RE: TMP Files

2007-05-02 Thread Jason Bertoch [Electronet]

> -Original Message-
> From: Daryl C. W. O'Shea [mailto:[EMAIL PROTECTED] 
> Sent: Wednesday, May 02, 2007 1:20 PM
> Jason Bertoch [Electronet] wrote:
> > 
> > Since the upgrade this morning, many SA temp files aren't 
> being cleaned up.  Is
> > anyone else seeing this?
> 
> Can you be more specific?
> 
> Daryl

So far I've disabled all non-stock SA rules and plugins.  SA is creating its
normal .spamassassinXXXtmp files in /tmp but is not always removing
them.  The files that remain only ever contain garbage text, like what's seen as
a mail attachment in a plain text file.  Without usable text from these files,
I'm having a difficult time tying them to any specific message in my logs.

I'm running SA 3.2.0 with Sendmail 8.14.1 and calling SA from MIMEDefang 2.62.
All packages were compiled from source and the only package changed was the
upgrade to SA 3.2.0.



Re: TMP Files

2007-05-02 Thread Daryl C. W. O'Shea

Jason Bertoch [Electronet] wrote:


Since the upgrade this morning, many SA temp files aren't being cleaned up.  Is
anyone else seeing this?


Can you be more specific?

Daryl


TMP Files

2007-05-02 Thread Jason Bertoch [Electronet]


Since the upgrade this morning, many SA temp files aren't being cleaned up.  Is
anyone else seeing this?

Jason A. Bertoch
Network Administrator
[EMAIL PROTECTED]
ElectroNet Intermedia Consulting
3411 Capital Medical Blvd.
Tallahassee, FL 32308
(V) 850.222.0229 (F) 850.222.8771



Re: tmp files being left over from FuzzyOCR?

2006-10-20 Thread Nick Leverton
On Friday 20 October 2006 15:44, Nick Leverton wrote:

> HTH etc,
> Nick
> fuzzyocr-23b-hashdb-poison.patch.gz

Ignore that second patch which wasn't from decoder.  I must have 
accidentally dropped it into the mail somehow (have I mentioned recently I 
hate GUI mailers ?)

Nick


Re: tmp files being left over from FuzzyOCR?

2006-10-20 Thread Nick Leverton
On Thursday 19 October 2006 16:00, George R. Kasica wrote:
> Its not a formal released version from Chris/decoder. I'm running b
> here as it seems the most stable.
>
> If you want J is at:

Decoder said some time ago that J was an early development version and 
recommended people go with b.  He's issued one patch since, in Message-ID 
<[EMAIL PROTECTED]>

FuzzyOCR.pm line 492:
  print DB "$score::$digest\n";
Should be:
  print DB "${score}::${digest}\n";

HTH etc,

Nick


fuzzyocr-23b-hashdb-poison.patch.gz
Description: GNU Zip compressed data


Re: tmp files being left over from FuzzyOCR?

2006-10-19 Thread George R . Kasica
Its not a formal released version from Chris/decoder. I'm running b
here as it seems the most stable. 

If you want J is at:

>To: [EMAIL PROTECTED]
>Subject: [Devel-spam] [Announce] Version 2.3j
>From: Jorge Valdes <[EMAIL PROTECTED]>
>Date: Mon, 25 Sep 2006 10:49:24 -0600
>
>Hi all,
>
>Just wanted to let everyone now that version 2.3j available here:
>
>http://www.joval.info/proj/FuzzyOcr.html
>
>This version is VERY stable, and besides fixing a couple of bugs 
>reported to me on this list, better traps and reports errors from 
>ImageMagick, no new features have been added. Unless a serious bug is 
>encountered, this should be very close to a stable release.
>
>On my todo list is that image inside application/ms-word (.doc) 
>attachment spam.
>
>Jorge



>Ok, I wasn't going to ask but I guess I'll have to.
>
>Where do I get the "j" version. It's not at
>http://users.own-hero.net/~decoder/fuzzyocr/
>
>  Bill
>
>
>
>- Original Message ----- 
>From: Duncan, Brian M.
>To: Bill ; users@spamassassin.apache.org
>Sent: Thursday, October 19, 2006 9:36 AM
>Subject: RE: tmp files being left over from FuzzyOCR?
>
>
>I am using 2.3j of Fuzzy OCR according to the Perl script.
>
>drwx--   2 mail mail  4096 Oct 19 08:29 .spamassassin17656WleDs7tmp
>drwx--   2 mail mail  4096 Oct 19 09:15 .spamassassin25775kNluNhtmp
>
>These are two dirs in my tmp folder currently.
>
>In one of those dirs I have:
>
>Line-multi-gif
>Line.gif
>raw.eml
>raw.err
>
>> -Original Message-
>> From: Bill [mailto:[EMAIL PROTECTED]
>> Sent: Thursday, October 19, 2006 9:29 AM
>> To: users@spamassassin.apache.org
>> Subject: Re: tmp files being left over from FuzzyOCR?
>>
>> I'm using FuzzyOcr-2.3b and I can't find any reference to
>> this option in any of the FuzzyOCR software I downloaded.
>>
>> focr_keep_bad_images 0
>>
>> Here's a sample of the items in my /tmp folder. You said
>> your's were folders, mine's not. All of these files are left
>> behind as at the time I made this sample it was 9:25.
>>
>> -rw---   1 mail mail 0 Oct 19 08:25 .spamassassin2053SgXN1Ktmp
>> -rw---   1 mail mail 0 Oct 19 08:31 .spamassassin2053uswH8Ttmp
>> -rw---   1 mail mail52 Oct 19 08:31 .spamassassin2053Wpxhuftmp
>> -rw---   1 mail mail  6549 Oct 19 08:31
>> .spamassassin26901cFX9cetmp
>> -rw---   1 mail mail 0 Oct 19 08:31
>> .spamassassin26901JsLOMYtmp
>> -rw---   1 mail mail  4267 Oct 19 08:36
>> .spamassassin300465gddzGtmp
>> -rw---   1 mail mail 0 Oct 19 08:31
>> .spamassassin30046lKDkjjtmp
>> -rw---   1 mail mail  3150 Oct 19 08:31
>> .spamassassin30046ptOrdstmp
>> -rw---   1 mail mail 0 Oct 19 08:36
>> .spamassassin30046xUScTotmp
>> -rw---   1 mail mail  4112 Oct 19 08:24 .spamassassin9067MfgpH3tmp
>>
>>
>> Here's a sample of a file.
>>
>>   _(PICTURE)_   _  _   _  _ _   _ _ _   _ (PICTURE)_ _
>>  _   _ _ _
>> __   _  _   _  _  (PICTURE) _
>> (PICTURE)__ _  _
>> _ _  __  _  _   _ _ _ _
>> An _nves_or A_ER_!!!   pE_RoSU_ DR_rr__C (pSUD )
>>  '_ UU_ _ L Y_L LdUY_ _L __ _
>> L_ __ UU_L_
>> _Ld! L WdLL_ ___ _ LUUdY _  ' ' ' '' ' ' '
>>  ' ' ' ' ' ' ' '' ' ' ''' ' ' ' ' ' ' '' '' ' ' ' '' ' ' ' ''
>> ' ' '' ' ' ' '
>> ' '' ' '' ' '_'
>>   \  petro5un, Incorporeted (p5UD _ _ews)
>> ennounced thet rU_ L_ _d__ _Y_ ___ _ ___ _L W_L_ _ ___LL_ d
>> _ _ __! _ _ __ _ _ _ _ __
>> ,
>>
>>
>>
>>   Bill
>>
>>
>>
>>
>>
>>
>> - Original Message -
>> From: Duncan, Brian M.
>> To: users@spamassassin.apache.org
>> Sent: Thursday, October 19, 2006 8:42 AM
>> Subject: RE: tmp files being left over from FuzzyOCR?
>>
>>
>>
>> I noticed that there is this directive in the fuzzyocr.cf:
>>
>> #  0 = always cleanup
>> #  1 = keep only if error
>> #  2 = always keep
>> focr_keep_bad_images 0
>>
>> Mine was set to 1 by default, to keep bad imag

Re: tmp files being left over from FuzzyOCR?

2006-10-19 Thread Chris Lear

* Bill wrote (19/10/06 15:29):

I'm using FuzzyOcr-2.3b and I can't find any reference to this option in
any of the FuzzyOCR software I downloaded.

focr_keep_bad_images 0

Here's a sample of the items in my /tmp folder. You said your's were
folders, mine's not. All of these files are left behind as at the time I
made this sample it was 9:25.


Look in your FuzzyOCR log. If it's like mine, you will see timeouts like 
this:


[2006-10-18 09:49:14] FuzzyOcr received timeout after running "10" seconds.

If the times on these timeouts match the times on the temp files, then 
that's what's causing them. That logic works for what I'm seeing.




===
CIRCULAR 230 DISCLOSURE: Pursuant to Regulations Governing Practice Before
the Internal Revenue Service, any tax advice contained herein is not
intended or written to be used and cannot be used by a taxpayer for the
purpose of avoiding tax penalties that may be imposed on the taxpayer.
===


Shame. I was hoping to get out of paying some tax.


CONFIDENTIALITY NOTICE:
This electronic mail message and any attached files contain information
intended for the exclusive use of the individual or entity to whom it is
addressed and may contain information that is proprietary, privileged,
confidential and/or exempt from disclosure under applicable law.  If you are
not the intended recipient, you are hereby notified that any viewing,
copying, disclosure or distribution of this information may be subject to
legal restriction or sanction.  Please notify the sender, by electronic mail
or telephone, of any unintended recipients and delete the original message
without making any copies.


I hope I was the intended recipient, but I'm not sure how I can know.


Re: tmp files being left over from FuzzyOCR?

2006-10-19 Thread Bill
Ok, I wasn't going to ask but I guess I'll have to.

Where do I get the "j" version. It's not at
http://users.own-hero.net/~decoder/fuzzyocr/

  Bill



- Original Message - 
From: Duncan, Brian M.
To: Bill ; users@spamassassin.apache.org
Sent: Thursday, October 19, 2006 9:36 AM
Subject: RE: tmp files being left over from FuzzyOCR?


I am using 2.3j of Fuzzy OCR according to the Perl script.

drwx--   2 mail mail  4096 Oct 19 08:29 .spamassassin17656WleDs7tmp
drwx--   2 mail mail  4096 Oct 19 09:15 .spamassassin25775kNluNhtmp

These are two dirs in my tmp folder currently.

In one of those dirs I have:

Line-multi-gif
Line.gif
raw.eml
raw.err

> -Original Message-
> From: Bill [mailto:[EMAIL PROTECTED]
> Sent: Thursday, October 19, 2006 9:29 AM
> To: users@spamassassin.apache.org
> Subject: Re: tmp files being left over from FuzzyOCR?
>
> I'm using FuzzyOcr-2.3b and I can't find any reference to
> this option in any of the FuzzyOCR software I downloaded.
>
> focr_keep_bad_images 0
>
> Here's a sample of the items in my /tmp folder. You said
> your's were folders, mine's not. All of these files are left
> behind as at the time I made this sample it was 9:25.
>
> -rw---   1 mail mail 0 Oct 19 08:25 .spamassassin2053SgXN1Ktmp
> -rw---   1 mail mail 0 Oct 19 08:31 .spamassassin2053uswH8Ttmp
> -rw---   1 mail mail52 Oct 19 08:31 .spamassassin2053Wpxhuftmp
> -rw---   1 mail mail  6549 Oct 19 08:31
> .spamassassin26901cFX9cetmp
> -rw---   1 mail mail 0 Oct 19 08:31
> .spamassassin26901JsLOMYtmp
> -rw---   1 mail mail  4267 Oct 19 08:36
> .spamassassin300465gddzGtmp
> -rw---   1 mail mail 0 Oct 19 08:31
> .spamassassin30046lKDkjjtmp
> -rw---   1 mail mail  3150 Oct 19 08:31
> .spamassassin30046ptOrdstmp
> -rw---   1 mail mail 0 Oct 19 08:36
> .spamassassin30046xUScTotmp
> -rw---   1 mail mail  4112 Oct 19 08:24 .spamassassin9067MfgpH3tmp
>
>
> Here's a sample of a file.
>
>   _(PICTURE)_   _  _   _  _ _   _ _ _   _ (PICTURE)_ _
>  _   _ _ _
> __   _  _   _  _  (PICTURE) _
> (PICTURE)__ _  _
> _ _  __  _  _   _ _ _ _
> An _nves_or A_ER_!!!   pE_RoSU_ DR_rr__C (pSUD )
>  '_ UU_ _ L Y_L LdUY_ _L __ _
> L_ __ UU_L_
> _Ld! L WdLL_ ___ _ LUUdY _  ' ' ' '' ' ' '
>  ' ' ' ' ' ' ' '' ' ' ''' ' ' ' ' ' ' '' '' ' ' ' '' ' ' ' ''
> ' ' '' ' ' ' '
> ' '' ' '' ' '_'
>   \  petro5un, Incorporeted (p5UD _ _ews)
> ennounced thet rU_ L_ _d__ _Y_ ___ _ ___ _L W_L_ _ ___LL_ d
> _ _ __! _ _ __ _ _ _ _ __
> ,
>
>
>
>   Bill
>
>
>
>
>
>
> - Original Message -
> From: Duncan, Brian M.
> To: users@spamassassin.apache.org
> Sent: Thursday, October 19, 2006 8:42 AM
> Subject: RE: tmp files being left over from FuzzyOCR?
>
>
>
> I noticed that there is this directive in the fuzzyocr.cf:
>
> #  0 = always cleanup
> #  1 = keep only if error
> #  2 = always keep
> focr_keep_bad_images 0
>
> Mine was set to 1 by default, to keep bad images.  I set it
> to 0 but it
> still is keeping bad images. (If what is in the dirs is bad
> images, when
> I send a GIF I created no tmp dirs are left, if I send one of
> the images
> left from one of the dirs from before it leaves it)
>
>
>
> > -Original Message-
> > From: Duncan, Brian M.
> > Sent: Thursday, October 19, 2006 8:27 AM
> > To: users@spamassassin.apache.org
> > Subject: RE: tmp files being left over from FuzzyOCR?
> >
> > I just looked and have tmp dirs being created by FuzzyOCR -
> > with what looks like tmp files in those dirs.  No tmp files
> > in the root of /tmp
> >
> > It looks like certain images are causing FuzzyOCR to quit
> > proccessing messages in my case based on what I see in these
> > "dead" tmp dirs left behind.  It's only happening on certain
> > images it looks like. (only 3 tmp dirs left from all of
> today so far)
> >
> > For the heck of it I took one of the graphic files in one of
> > those tmp dirs and put it inline and sent it in as a test.
> > It created a tmp dir did not remove it, and the message that
> > came in had a Spam score of 0 and it loo

RE: tmp files being left over from FuzzyOCR?

2006-10-19 Thread Duncan, Brian M.
I am using 2.3j of Fuzzy OCR according to the Perl script.

drwx--   2 mail mail  4096 Oct 19 08:29 .spamassassin17656WleDs7tmp
drwx--   2 mail mail  4096 Oct 19 09:15 .spamassassin25775kNluNhtmp 

These are two dirs in my tmp folder currently. 

In one of those dirs I have:

Line-multi-gif
Line.gif
raw.eml
raw.err

> -Original Message-
> From: Bill [mailto:[EMAIL PROTECTED] 
> Sent: Thursday, October 19, 2006 9:29 AM
> To: users@spamassassin.apache.org
> Subject: Re: tmp files being left over from FuzzyOCR?
> 
> I'm using FuzzyOcr-2.3b and I can't find any reference to 
> this option in any of the FuzzyOCR software I downloaded.
> 
> focr_keep_bad_images 0
> 
> Here's a sample of the items in my /tmp folder. You said 
> your's were folders, mine's not. All of these files are left 
> behind as at the time I made this sample it was 9:25.
> 
> -rw---   1 mail mail 0 Oct 19 08:25 .spamassassin2053SgXN1Ktmp
> -rw---   1 mail mail 0 Oct 19 08:31 .spamassassin2053uswH8Ttmp
> -rw---   1 mail mail52 Oct 19 08:31 .spamassassin2053Wpxhuftmp
> -rw---   1 mail mail  6549 Oct 19 08:31 
> .spamassassin26901cFX9cetmp
> -rw---   1 mail mail 0 Oct 19 08:31 
> .spamassassin26901JsLOMYtmp
> -rw---   1 mail mail  4267 Oct 19 08:36 
> .spamassassin300465gddzGtmp
> -rw---   1 mail mail 0 Oct 19 08:31 
> .spamassassin30046lKDkjjtmp
> -rw---   1 mail mail  3150 Oct 19 08:31 
> .spamassassin30046ptOrdstmp
> -rw---   1 mail mail 0 Oct 19 08:36 
> .spamassassin30046xUScTotmp
> -rw---   1 mail mail  4112 Oct 19 08:24 .spamassassin9067MfgpH3tmp
> 
> 
> Here's a sample of a file.
> 
>   _(PICTURE)_   _  _   _  _ _   _ _ _   _ (PICTURE)_ _
>  _   _ _ _
> __   _  _   _  _  (PICTURE) _  
> (PICTURE)__ _  _
> _ _  __  _  _   _ _ _ _
> An _nves_or A_ER_!!!   pE_RoSU_ DR_rr__C (pSUD )
>  '_ UU_ _ L Y_L LdUY_ _L __ _ 
> L_ __ UU_L_
> _Ld! L WdLL_ ___ _ LUUdY _  ' ' ' '' ' ' '
>  ' ' ' ' ' ' ' '' ' ' ''' ' ' ' ' ' ' '' '' ' ' ' '' ' ' ' '' 
> ' ' '' ' ' ' '
> ' '' ' '' ' '_'
>   \  petro5un, Incorporeted (p5UD _ _ews) 
> ennounced thet rU_ L_ _d__ _Y_ ___ _ ___ _L W_L_ _ ___LL_ d
> _ _ __! _ _ __ _ _ _ _ __
> ,
> 
> 
> 
>   Bill
> 
> 
> 
> 
> 
> 
> - Original Message -
> From: Duncan, Brian M.
> To: users@spamassassin.apache.org
> Sent: Thursday, October 19, 2006 8:42 AM
> Subject: RE: tmp files being left over from FuzzyOCR?
> 
> 
> 
> I noticed that there is this directive in the fuzzyocr.cf:
> 
> #  0 = always cleanup
> #  1 = keep only if error
> #  2 = always keep
> focr_keep_bad_images 0
> 
> Mine was set to 1 by default, to keep bad images.  I set it 
> to 0 but it
> still is keeping bad images. (If what is in the dirs is bad 
> images, when
> I send a GIF I created no tmp dirs are left, if I send one of 
> the images
> left from one of the dirs from before it leaves it)
> 
> 
> 
> > -Original Message-
> > From: Duncan, Brian M.
> > Sent: Thursday, October 19, 2006 8:27 AM
> > To: users@spamassassin.apache.org
> > Subject: RE: tmp files being left over from FuzzyOCR?
> >
> > I just looked and have tmp dirs being created by FuzzyOCR -
> > with what looks like tmp files in those dirs.  No tmp files
> > in the root of /tmp
> >
> > It looks like certain images are causing FuzzyOCR to quit
> > proccessing messages in my case based on what I see in these
> > "dead" tmp dirs left behind.  It's only happening on certain
> > images it looks like. (only 3 tmp dirs left from all of 
> today so far)
> >
> > For the heck of it I took one of the graphic files in one of
> > those tmp dirs and put it inline and sent it in as a test.
> > It created a tmp dir did not remove it, and the message that
> > came in had a Spam score of 0 and it looks like Spam Assassin
> > died because of FuzzyOCR having an issue with the image. (I
> > have the patches applied to the couple of apps that
> > supposedly cause segfaults - so I am not sure what caused the issue)
> >
> > X-MailScanner-SpamCheck: not spam, SpamAssassin (not cached,
> >

Re: tmp files being left over from FuzzyOCR?

2006-10-19 Thread Bill
I'm using FuzzyOcr-2.3b and I can't find any reference to this option in
any of the FuzzyOCR software I downloaded.

focr_keep_bad_images 0

Here's a sample of the items in my /tmp folder. You said your's were
folders, mine's not. All of these files are left behind as at the time I
made this sample it was 9:25.

-rw---   1 mail mail 0 Oct 19 08:25 .spamassassin2053SgXN1Ktmp
-rw---   1 mail mail 0 Oct 19 08:31 .spamassassin2053uswH8Ttmp
-rw---   1 mail mail52 Oct 19 08:31 .spamassassin2053Wpxhuftmp
-rw---   1 mail mail  6549 Oct 19 08:31 .spamassassin26901cFX9cetmp
-rw---   1 mail mail 0 Oct 19 08:31 .spamassassin26901JsLOMYtmp
-rw---   1 mail mail  4267 Oct 19 08:36 .spamassassin300465gddzGtmp
-rw---   1 mail mail 0 Oct 19 08:31 .spamassassin30046lKDkjjtmp
-rw---   1 mail mail  3150 Oct 19 08:31 .spamassassin30046ptOrdstmp
-rw---   1 mail mail 0 Oct 19 08:36 .spamassassin30046xUScTotmp
-rw---   1 mail mail  4112 Oct 19 08:24 .spamassassin9067MfgpH3tmp


Here's a sample of a file.

  _(PICTURE)_   _  _   _  _ _   _ _ _   _ (PICTURE)_ _ _   _ _ _
__   _  _   _  _  (PICTURE) _  (PICTURE)__ _  _
_ _  __  _  _   _ _ _ _
An _nves_or A_ER_!!!   pE_RoSU_ DR_rr__C (pSUD )
 '_ UU_ _ L Y_L LdUY_ _L __ _ L_ __ UU_L_
_Ld! L WdLL_ ___ _ LUUdY _  ' ' ' '' ' ' '
 ' ' ' ' ' ' ' '' ' ' ''' ' ' ' ' ' ' '' '' ' ' ' '' ' ' ' '' ' ' '' ' ' ' '
' '' ' '' ' '_    '
  \  petro5un, Incorporeted (p5UD _ _ews) ennounced thet
rU_ L_ _d__ _Y_ ___ _ ___ _L W_L_ _ ___LL_ d
_ _ __! _ _ __ _ _ _ _ __
,



  Bill






- Original Message - 
From: Duncan, Brian M.
To: users@spamassassin.apache.org
Sent: Thursday, October 19, 2006 8:42 AM
Subject: RE: tmp files being left over from FuzzyOCR?



I noticed that there is this directive in the fuzzyocr.cf:

#  0 = always cleanup
#  1 = keep only if error
#  2 = always keep
focr_keep_bad_images 0

Mine was set to 1 by default, to keep bad images.  I set it to 0 but it
still is keeping bad images. (If what is in the dirs is bad images, when
I send a GIF I created no tmp dirs are left, if I send one of the images
left from one of the dirs from before it leaves it)



> -Original Message-
> From: Duncan, Brian M.
> Sent: Thursday, October 19, 2006 8:27 AM
> To: users@spamassassin.apache.org
> Subject: RE: tmp files being left over from FuzzyOCR?
>
> I just looked and have tmp dirs being created by FuzzyOCR -
> with what looks like tmp files in those dirs.  No tmp files
> in the root of /tmp
>
> It looks like certain images are causing FuzzyOCR to quit
> proccessing messages in my case based on what I see in these
> "dead" tmp dirs left behind.  It's only happening on certain
> images it looks like. (only 3 tmp dirs left from all of today so far)
>
> For the heck of it I took one of the graphic files in one of
> those tmp dirs and put it inline and sent it in as a test.
> It created a tmp dir did not remove it, and the message that
> came in had a Spam score of 0 and it looks like Spam Assassin
> died because of FuzzyOCR having an issue with the image. (I
> have the patches applied to the couple of apps that
> supposedly cause segfaults - so I am not sure what caused the issue)
>
> X-MailScanner-SpamCheck: not spam, SpamAssassin (not cached,
> score=0, required 6.5, autolearn=)
>
> From the header it looks like SpamAssassin did not finish
> processing the message because of FuzzyOCR.  I better look
> into that now.
>
>
> > -Original Message-
> > From: Bill [mailto:[EMAIL PROTECTED]
> > Sent: Thursday, October 19, 2006 8:03 AM
> > To: users@spamassassin.apache.org
> > Subject: tmp files being left over from FuzzyOCR?
> >
> > Since I installed FuzzyOCR I've noticed I'm having a
> lot of files
> > named similar to  .spamassassin8932mZBFrtmp  left in my
> /tmp folder.
> > These are from FuzzyOCR, correct? The content of these
> files has lots
> > of spaces, hyphens, commas with a few readable words and the word
> > "picture" a few times.
> >
> > Is there something I need to do to ensure these files
> are removed?
> > After I manually remove them I see new tmp files being created and
> > removed but sometimes a file is NOT removed.
> >
> >   Bill
> >

RE: tmp files being left over from FuzzyOCR?

2006-10-19 Thread Duncan, Brian M.

I noticed that there is this directive in the fuzzyocr.cf:

#  0 = always cleanup
#  1 = keep only if error
#  2 = always keep
focr_keep_bad_images 0

Mine was set to 1 by default, to keep bad images.  I set it to 0 but it
still is keeping bad images. (If what is in the dirs is bad images, when
I send a GIF I created no tmp dirs are left, if I send one of the images
left from one of the dirs from before it leaves it)



> -Original Message-
> From: Duncan, Brian M.
> Sent: Thursday, October 19, 2006 8:27 AM
> To: users@spamassassin.apache.org
> Subject: RE: tmp files being left over from FuzzyOCR?
>
> I just looked and have tmp dirs being created by FuzzyOCR -
> with what looks like tmp files in those dirs.  No tmp files
> in the root of /tmp
>
> It looks like certain images are causing FuzzyOCR to quit
> proccessing messages in my case based on what I see in these
> "dead" tmp dirs left behind.  It's only happening on certain
> images it looks like. (only 3 tmp dirs left from all of today so far)
>
> For the heck of it I took one of the graphic files in one of
> those tmp dirs and put it inline and sent it in as a test. 
> It created a tmp dir did not remove it, and the message that
> came in had a Spam score of 0 and it looks like Spam Assassin
> died because of FuzzyOCR having an issue with the image. (I
> have the patches applied to the couple of apps that
> supposedly cause segfaults - so I am not sure what caused the issue)
>
> X-MailScanner-SpamCheck: not spam, SpamAssassin (not cached,
>   score=0, required 6.5, autolearn=)
>
> From the header it looks like SpamAssassin did not finish
> processing the message because of FuzzyOCR.  I better look
> into that now.
>
>
> > -Original Message-
> > From: Bill [mailto:[EMAIL PROTECTED]
> > Sent: Thursday, October 19, 2006 8:03 AM
> > To: users@spamassassin.apache.org
> > Subject: tmp files being left over from FuzzyOCR?
> >
> > Since I installed FuzzyOCR I've noticed I'm having a
> lot of files
> > named similar to  .spamassassin8932mZBFrtmp  left in my
> /tmp folder.
> > These are from FuzzyOCR, correct? The content of these
> files has lots
> > of spaces, hyphens, commas with a few readable words and the word
> > "picture" a few times.
> >
> > Is there something I need to do to ensure these files
> are removed?
> > After I manually remove them I see new tmp files being created and
> > removed but sometimes a file is NOT removed.
> >
> >   Bill
> >
> >

===
CIRCULAR 230 DISCLOSURE: Pursuant to Regulations Governing Practice Before the 
Internal Revenue Service, any tax advice contained herein is not intended or 
written to be used and cannot be used by a taxpayer for the purpose of avoiding 
tax penalties that may be imposed on the taxpayer.
===
CONFIDENTIALITY NOTICE:
This electronic mail message and any attached files contain information 
intended for the exclusive use of the individual or entity to whom it is 
addressed and may contain information that is proprietary, privileged, 
confidential and/or exempt from disclosure under applicable law.  If you are 
not the intended recipient, you are hereby notified that any viewing, copying, 
disclosure or distribution of this information may be subject to legal 
restriction or sanction.  Please notify the sender, by electronic mail or 
telephone, of any unintended recipients and delete the original message without 
making any copies.
===
NOTIFICATION:  Katten Muchin Rosenman LLP is an Illinois limited liability 
partnership that has elected to be governed by the Illinois Uniform Partnership 
Act (1997).
===


RE: tmp files being left over from FuzzyOCR?

2006-10-19 Thread Duncan, Brian M.

I just looked and have tmp dirs being created by FuzzyOCR - with what
looks like tmp files in those dirs.  No tmp files in the root of /tmp

It looks like certain images are causing FuzzyOCR to quit proccessing
messages in my case based on what I see in these "dead" tmp dirs left
behind.  It's only happening on certain images it looks like. (only 3
tmp dirs left from all of today so far)

For the heck of it I took one of the graphic files in one of those tmp
dirs and put it inline and sent it in as a test.  It created a tmp dir
did not remove it, and the message that came in had a Spam score of 0
and it looks like Spam Assassin died because of FuzzyOCR having an issue
with the image. (I have the patches applied to the couple of apps that
supposedly cause segfaults - so I am not sure what caused the issue)

X-MailScanner-SpamCheck: not spam, SpamAssassin (not cached,
score=0, required 6.5, autolearn=)

>From the header it looks like SpamAssassin did not finish processing the
message because of FuzzyOCR.  I better look into that now.


> -Original Message-
> From: Bill [mailto:[EMAIL PROTECTED]
> Sent: Thursday, October 19, 2006 8:03 AM
> To: users@spamassassin.apache.org
> Subject: tmp files being left over from FuzzyOCR?
>
> Since I installed FuzzyOCR I've noticed I'm having a lot
> of files named similar to  .spamassassin8932mZBFrtmp  left in
> my /tmp folder. These are from FuzzyOCR, correct? The content
> of these files has lots of spaces, hyphens, commas with a few
> readable words and the word "picture" a few times.
>
> Is there something I need to do to ensure these files are
> removed? After I manually remove them I see new tmp files
> being created and removed but sometimes a file is NOT removed.
>
>   Bill
>
>

===
CIRCULAR 230 DISCLOSURE: Pursuant to Regulations Governing Practice Before the 
Internal Revenue Service, any tax advice contained herein is not intended or 
written to be used and cannot be used by a taxpayer for the purpose of avoiding 
tax penalties that may be imposed on the taxpayer.
===
CONFIDENTIALITY NOTICE:
This electronic mail message and any attached files contain information 
intended for the exclusive use of the individual or entity to whom it is 
addressed and may contain information that is proprietary, privileged, 
confidential and/or exempt from disclosure under applicable law.  If you are 
not the intended recipient, you are hereby notified that any viewing, copying, 
disclosure or distribution of this information may be subject to legal 
restriction or sanction.  Please notify the sender, by electronic mail or 
telephone, of any unintended recipients and delete the original message without 
making any copies.
===
NOTIFICATION:  Katten Muchin Rosenman LLP is an Illinois limited liability 
partnership that has elected to be governed by the Illinois Uniform Partnership 
Act (1997).
===


Re: tmp files being left over from FuzzyOCR?

2006-10-19 Thread Chris Lear

* Bill wrote (19/10/06 14:03):

Since I installed FuzzyOCR I've noticed I'm having a lot of files named
similar to  .spamassassin8932mZBFrtmp  left in my /tmp folder. These are
from FuzzyOCR, correct? The content of these files has lots of spaces,
hyphens, commas with a few readable words and the word "picture" a few
times.

Is there something I need to do to ensure these files are removed? After
I manually remove them I see new tmp files being created and removed but
sometimes a file is NOT removed.


I suspect that if you look in your FuzzyOCR log, you will find errors 
that match the unremoved temp files.


Eg from my FuzzyOCR.log:

[2006-10-18 10:10:47] Unexpected error in pipe to external programs.
  Please check that all helper programs are 
installed and in the correct path.
  (Pipe Command "/usr/bin/gifasm -d 
/tmp/.spamassassin2591CHsvrEtmp/out", Pipe exit code 1 (""), Temporary 
file: "/tmp/.spamassassin2591dNqOn7tmp")


I see that /tmp/.spamassassin2591CHsvrEtmp/ is still there, but 
/tmp/.spamassassin2591dNqOn7tmp isn't.


And another example:

[2006-10-18 09:34:24] FuzzyOcr received timeout after running "10" seconds.

#ls -l /tmp/.spamassassin* | grep 09:34
-rw---  1 spamd users 0 Oct 18 09:34 /tmp/.spamassassin2589Wc3z7Gtmp
-rw---  1 spamd users 23579 Oct 18 09:34 /tmp/.spamassassin2589yvpP1Htmp


Looks like when gifasm fails, you get a dir left over. If there's a 
timeout, you get a file left over.


Chris


tmp files being left over from FuzzyOCR?

2006-10-19 Thread Bill
Since I installed FuzzyOCR I've noticed I'm having a lot of files named
similar to  .spamassassin8932mZBFrtmp  left in my /tmp folder. These are
from FuzzyOCR, correct? The content of these files has lots of spaces,
hyphens, commas with a few readable words and the word "picture" a few
times.

Is there something I need to do to ensure these files are removed? After
I manually remove them I see new tmp files being created and removed but
sometimes a file is NOT removed.

  Bill



Re: CPU usage very high (spamd) (was Re: relocating tmp files?)

2004-10-21 Thread email builder
Hooray!  Just when I was thinking about how to start another thread asking the 
same
question...  I am pleased that this thread can continue.  :)

Read below...

> > Thanks so much.  Unfortunately, I don't see much change in my CPU usage by
> spamd.  I am
> > at a loss, as I've spent almost an entire day reading old mailing list
> threads and the
> > wiki, but no one has seemed to post anything concrete as to why spamd
> would eat so much
> > CPU (in my case, it spikes to as much as 35%, so running 5 max children
> really taxes my
> 
> I think there are several theories on SA processor usage.
> 
> 1.bayes expire

I have not been using Bayes due to what appears to be such a resource problem 
with SA
that if I add the time on top of that for Bayes, my mail server will start 
backing up
email.  I would *like* to figure out what is consuming so much for spamd so I 
can
possibly turn Bayes on again.

> 2.bayes expire jorrnal bug

Is this bug fixed in development versions of the software?

> 3.Some rule with something like an .* in it that is looping like mad on
> certain text patterns.

I only use stock rules, is this a possibility with what I get with the standard 
SA
download?

> 4.Something else.

:)

> If you are getting suddenly huge bayes journals along with your processor
> usage, it is #2.

I use SQL-based Bayes, but again, I have autolearn temporarily turned off.

> If SA is taking 2-5 minutes randomly, and repeating on the same mail message
> doesn't have the same result, it may be #1.

Times are not out of the ordinary I think (at the busiest of times, they range 
from
almost as low as one second to typically 1 minute at the highest).  The problem 
merely
appears to be that spamd eats up so much of the CPU that I can do little else 
for fear
that the box will collapse into flames.

> If SA is taking a long time to process a specific message every time, it is
> probably #3.

I don't *think* this is a problem, but haven't been watching specific messages. 
 Is this
problem a possibility on a 100% stock ruleset?
 
> The first thing is to try to determine whether you are seeing spikes or
> constant heavy load.  I think you are implying that you see spikes.

Sorry, I am saying that I see a consistently high load (spamd bounces around 
between 5%
and 40% CPU... I only meant to use the word spike to say that spamd children 
usually
average 20%/30% but that almost 50% is only a bit unusual).  Again, the problem 
seems to
be a fairly consistent high CPU usage by spamd.  A sample from top:

 PID USER  PR  NI  VIRT  RES  SHR S %CPU %MEMTIME+  COMMAND
1401 maildrop  16   0 39744  34m 6840 R 28.3  3.4   3:04.18 spamd
 
And this is how I start spamd:

LANG=en_US; export LANG; TMPDIR=/tmp/spamassassin; export TMPDIR
spamd -d -q -x --max-children=5 -H /etc/razor -u maildrop -r 
/var/run/spamd/spamd.pid

/tmp/spamassassin is mounted with tmpfs

we also run named on the same machine

if it's important, this is 3.0rc5, downloaded and compiled manually (not a CPAN 
install)

also, this is a Fedora Core 2 machine (2.8P-IV hyperthreaded, 1GB RAM)

spamc is called from maildrop as such:

if ( $SIZE < 262144 )
{
   exception {
  xfilter "/usr/bin/spamc -u $LOGNAME"
   }
}


Any advice or even just pointers on any more reading I can do would be highly
appreciated!
 
TIA!

> The next thing to do is to try to figure out of the spikes correlate with
> certain messages, with certian times, or occur randomly.  You can also check
> for the bayes journal bug fairly easily.
> 
> If you end up with a particular message that eats the processor alive, you
> probably also have a bad rule that is the real culprit.  DProf I believe it
> is should perhaps be able to track the problem down to the rule if you have
> an example message.
> 
> Loren




___
Do you Yahoo!?
Declare Yourself - Register online to vote today!
http://vote.yahoo.com


Re: CPU usage very high (spamd) (was Re: relocating tmp files?)

2004-10-16 Thread Loren Wilton
> Thanks so much.  Unfortunately, I don't see much change in my CPU usage by
spamd.  I am
> at a loss, as I've spent almost an entire day reading old mailing list
threads and the
> wiki, but no one has seemed to post anything concrete as to why spamd
would eat so much
> CPU (in my case, it spikes to as much as 35%, so running 5 max children
really taxes my

I think there are several theories on SA processor usage.

1.bayes expire
2.bayes expire jorrnal bug
3.Some rule with something like an .* in it that is looping like mad on
certain text patterns.
4.Something else.

If you are getting suddenly huge bayes journals along with your processor
usage, it is #2.
If SA is taking 2-5 minutes randomly, and repeating on the same mail message
doesn't have the same result, it may be #1.
If SA is taking a long time to process a specific message every time, it is
probably #3.

The first thing is to try to determine whether you are seeing spikes or
constant heavy load.  I think you are implying that you see spikes.

The next thing to do is to try to figure out of the spikes correlate with
certain messages, with certian times, or occur randomly.  You can also check
for the bayes journal bug fairly easily.

If you end up with a particular message that eats the processor alive, you
probably also have a bad rule that is the real culprit.  DProf I believe it
is should perhaps be able to track the problem down to the rule if you have
an example message.

Loren



CPU usage very high (spamd) (was Re: relocating tmp files?)

2004-10-15 Thread email builder

Thanks so much.  Unfortunately, I don't see much change in my CPU usage by 
spamd.  I am
at a loss, as I've spent almost an entire day reading old mailing list threads 
and the
wiki, but no one has seemed to post anything concrete as to why spamd would eat 
so much
CPU (in my case, it spikes to as much as 35%, so running 5 max children really 
taxes my
machine (2.8GHtz!).  I run a local DNS server and as I am running Fedora, I set 
the
language when I start spamd:

LANG=en_US; export LANG; TMPDIR=/tmp/spamassassin; export TMPDIR
spamd -d -q -x --max-children=5 -H /etc/razor -u maildrop -r 
/var/run/spamd/spamd.pid

With a ramdisk now for DCC temp files, I am at an absolute loss as to what else 
I can
change to get spamd to cooperate.  Anyone have any advice??  TIA!


> 64MB should be plenty -- those files shouldn't stick around once
> a scan has completed, and they should all be 250KB or less in
> size.
> 
> In my experience tmpfs helps quite a lot, but in this case it's
> a trivial amount of the overall total runtime.
>
> > > > I have been under the apparently false presumption that spamd prcessed
> > > > its messages in memory (perhaps this explains why each spamd process can
> > > > oft take up to 25% cpu?).  I recently looked in /tmp and found lots of
> > > > left over spamassassin..xx.tmp files (the first four x's are
> > > > numeric, the second set of x's are alpha-numeric).
> > > > 
> > > > I am interested in placing those files in a tmpfs, but do not want to
> > > > put my whole /tmp directory under tmpfs but I can't for the life of
> > > > me find any configuration settings that tell spamd where to place its
> > > > temp files.  Do I need to change something in the code itself?
> > > 
> > > nope -- setting the std env var TMPDIR  will do it.  those files are
> > > created if you're using Pyzor or DCC.
> > 
> > Thank you kindly!  
> > 
> > Might I ask if anyone else has gone with any type of ramdisk for this 
> > directory and
> if so
> > if it helped in any way (just five minutes running it so far; I don't see 
> > any
> significant
> > performance gain)?  I am wondering how to size it as well.  I have started 
> > out with a
> > 64MB tmpfs, but not sure if I'll need more.  I am tempted to size it 
> > similar to the
> > following recommendation from a how-to for creating a ramdisk for amavis:
> > 
> > n * (1 + max(expansionfactor)) * $message_size_limit
> > 
> > Thoughts anyone?  What else could be eating up so much CPU??
> > 
> > > > Pointers to anything I missed or other help greatly appreciated!  I am
> > > > running 3.0rc5 (as I understand it, only documentation changes were made
> > > > between rc5 and the official release), Fedora core 2, 2.8GHtz
> > > > hyperthreaded Pentium IV, 1 GB RAM, spamc out of Courier Maildrop to
> > > > spamd.  Spamd pounds my system all day long (cpu but not memory).  I run
> > > > max-children beween 5 and 8 (and all user prefs in (My)SQL, as well as
> > > > AWL and Bayes, although I cannot run bayes/awl without mail backing up,
> > > > as it appears to be too slow).


__
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 


Re: relocating tmp files?

2004-10-15 Thread Justin Mason
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1


64MB should be plenty -- those files shouldn't stick around once
a scan has completed, and they should all be 250KB or less in
size.

In my experience tmpfs helps quite a lot, but in this case it's
a trivial amount of the overall total runtime.

- --j.

email builder writes:
> > > I have been under the apparently false presumption that spamd prcessed
> > > its messages in memory (perhaps this explains why each spamd process can
> > > oft take up to 25% cpu?).  I recently looked in /tmp and found lots of
> > > left over spamassassin..xx.tmp files (the first four x's are
> > > numeric, the second set of x's are alpha-numeric).
> > > 
> > > I am interested in placing those files in a tmpfs, but do not want to
> > > put my whole /tmp directory under tmpfs but I can't for the life of
> > > me find any configuration settings that tell spamd where to place its
> > > temp files.  Do I need to change something in the code itself?
> > 
> > nope -- setting the std env var TMPDIR  will do it.  those files are
> > created if you're using Pyzor or DCC.
> 
> Thank you kindly!  
> 
> Might I ask if anyone else has gone with any type of ramdisk for this 
> directory and if so
> if it helped in any way (just five minutes running it so far; I don't see any 
> significant
> performance gain)?  I am wondering how to size it as well.  I have started 
> out with a
> 64MB tmpfs, but not sure if I'll need more.  I am tempted to size it similar 
> to the
> following recommendation from a how-to for creating a ramdisk for amavis:
> 
> n * (1 + max(expansionfactor)) * $message_size_limit
> 
> Thoughts anyone?  What else could be eating up so much CPU??
> 
> > > Pointers to anything I missed or other help greatly appreciated!  I am
> > > running 3.0rc5 (as I understand it, only documentation changes were made
> > > between rc5 and the official release), Fedora core 2, 2.8GHtz
> > > hyperthreaded Pentium IV, 1 GB RAM, spamc out of Courier Maildrop to
> > > spamd.  Spamd pounds my system all day long (cpu but not memory).  I run
> > > max-children beween 5 and 8 (and all user prefs in (My)SQL, as well as
> > > AWL and Bayes, although I cannot run bayes/awl without mail backing up,
> > > as it appears to be too slow).
> 
> __
> Do You Yahoo!?
> Tired of spam?  Yahoo! Mail has the best spam protection around 
> http://mail.yahoo.com 
-BEGIN PGP SIGNATURE-
Version: GnuPG v1.2.4 (GNU/Linux)
Comment: Exmh CVS

iD8DBQFBby+XMJF5cimLx9ARAl70AKCOgfAMORWE1NPaBQ4WtwvMK5n9TQCdHwZi
CNg1DhJSi/kE/GASRHYnoKE=
=1BQZ
-END PGP SIGNATURE-



Re: relocating tmp files?

2004-10-15 Thread email builder
> > I have been under the apparently false presumption that spamd prcessed
> > its messages in memory (perhaps this explains why each spamd process can
> > oft take up to 25% cpu?).  I recently looked in /tmp and found lots of
> > left over spamassassin..xx.tmp files (the first four x's are
> > numeric, the second set of x's are alpha-numeric).
> > 
> > I am interested in placing those files in a tmpfs, but do not want to
> > put my whole /tmp directory under tmpfs but I can't for the life of
> > me find any configuration settings that tell spamd where to place its
> > temp files.  Do I need to change something in the code itself?
> 
> nope -- setting the std env var TMPDIR  will do it.  those files are
> created if you're using Pyzor or DCC.

Thank you kindly!  

Might I ask if anyone else has gone with any type of ramdisk for this directory 
and if so
if it helped in any way (just five minutes running it so far; I don't see any 
significant
performance gain)?  I am wondering how to size it as well.  I have started out 
with a
64MB tmpfs, but not sure if I'll need more.  I am tempted to size it similar to 
the
following recommendation from a how-to for creating a ramdisk for amavis:

n * (1 + max(expansionfactor)) * $message_size_limit

Thoughts anyone?  What else could be eating up so much CPU??

> > Pointers to anything I missed or other help greatly appreciated!  I am
> > running 3.0rc5 (as I understand it, only documentation changes were made
> > between rc5 and the official release), Fedora core 2, 2.8GHtz
> > hyperthreaded Pentium IV, 1 GB RAM, spamc out of Courier Maildrop to
> > spamd.  Spamd pounds my system all day long (cpu but not memory).  I run
> > max-children beween 5 and 8 (and all user prefs in (My)SQL, as well as
> > AWL and Bayes, although I cannot run bayes/awl without mail backing up,
> > as it appears to be too slow).


__
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 


Re: relocating tmp files?

2004-10-15 Thread Justin Mason
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1


email builder writes:
> Greetings,
> 
> I have been under the apparently false presumption that spamd prcessed
> its messages in memory (perhaps this explains why each spamd process can
> oft take up to 25% cpu?).  I recently looked in /tmp and found lots of
> left over spamassassin..xx.tmp files (the first four x's are
> numeric, the second set of x's are alpha-numeric).
> 
> I am interested in placing those files in a tmpfs, but do not want to
> put my whole /tmp directory under tmpfs but I can't for the life of
> me find any configuration settings that tell spamd where to place its
> temp files.  Do I need to change something in the code itself?

nope -- setting the std env var TMPDIR  will do it.  those files are
created if you're using Pyzor or DCC.

> Pointers to anything I missed or other help greatly appreciated!  I am
> running 3.0rc5 (as I understand it, only documentation changes were made
> between rc5 and the official release), Fedora core 2, 2.8GHtz
> hyperthreaded Pentium IV, 1 GB RAM, spamc out of Courier Maildrop to
> spamd.  Spamd pounds my system all day long (cpu but not memory).  I run
> max-children beween 5 and 8 (and all user prefs in (My)SQL, as well as
> AWL and Bayes, although I cannot run bayes/awl without mail backing up,
> as it appears to be too slow).

- --j.
-BEGIN PGP SIGNATURE-
Version: GnuPG v1.2.4 (GNU/Linux)
Comment: Exmh CVS

iD8DBQFBbxt0MJF5cimLx9ARAtUVAKCO9t2+VLvXnz+yjCTLdedlmUDxdACdHl7i
0eQpDSVyh6F3HBRJlt/+qxI=
=NJ19
-END PGP SIGNATURE-



relocating tmp files?

2004-10-14 Thread email builder
Greetings,

  I have been under the apparently false presumption that spamd prcessed its 
messages in
memory (perhaps this explains why each spamd process can oft take up to 25% 
cpu?).  I
recently looked in /tmp and found lots of left over 
spamassassin..xx.tmp files
(the first four x's are numeric, the second set of x's are alpha-numeric).

  I am interested in placing those files in a tmpfs, but do not want to put my 
whole /tmp
directory under tmpfs but I can't for the life of me find any configuration 
settings
that tell spamd where to place its temp files.  Do I need to change something 
in the code
itself?

  Pointers to anything I missed or other help greatly appreciated!  I am 
running 3.0rc5
(as I understand it, only documentation changes were made between rc5 and the 
official
release), Fedora core 2, 2.8GHtz hyperthreaded Pentium IV, 1 GB RAM, spamc out 
of Courier
Maildrop to spamd.  Spamd pounds my system all day long (cpu but not memory).  
I run
max-children beween 5 and 8 (and all user prefs in (My)SQL, as well as AWL and 
Bayes,
although I cannot run bayes/awl without mail backing up, as it appears to be 
too slow).

Thanks!



__
Do you Yahoo!?
Yahoo! Mail Address AutoComplete - You start. We finish.
http://promotions.yahoo.com/new_mail