feeding spam messages for training

2004-11-19 Thread Richard Harding
I am looking at getting messages together to train spamassassin and told 
users to forward me messages that are spam that still get through. Is 
this an ok method of collecting or will the fact that so many are 
forwarded messages throw off the training?

I have thought about setting up a specific mailox for them to sent to, 
but this will still be all forwarded messages.

Thanks for the tips.
Rick


Re: sa-learn problem

2004-11-19 Thread Bob Mortimer
On Friday 19 Nov 2004 22:08, Nix wrote:

> > Yes, I know what it means, but see below (I'm not sure exactly where the
> > request for  /usr/bin/perl5.8.3 is coming form or how I fix it) ...
>
> It's on the first line of the sa-learn script (which names the
> interpreter, i.e., perl, to use to interpret the script).

Ah, OK.

> > I'm still not sure where to go from here - I initially suspected that
> > spamassassin was compiled against perl5.8.3 and was therefore looking for
> > that ...
>
> Correct. (Except that it isn't compiled --- only spamc is a compiled
> program --- but that's a nit..)

Yes, I spose I knew that, but meant that I expected a reinstall using perl 
Makefile.pl, make, make install would sort that out ...

> >  which is why I recompiled it hoping that would fix things but it
> > hasn't.
>
> It certainly should have done. What does the top line of sa-learn read?

#!/usr/bin/perl5.8.3 -T -w
which is clearly where the problem lies.
Correcting that to #!/usr/bin/perl5.8.5 -T -w solves the problem but still 
leaves me wondering how it happened in the first place.

Thanks for the help!

-- 
Regards,

Bob


Re: sa-learn problem

2004-11-19 Thread Nix
On Fri, 19 Nov 2004, Bob Mortimer spake:
> On Friday 19 Nov 2004 17:40, you wrote:
> 
>> > when I try to sa-learn my ham/spam I get the following:
>> > /usr/bin/sa-learn: /usr/bin/perl5.8.3: bad interpreter: No such file
>> > or directory
>>
>> I've always thought that "No such file or directory" was one of the
>> clearest messages from a computer, but people ask all the time what it
>> means.
> 
> Yes, I know what it means, but see below (I'm not sure exactly where the 
> request for  /usr/bin/perl5.8.3 is coming form or how I fix it) ...

It's on the first line of the sa-learn script (which names the
interpreter, i.e., perl, to use to interpret the script).

> I'm still not sure where to go from here - I initially suspected that 
> spamassassin was compiled against perl5.8.3 and was therefore looking for 
> that ...

Correct. (Except that it isn't compiled --- only spamc is a compiled
program --- but that's a nit..)

>  which is why I recompiled it hoping that would fix things but it 
> hasn't.

It certainly should have done. What does the top line of sa-learn read?

-- 
`The sword we forged has turned upon us
 Only now, at the end of all things do we see
 The lamp-bearer dies; only the lamp burns on.'


Are theses error/warning messages OK ?

2004-11-19 Thread Rakotomandimby (R12y) Mihamina
Hello,
I just ran SA 2.64 on my Debian Testing (all installed via apt-get) on a
x86.

I got these error messages i dont know to interpret.
Would you help me to correctly set my system up ?

The way I run SA: Debian init script  
OPTIONS="-m 10 -a -H"
(-d is automatically added by the startup script)

Is there something I have to install more ?
-- 
ASPO Infogérance   http://aspo.rktmb.org/activites/infogerance
Unofficial FAQ fcolc   http://faq.fcolc.eu.org/
LUG sur Orléans et alentours.
Tél : 02 38 76 43 65 (France)
: ^I(Can't locate object method "check_for_very_long_text" via package 
"Mail::SpamAssassin::PerMsgStatus" at 
/usr/share/perl5/Mail/SpamAssassin/PerMsgStatus.pm line 2293,  line 75. 
) 
Nov 19 21:50:03 hosting13 spamd[21333]: Subroutine YAHOO_REDIR_uri_test 
redefined at /usr/share/spamassassin/20_uri_tests.cf, rule YAHOO_REDIR, line 5. 
Nov 19 21:50:03 hosting13 spamd[21333]: Subroutine NORMAL_HTTP_TO_IP_uri_test 
redefined at /etc/spamassassin/20_uri_tests.cf, rule NORMAL_HTTP_TO_IP, line 8. 
Nov 19 21:50:03 hosting13 spamd[21333]: Subroutine 
MAILTO_WITH_SUBJ_REMOVE_uri_test redefined at 
/etc/spamassassin/20_uri_tests.cf, rule MAILTO_WITH_SUBJ_REMOVE, line 8. 
Nov 19 21:50:03 hosting13 spamd[21333]: Subroutine PORN_4_uri_test redefined at 
/etc/spamassassin/20_uri_tests.cf, rule PORN_4, line 8. 
Nov 19 21:50:03 hosting13 spamd[21333]: Subroutine 
WWW_NETSITESFORFREE_NET_uri_test redefined at 
/etc/spamassassin/20_uri_tests.cf, rule WWW_NETSITESFORFREE_NET, line 8. 
Nov 19 21:50:03 hosting13 spamd[21333]: Subroutine REMOVE_PAGE_uri_test 
redefined at /etc/spamassassin/20_uri_tests.cf, rule REMOVE_PAGE, line 8. 
Nov 19 21:50:03 hosting13 spamd[21333]: Subroutine WWW_TRAFFICWOW_NET_uri_test 
redefined at /etc/spamassassin/20_uri_tests.cf, rule WWW_TRAFFICWOW_NET, line 
8. 
Nov 19 21:50:03 hosting13 spamd[21333]: Subroutine BTAMAIL_URL_uri_test 
redefined at /usr/share/spamassassin/20_uri_tests.cf, rule BTAMAIL_URL, line 8. 
Nov 19 21:50:03 hosting13 spamd[21333]: Subroutine MAILTO_WITH_SUBJ_uri_test 
redefined at /etc/spamassassin/20_uri_tests.cf, rule MAILTO_WITH_SUBJ, line 8. 
Nov 19 21:50:03 hosting13 spamd[21333]: Subroutine URI_FREEHT_uri_test 
redefined at /usr/share/spamassassin/20_uri_tests.cf, rule URI_FREEHT, line 8. 
Nov 19 21:50:03 hosting13 spamd[21333]: Subroutine RATWARE_STORM_URI_uri_test 
redefined at /usr/share/spamassassin/20_ratware.cf, rule RATWARE_STORM_URI, 
line 8. 
Nov 19 21:50:03 hosting13 spamd[21333]: Subroutine DAILY_PL_uri_test redefined 
at /usr/share/spamassassin/20_uri_tests.cf, rule DAILY_PL, line 8. 
Nov 19 21:50:03 hosting13 spamd[21333]: Subroutine HTTP_NUMBER_WORD_uri_test 
redefined at /etc/spamassassin/20_uri_tests.cf, rule HTTP_NUMBER_WORD, line 8. 
Nov 19 21:50:03 hosting13 spamd[21333]: Subroutine BARGAIN_URL_uri_test 
redefined at /usr/share/spamassassin/20_uri_tests.cf, rule BARGAIN_URL, line 8. 
Nov 19 21:50:03 hosting13 spamd[21333]: Subroutine IP_LINK_PLUS_uri_test 
redefined at /usr/share/spamassassin/20_uri_tests.cf, rule IP_LINK_PLUS, line 
8. 
Nov 19 21:50:03 hosting13 spamd[21333]: Subroutine 
LONG_NUMERIC_HTTP_ADDR_uri_test redefined at /etc/spamassassin/20_uri_tests.cf, 
rule LONG_NUMERIC_HTTP_ADDR, line 8. 
Nov 19 21:50:03 hosting13 spamd[21333]: Subroutine NUMERIC_HTTP_ADDR_uri_test 
redefined at /etc/spamassassin/20_uri_tests.cf, rule NUMERIC_HTTP_ADDR, line 8. 
Nov 19 21:50:03 hosting13 spamd[21333]: Subroutine CHINA_URL_uri_test redefined 
at /usr/share/spamassassin/20_uri_tests.cf, rule CHINA_URL, line 8. 
Nov 19 21:50:03 hosting13 spamd[21333]: Subroutine YELLOWSUN_uri_test redefined 
at /etc/spamassassin/20_uri_tests.cf, rule YELLOWSUN, line 8. 
Nov 19 21:50:03 hosting13 spamd[21333]: Subroutine WWW_CLIK4YOU_COM_uri_test 
redefined at /etc/spamassassin/20_uri_tests.cf, rule WWW_CLIK4YOU_COM, line 8. 
Nov 19 21:50:03 hosting13 spamd[21333]: Subroutine URI_IS_POUND_uri_test 
redefined at /etc/spamassassin/20_uri_tests.cf, rule URI_IS_POUND, line 8. 
Nov 19 21:50:03 hosting13 spamd[21333]: Subroutine JAVASCRIPT_URI_uri_test 
redefined at /etc/spamassassin/20_uri_tests.cf, rule JAVASCRIPT_URI, line 8. 
Nov 19 21:50:03 hosting13 spamd[21333]: Subroutine WEB4PORNO_URL_uri_test 
redefined at /etc/spamassassin/20_uri_tests.cf, rule WEB4PORNO_URL, line 8. 
Nov 19 21:50:03 hosting13 spamd[21333]: Subroutine 
HTTP_EXCESSIVE_ESCAPES_uri_test redefined at 
/usr/share/spamassassin/20_uri_tests.cf, rule HTTP_EXCESSIVE_ESCAPES, line 8. 
Nov 19 21:50:03 hosting13 spamd[21333]: Subroutine MAILTO_TO_SPAM_ADDR_uri_test 
redefined at /etc/spamassassin/20_uri_tests.cf, rule MAILTO_TO_SPAM_ADDR, line 
8. 
Nov 19 21:50:03 hosting13 spamd[21333]: Failed to compile URI SpamAssassin 
tests, skipping: ^I(Illegal declaration of anonymous subroutine at 
/etc/spamassassin/20_uri_tests.cf, rule MAILTO_TO_SPAM_ADDR, line 8. ) 
Nov 19 21:50:03 hosting13 spamd[21333]: Failed to run BASE64_ENC_TEXT 
SpamAssassin test, skipping: ^I(Can't locate object method 
"che

Re: Spamassassin runs slow with perl from source

2004-11-19 Thread Michael Parker
On Fri, Nov 19, 2004 at 11:30:41AM -0800, Bryan Koschmann - GKT wrote:
> 
> Both were perl 5.8.4. I cannot seem to track down what is causing this.
> All the perl modules are the same as well. Is there some specific option
> that perl needs to be compiled with to make this run properly?
> 

Possibly due to difference in your network setup.  Try re-running your
tests with -L (local tests only) and see how it goes.

Michael


pgpZGlxbzfzVy.pgp
Description: PGP signature


Re: Spamassassin runs slow with perl from source

2004-11-19 Thread Justin Mason
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1


check the settings for MULTIPLICITY, use of a perl shared lib, and the
various threading options.  turning those on will slow things down quite a
lot.

- --j.

Bryan Koschmann - GKT writes:
> Hello,
> 
> I'm running a Slack 10 system with mostly self-compiled software. Using
> SA version 3.0.1. Athlon 2600 w/ 512meg + 120gb SATA harddrive.
> 
> Here is my problem. I downloaded and compiled a new version of perl.
> running spamassassin against the sample-spam.txt and sample-nospam.txt
> files yields scan times that are far more than they should:
> 
> $ time spamassassin -t < sample-spam.txt > /dev/null
> 
> real0m1.056s
> 
> Now, I had this exact problem on another machine, identical setup. I
> simply removed the new perl, installed the package from slackware, and it
> runs much faster:
> 
> $ time spamassassin -t < sample-spam.txt > /dev/null
> 
> real0m0.518s
> 
> spamc has the same problem:
> 
> Packaged perl:
> 
> time spamc < sample-spam.txt > /dev/null
> 
> real0m0.046s
> 
> perl from source:
> 
> $ time spamc < sample-spam.txt > /dev/null
> 
> real0m1.190s
> 
> Both were perl 5.8.4. I cannot seem to track down what is causing this.
> All the perl modules are the same as well. Is there some specific option
> that perl needs to be compiled with to make this run properly?
> 
> Thanks in advance,
> 
>   Bryan
-BEGIN PGP SIGNATURE-
Version: GnuPG v1.2.4 (GNU/Linux)
Comment: Exmh CVS

iD8DBQFBnkn6MJF5cimLx9ARAuXCAKCvn/ej5IOL4gn+nEN3obWmRhFVTQCbBVYJ
7dQoNbHKHBZ6lg/T8MSYX+A=
=Pck0
-END PGP SIGNATURE-



Spamassassin runs slow with perl from source

2004-11-19 Thread Bryan Koschmann - GKT
Hello,

I'm running a Slack 10 system with mostly self-compiled software. Using
SA version 3.0.1. Athlon 2600 w/ 512meg + 120gb SATA harddrive.

Here is my problem. I downloaded and compiled a new version of perl.
running spamassassin against the sample-spam.txt and sample-nospam.txt
files yields scan times that are far more than they should:

$ time spamassassin -t < sample-spam.txt > /dev/null

real0m1.056s

Now, I had this exact problem on another machine, identical setup. I
simply removed the new perl, installed the package from slackware, and it
runs much faster:

$ time spamassassin -t < sample-spam.txt > /dev/null

real0m0.518s

spamc has the same problem:

Packaged perl:

time spamc < sample-spam.txt > /dev/null

real0m0.046s


perl from source:

$ time spamc < sample-spam.txt > /dev/null

real0m1.190s


Both were perl 5.8.4. I cannot seem to track down what is causing this.
All the perl modules are the same as well. Is there some specific option
that perl needs to be compiled with to make this run properly?

Thanks in advance,

Bryan



exit0 - Rules_du_jour

2004-11-19 Thread Scott Wolfe
I am trying to get to www.exit0.us to download Rules_du_jour for a new SA
install but am unable to get to that web site. Is there a new place to get
that new script?

-Scott




Re: sa-learn problem

2004-11-19 Thread Bob Mortimer
On Friday 19 Nov 2004 17:40, you wrote:

> > when I try to sa-learn my ham/spam I get the following:
> > /usr/bin/sa-learn: /usr/bin/perl5.8.3: bad interpreter: No such file
> > or directory
>
> I've always thought that "No such file or directory" was one of the
> clearest messages from a computer, but people ask all the time what it
> means.

Yes, I know what it means, but see below (I'm not sure exactly where the 
request for  /usr/bin/perl5.8.3 is coming form or how I fix it) ...

> Bob says:
> > I think the problem has arisen since upgrading my Mandrake system from
> > 10.0 to 10.1, it looks like 10.1 uses Perl 5.8.5, I think 10.0 was on
> > 5.8.3
>
> I would bet that /usr/bin/perl5.8.3 does not exist after the upgrade,
> and possibly /usr/bin/perl5.8.5 might from the information you gave.
> Also, I would imagine that their is either a hard or soft link in
> /usr/bin that points /usr/bin/perl to whatever version you currently
> have installed, and that might be better to use vs one that is specific
> to the exact version you are running.

Like I said, I'm at the limits of my talent here ;-)
Here's an ls -l /usr/bin/perl*
lrwxrwxrwx  1 root root 5 Nov 12 06:24 /usr/bin/perl -> perl5*
lrwxrwxrwx  1 root root 9 Nov 12 06:24 /usr/bin/perl5 -> perl5.8.5*
-rwxr-xr-x  2 root root 11132 Oct 27 21:36 /usr/bin/perl5.8.5*

I'm still not sure where to go from here - I initially suspected that 
spamassassin was compiled against perl5.8.3 and was therefore looking for 
that ... which is why I recompiled it hoping that would fix things but it 
hasn't.

I fully accept that I'm probably being dim here but would be grateful if you'd 
humour me ;-)

(apols for message sent to personal address  clicked wrong button)

-- 
Regards,

Bob


Re: sa-learn problem

2004-11-19 Thread Bob Mortimer
On Friday 19 Nov 2004 17:40, Michael Barnes wrote:

> > when I try to sa-learn my ham/spam I get the following:
> > /usr/bin/sa-learn: /usr/bin/perl5.8.3: bad interpreter: No such file
> > or directory
>
> I've always thought that "No such file or directory" was one of the
> clearest messages from a computer, but people ask all the time what it
> means.

Yes, I know what it means, but see below (I'm not sure exactly where the 
request for  /usr/bin/perl5.8.3 is coming form or how I fix it) ...

> Bob says:
> > I think the problem has arisen since upgrading my Mandrake system from
> > 10.0 to 10.1, it looks like 10.1 uses Perl 5.8.5, I think 10.0 was on
> > 5.8.3
>
> I would bet that /usr/bin/perl5.8.3 does not exist after the upgrade,
> and possibly /usr/bin/perl5.8.5 might from the information you gave.
> Also, I would imagine that their is either a hard or soft link in
> /usr/bin that points /usr/bin/perl to whatever version you currently
> have installed, and that might be better to use vs one that is specific
> to the exact version you are running.

Like I said, I'm at the limits of my talent here ;-)
Here's an ls -l /usr/bin/perl*
lrwxrwxrwx  1 root root 5 Nov 12 06:24 /usr/bin/perl -> perl5*
lrwxrwxrwx  1 root root 9 Nov 12 06:24 /usr/bin/perl5 -> perl5.8.5*
-rwxr-xr-x  2 root root 11132 Oct 27 21:36 /usr/bin/perl5.8.5*

I'm still not sure where to go from here - I initially suspected that 
spamassassin was compiled against perl5.8.3 and was therefore looking for 
that ... which is why I recompiled it hoping that would fix things but it 
hasn't.

I fully accept that I'm probably being dim here, but would be grateful if 
you'd humour me ;-)

-- 
Regards,

Bob


spamd not reading local.cf

2004-11-19 Thread Greg Albrecht
spamd running as uid 'spamd', qmail-smtpd calling qmail-scanner, 
qmail-scanner calling spamd. (same problem if run as root).
sa 3.0.1.
freebsd 5.3-release, qmail 1.03, qmail-scanner 1.24.

synopsis:
if i pass the GTUBE message to SA via spamc, my localprefs are read and 
the message is treated as desired. however, if i pass the message to SA 
via spamd, the message is not treated as desired, namely, spamd ignores 
the rewrite_header and report_safe arguments. in fact, the message is 
marked as spam in the headers, but there is no indication in the body of 
the message that it is spam (regardless of my indicating report_safe 1 
or 2).

/etc/mail/spamassassin/local.cf:
rewrite_header Subject *SPAM*
report_safe 2 #i've tried 1, it doesn't work either.
spamd log attached.
thanks,
-xs
--
Greg Albrecht ([EMAIL PROTECTED])
www.undef.net * 1-415-794-5944
Nov 19 18:05:24 crane spamd[34824]: debug: daemonized. 
Nov 19 18:05:24 crane spamd[34824]: debug: Preloading modules with 
HOME=/tmp/spamd-34824-init 
Nov 19 18:05:24 crane spamd[34824]: debug: ignore: test message to precompile 
patterns and load modules 
Nov 19 18:05:24 crane spamd[34824]: debug: using 
"/etc/mail/spamassassin/init.pre" for site rules init.pre 
Nov 19 18:05:24 crane spamd[34824]: debug: config: read file 
/etc/mail/spamassassin/init.pre 
Nov 19 18:05:24 crane spamd[34824]: debug: using 
"/usr/local/share/spamassassin" for default rules dir 
Nov 19 18:05:24 crane spamd[34824]: debug: config: read file 
/usr/local/share/spamassassin/10_misc.cf 
Nov 19 18:05:24 crane spamd[34824]: debug: config: read file 
/usr/local/share/spamassassin/20_anti_ratware.cf 
Nov 19 18:05:24 crane spamd[34824]: debug: config: read file 
/usr/local/share/spamassassin/20_body_tests.cf 
Nov 19 18:05:24 crane spamd[34824]: debug: config: read file 
/usr/local/share/spamassassin/20_compensate.cf 
Nov 19 18:05:24 crane spamd[34824]: debug: config: read file 
/usr/local/share/spamassassin/20_dnsbl_tests.cf 
Nov 19 18:05:25 crane spamd[34824]: debug: config: read file 
/usr/local/share/spamassassin/20_drugs.cf 
Nov 19 18:05:25 crane spamd[34824]: debug: config: read file 
/usr/local/share/spamassassin/20_fake_helo_tests.cf 
Nov 19 18:05:25 crane spamd[34824]: debug: config: read file 
/usr/local/share/spamassassin/20_head_tests.cf 
Nov 19 18:05:25 crane spamd[34824]: debug: config: read file 
/usr/local/share/spamassassin/20_html_tests.cf 
Nov 19 18:05:25 crane spamd[34824]: debug: config: read file 
/usr/local/share/spamassassin/20_meta_tests.cf 
Nov 19 18:05:25 crane spamd[34824]: debug: config: read file 
/usr/local/share/spamassassin/20_phrases.cf 
Nov 19 18:05:25 crane spamd[34824]: debug: config: read file 
/usr/local/share/spamassassin/20_porn.cf 
Nov 19 18:05:25 crane spamd[34824]: debug: config: read file 
/usr/local/share/spamassassin/20_ratware.cf 
Nov 19 18:05:25 crane spamd[34824]: debug: config: read file 
/usr/local/share/spamassassin/20_uri_tests.cf 
Nov 19 18:05:25 crane spamd[34824]: debug: config: read file 
/usr/local/share/spamassassin/23_bayes.cf 
Nov 19 18:05:25 crane spamd[34824]: debug: config: read file 
/usr/local/share/spamassassin/25_body_tests_es.cf 
Nov 19 18:05:25 crane spamd[34824]: debug: config: read file 
/usr/local/share/spamassassin/25_hashcash.cf 
Nov 19 18:05:25 crane spamd[34824]: debug: config: read file 
/usr/local/share/spamassassin/25_spf.cf 
Nov 19 18:05:25 crane spamd[34824]: debug: config: read file 
/usr/local/share/spamassassin/25_uribl.cf 
Nov 19 18:05:25 crane spamd[34824]: debug: config: read file 
/usr/local/share/spamassassin/30_text_de.cf 
Nov 19 18:05:25 crane spamd[34824]: debug: config: read file 
/usr/local/share/spamassassin/30_text_fr.cf 
Nov 19 18:05:25 crane spamd[34824]: debug: config: read file 
/usr/local/share/spamassassin/30_text_nl.cf 
Nov 19 18:05:25 crane spamd[34824]: debug: config: read file 
/usr/local/share/spamassassin/30_text_pl.cf 
Nov 19 18:05:25 crane spamd[34824]: debug: config: read file 
/usr/local/share/spamassassin/50_scores.cf 
Nov 19 18:05:25 crane spamd[34824]: debug: config: read file 
/usr/local/share/spamassassin/60_whitelist.cf 
Nov 19 18:05:25 crane spamd[34824]: debug: config: read file 
/usr/local/share/spamassassin/local.cf 
Nov 19 18:05:25 crane spamd[34824]: debug: using "/etc/mail/spamassassin" for 
site rules dir 
Nov 19 18:05:25 crane spamd[34824]: debug: config: read file 
/etc/mail/spamassassin/local.cf 
Nov 19 18:05:25 crane spamd[34824]: debug: plugin: loading 
Mail::SpamAssassin::Plugin::URIDNSBL from @INC 
Nov 19 18:05:25 crane spamd[34824]: debug: plugin: registered 
Mail::SpamAssassin::Plugin::URIDNSBL=HASH(0x83fb6c0) 
Nov 19 18:05:25 crane spamd[34824]: debug: plugin: loading 
Mail::SpamAssassin::Plugin::Hashcash from @INC 
Nov 19 18:05:25 crane spamd[34824]: debug: plugin: registered 
Mail::SpamAssassin::Plugin::Hashcash=HASH(0x8a3586c) 
Nov 19 18:05:25 crane spamd[34824]: debug: plugin: loading 
Mail::SpamAssassin::Plugin::SPF from @INC 
Nov 19 18:05:25 crane 

RE: Subject Rewrite not working

2004-11-19 Thread Elias Arends
For clarification then, I'm not running SA 3.0.0 (as you can see on the
third line of my original e-mail), so does this apply to older versions of
SA? From what I gather it doesn't.

-Original Message-
From: Jim Maul [mailto:[EMAIL PROTECTED]
Sent: Friday, November 19, 2004 1:46 PM
To: SpamAssassin (E-mail)
Subject: Re: Subject Rewrite not working


Elias Arends wrote:
> Platform: Solaris 8
> MTA: Postfix 2.1.5
> Software: SpamAssassin 2.64
> SQL: MySQL 4.0.21
>
> I have fixed the issue with spamd querying MySQL with the "-u" option.
> Thanks. Now I have another problem. It seems that incoming spam is tagged
> properly in the headers, but the subject is not being edited. Here are my
> local.cf, and my user preferences. Any idea on where the problem is?
Thanks.
>
> Local.cf:
> # SpamAssassin config file for version 2.5x
> # generated by http://www.yrex.com/spam/spamconfig.php (version 1.01)
> allow_user_rules1
> # How many hits before a message is considered spam.
> required_hits  5
> # Whether to change the subject of suspected spam
> rewrite_subject 1
> # Text to prepend to subject if rewrite_subject is used
> subject_tag *SPAM*
> # Encapsulate spam in an attachment
> report_safe 1

You are using the old syntax.  rewrite_subject  and subject_tag are no
longer used.  Check the UPGRADE file that came with SA

http://spamassassin.apache.org/full/3.0.x/dist/UPGRADE

-Jim

On a side note:
I am in no way directing this toward the original poster and no offense
is meant in any way but
I can not believe the number of times this question is asked on the list
:(   I guess it shows how many people dont read the documentation and
dont search the archives.




Re: Subject Rewrite not working

2004-11-19 Thread Kjetil Kjernsmo
On fredag 19 november 2004, 18:45, Jim Maul wrote:
> I can not believe the number of times this question is asked on the
> list
>
> :(   I guess it shows how many people dont read the documentation and
>
> dont search the archives.

Yeah... I just tried to add this as an entry to the main FAQ Wiki page, 
but it isn't editable to me... I would suggest adding it there, as it 
is a FAQ in the truest sense of the word... :-)

Cheers,

Kjetil
-- 
Kjetil Kjernsmo
Astrophysicist/IT Consultant/Skeptic/Ski-orienteer/Orienteer/Mountaineer
[EMAIL PROTECTED]  [EMAIL PROTECTED]  [EMAIL PROTECTED]
Homepage: http://www.kjetil.kjernsmo.net/OpenPGP KeyID: 6A6A0BBC


Re: Subject Rewrite not working

2004-11-19 Thread Jim Maul
Elias Arends wrote:
Platform: Solaris 8
MTA: Postfix 2.1.5
Software: SpamAssassin 2.64
SQL: MySQL 4.0.21
I have fixed the issue with spamd querying MySQL with the "-u" option.
Thanks. Now I have another problem. It seems that incoming spam is tagged
properly in the headers, but the subject is not being edited. Here are my
local.cf, and my user preferences. Any idea on where the problem is? Thanks.
Local.cf:
# SpamAssassin config file for version 2.5x
# generated by http://www.yrex.com/spam/spamconfig.php (version 1.01)
allow_user_rules1
# How many hits before a message is considered spam.
required_hits  5
# Whether to change the subject of suspected spam
rewrite_subject 1
# Text to prepend to subject if rewrite_subject is used
subject_tag *SPAM*
# Encapsulate spam in an attachment
report_safe 1
You are using the old syntax.  rewrite_subject  and subject_tag are no 
longer used.  Check the UPGRADE file that came with SA

http://spamassassin.apache.org/full/3.0.x/dist/UPGRADE
-Jim
On a side note:
I am in no way directing this toward the original poster and no offense 
is meant in any way but
I can not believe the number of times this question is asked on the list 
:(   I guess it shows how many people dont read the documentation and 
dont search the archives.


Re: sa-learn problem

2004-11-19 Thread Michael Barnes
On Fri, Nov 19, 2004 at 07:24:33AM +, Bob Mortimer wrote:
> I've run into a problem with sa-learn. I'm running SA 3.0.1, and since
> the RPMs always seem to lag behind the binaries it's compiled. I
> call spamassassin using amavsd. I think the problem has arisen since
> upgrading my Mandrake system from 10.0 to 10.1, it looks like 10.1
> uses Perl 5.8.5, I think 10.0 was on 5.8.3
>
> when I try to sa-learn my ham/spam I get the following:
> /usr/bin/sa-learn: /usr/bin/perl5.8.3: bad interpreter: No such file
> or directory

I've always thought that "No such file or directory" was one of the
clearest messages from a computer, but people ask all the time what it
means.

Bob says:
> I think the problem has arisen since upgrading my Mandrake system from
> 10.0 to 10.1, it looks like 10.1 uses Perl 5.8.5, I think 10.0 was on
> 5.8.3

I would bet that /usr/bin/perl5.8.3 does not exist after the upgrade,
and possibly /usr/bin/perl5.8.5 might from the information you gave.
Also, I would imagine that their is either a hard or soft link in
/usr/bin that points /usr/bin/perl to whatever version you currently
have installed, and that might be better to use vs one that is specific
to the exact version you are running.

Mike

-- 
/-\
| Michael Barnes <[EMAIL PROTECTED]> |
| UNIX Systems Administrator  |
| College of William and Mary |
| Phone: (757) 879-3930   |
\-/


Subject Rewrite not working

2004-11-19 Thread Elias Arends
Platform: Solaris 8
MTA: Postfix 2.1.5
Software: SpamAssassin 2.64
SQL: MySQL 4.0.21

I have fixed the issue with spamd querying MySQL with the "-u" option.
Thanks. Now I have another problem. It seems that incoming spam is tagged
properly in the headers, but the subject is not being edited. Here are my
local.cf, and my user preferences. Any idea on where the problem is? Thanks.

Local.cf:
# SpamAssassin config file for version 2.5x
# generated by http://www.yrex.com/spam/spamconfig.php (version 1.01)
allow_user_rules1
# How many hits before a message is considered spam.
required_hits  5
# Whether to change the subject of suspected spam
rewrite_subject 1
# Text to prepend to subject if rewrite_subject is used
subject_tag *SPAM*
# Encapsulate spam in an attachment
report_safe 1
# Use terse version of the spam report
use_terse_report0
# Enable the Bayes system
use_bayes   1
# Enable Bayes auto-learning
auto_learn  1
# Enable or disable network checks
skip_rbl_checks 0
use_razor2  1
use_dcc 1
use_pyzor   1
# Mail using languages used in these country codes will not be marked
# as being possibly spam in a foreign language.
ok_languagesall
# Mail using locales used in these country codes will not be marked
# as being possibly spam in a foreign language.
ok_locales  all
# Configuration for SQL Lookup
user_scores_dsn DBI:mysql:spamassassin:localhost
user_scores_sql_usernamespamd
user_scores_sql_passwordspaminator

MySQL:
| elias+sniper.aw| required_hits  | 6
| elias+sniper.aw| report_safe| 1
| elias+sniper.aw| use_terse_report   | 0
| elias+sniper.aw| always_add_headers | 1
| elias+sniper.aw| fold_headers   | 1
| elias+sniper.aw| spam_level_stars   | 1
| elias+sniper.aw| spam_level_char| *
| elias+sniper.aw| rewrite_subject| 1
| elias+sniper.aw| subject_tag| *SPAM*
| elias+sniper.aw| ok_languages   | af sq am ar hy eu bs bg be ca zh
hr cs da nl en eo et fi f
r fy ka de el he hi hu is id ga it ja ko
| elias+sniper.aw| use_razor1 | 1
| elias+sniper.aw| use_razor2 | 1
| elias+sniper.aw| use_pyzor  | 1
| elias+sniper.aw| use_dcc| 1
| elias+sniper.aw| skip_rbl_checks| 0


Elias Arends
SetarNet Operations
=
Setar N.V.
Seroe Blanco Z/N
Oranjestad, Aruba
Tel: (+297) 5251334
Mobile: (+297) 5970128
mailto:[EMAIL PROTECTED]
=





Re: spamd process using to much cpu

2004-11-19 Thread Matías López Bergero
Hello Matt,
Thank you for ur answer,

Matt Kettler dijo:
> At 04:13 PM 11/18/2004, Matias Lopez Bergero wrote:
>>I'm seeing a heavy cpu usage in some process of spamd for a long time and
>>sometimes they just hang there until I kill them(usage goes from 80% to
>> 97%).
>>Also my system is reporting a high iowait load and a high disk usage that
>>stops if a shutdown spammassassin processes.
>>
>>This is normal?
>>Anyone with the same problem??
>
> Define "for a long time"... Minutes? Hours?

Less than a minute, but wen it hangs, it hangs there until i kill it.
I haved noticed that this spamd process hanging is ocurring with the same
user almost all the times. That means anything to you?

>  From the sounds of it, it looks like SA is doing an opportunistic expiry
> on your bayes DB.. But that should only take a few minutes unless things
> are really haywire or your box is really slow.
>
> Try running a sa-learn -D --force-expire on the command line and see if
> that runs smoothly.
>
> Also, look around for bayes_toks.expire files laying around next to
> your bayes DB.. that's a very clear sign SA is being killed while running
> expiry.
>

I'm going to try that.
Thanks again!

BR,
Matías.




RE: script error?

2004-11-19 Thread Martin
|-Original Message-
|From: Stefan Suurmeijer [mailto:[EMAIL PROTECTED] 
|Sent: 18 November 2004 23:52
|To: users@spamassassin.apache.org
|Subject: script error?
|
|Hi list,
|
|this is probably some easy configuration issue, but it's now 
|almost 1am and I just spent the last 4 hours upgrading my 
|system so by now I wouldn't see a solution if it hit me in the face.
|
|I have everything working, except for spamassassin. I had it 
|working earlier, but I decided to get the newest version of SA 
|from CPAN and now when I try to start spamd I get the following error:
|
|ERROR!  spamassassin script is v3.00, but using modules v3.01!
|
|which is great in not telling me anything useful about what is wrong. 
|Can anybody point me in the right direction? Any help would be 
|GREATLY appreciated
|

I had exactly the same when I upgraded, turned out I had two versions of
spamd on, one in /usr/bin/spamd and the other /usr/sbin/spamd, one was the
old one and the other was the new one, cant remember which way round now,
shouldn't be hard to figure though. The startup script was finding the old
one, just delete it and/or copy the new one over.

Martin



Re: spamd logging with wrong timestamp?

2004-11-19 Thread Michael Barnes
On Fri, Nov 19, 2004 at 12:48:07PM +1100, Dimitry Peisakhov wrote:
>   Thanks for the advice Michael. I have a question though.
> Services from init.d start up under root, as far as i know. Root
> currently has no TZ set, but all other services log correctly. If i
> set TZ in the spamd init.d script, that would set TZ for root, wouldnt
> it? (TZ is user-specific, i htink). Do you think this will still
> work? Would it impact any other time-related things?  I still find it
> strange that spamd is the only service that is logging and ignoring
> the system time.

Dimitry,

Environment stuff can be very tricky sometimes.  The settting of an
environment variable sets that variable for that process, and it more
than likely will be inherited by any child processes that are spawned by
that process and any of thier subsequent processes.

In my situation, what I had done is that I set my TZ variable in my
shell, and then I would do something like 'sudo /etc/init.d/httpd
restart' and the sudo command the /etc/init.d/httpd script and the httpd
daemons all inherited my variable and the httpd daemon was logging in my
timezone.

This is why many people do things as root or another daemon user like
oracle by doing 'su - USER' or 'su - USER -c COMMAND'.  Where the '-'
denotes that it will read all of the user's environment variables as if
they had logged in.  I personally don't like sudo because I rarely
remember (or want to remember) root passwords, so I use sudo with my
password.  I also prefer my environment the way I set it up with things
like VISUAL, EDITOR, and PAGER and/or MANPAGER to things that I like,
and are often unapropriate for system accounts.

Another confusing thing with environment variables is that it is
impossible to inject an environment variable into an already running
process.  But it makes sense once you think about it.

Mike

-- 
/-\
| Michael Barnes <[EMAIL PROTECTED]> |
| UNIX Systems Administrator  |
| College of William and Mary |
| Phone: (757) 879-3930   |
\-/


sa-learn / autolearn=no/failed

2004-11-19 Thread Ronan
hi ive fed sa-learn about 450 each of spam ham
bash-2.03$ sa-learn --dump magic
0.000  0  3  0  non-token data: bayes db version
0.000  0426  0  non-token data: nspam
0.000  0427  0  non-token data: nham
0.000  0  87896  0  non-token data: ntokens
0.000  0 1082126382  0  non-token data: oldest atime
0.000  0 1100864490  0  non-token data: newest atime
0.000  0 1100189075  0  non-token data: last journal 
sync atime
0.000  0 1100189181  0  non-token data: last expiry atime
0.000  0  0  0  non-token data: last expire 
atime delta
0.000  0  0  0  non-token data: last expire 
reduction count
bash-2.03$ tail -f /var/log/syslog |grep autolearn
Nov 19 14:41:07 elisha spamd[18370]: result: . -2 - 
ALL_TRUSTED,FROM_ENDS_IN_NUMS 
scantime=0.8,size=1602,mid=<[EMAIL PROTECTED]>,autolearn=failed
Nov 19 14:41:18 elisha spamd[17612]: result: . -2 - 
ALL_TRUSTED,FROM_ENDS_IN_NUMS 
scantime=2.9,size=1598,mid=<[EMAIL PROTECTED]>,autolearn=failed

now from the documentation, bayes is on by default, but should i 
explicity dictate this in the local.cf config file??

thanks
r
--
Regards
Ronan McGlue
==
Analyst/Programmer
Information Services
Queens University Belfast
BT7 1NN


Re: rewrite_subject not working

2004-11-19 Thread Lior Marantenboim
Thanks everyone, it was my fault for not reading !

On Fri, 2004-11-19 at 11:26, Matt Kettler wrote:
> At 10:28 AM 11/19/2004 -0300, Lior Marantenboim wrote:
> >Hello everyone
> >
> >I'm using SpamAssassin 3.0 and for some reason, rewrite_subject is not
> >working. It's set on 1 in my local.cf (I'm starting it with spamd).
> >
> >Any clues?
> 
> rewrite_subject and subject_tag are no longer a config options in SA 3.0.
> 
> They have been replaced by the more generic rewrite_header command, adding 
> flexiblity.
> 
> See the UPGRADE file that came with SA 3.0.
> 
> http://spamassassin.apache.org/full/3.0.x/dist/UPGRADE



Re: rewrite_subject not working

2004-11-19 Thread Matt Kettler
At 10:28 AM 11/19/2004 -0300, Lior Marantenboim wrote:
Hello everyone
I'm using SpamAssassin 3.0 and for some reason, rewrite_subject is not
working. It's set on 1 in my local.cf (I'm starting it with spamd).
Any clues?
rewrite_subject and subject_tag are no longer a config options in SA 3.0.
They have been replaced by the more generic rewrite_header command, adding 
flexiblity.

See the UPGRADE file that came with SA 3.0.
http://spamassassin.apache.org/full/3.0.x/dist/UPGRADE 



Re: sa-learn problem

2004-11-19 Thread Bob Mortimer
On Friday 19 Nov 2004 08:29, alan premselaar wrote:

>what is the actual sa-learn command you're using? I ran into this
> using Maildir format by specifying sa-learn -D --spam
> /path/to/maildir/cur/*

I use one script to move the mails out of my mail directory tree into my 
ham/spam folders:
/var/lib/amavis/.spamassassin/newham
/var/lib/amavis/.spamassassin/newspam

I then use another script (called as user amavis) which learns the mails them 
moves them into the folderwhere I keep al my learned Ham/Spam. This was 
working fine until the update frmo Mandrake 10.0 to 10.1
The commands the script runs are:
sa-learn --ham --showdots /var/lib/amavis/.spamassassin/newham/
sa-learn --spam --showdots /var/lib/amavis/.spamassassin/newspam/

I then get:
./learn: /usr/bin/sa-learn: /usr/bin/perl5.8.3: bad interpreter: No such file 
or directory
./learn: /usr/bin/sa-learn: /usr/bin/perl5.8.3: bad interpreter: No such file 
or directory


> and received "/usr/bin/sa-learn: /usr/bin/perl: bad interpreter: too
> many arguments" as an error. I found that doing "find
> /path/to/maildir/cur/ -name "*" -exec sa-learn --spam {} ¥;" mostly
> worked but had some unexpected results that make me think I didn't need
> to specify the "*" after my path. (although perl choked if i didn't)

I've tried manually calling sa-learn and get the same error, if I cd into me 
HamMails directory and type sa-learn --ham *.* I get:
bash: /usr/bin/sa-learn: /usr/bin/perl5.8.3: bad interpreter: No such file or 
directory

Still puzzled ...

Bob


Re: rewrite_subject not working

2004-11-19 Thread Rick Macdougall

Lior Marantenboim wrote:
Hello everyone
I'm using SpamAssassin 3.0 and for some reason, rewrite_subject is not
working. It's set on 1 in my local.cf (I'm starting it with spamd).
Any clues?
Hi,
Read the upgrade doc.  rewrite_subject doesn't exist anymore.
From the UPGRADE file
- The "rewrite_subject" and "subject_tag" configuration options were
  deprecated and are now removed. Instead, using "rewrite_header Subject
  [your desired setting]".  e.g.
rewrite_subject 1
subject_tag SPAM(_SCORE_)
  becomes
rewrite_header Subject SPAM(_SCORE_)
You should really read all the INSTALL, README, UPGRADE files before 
attempting an upgrade.

Regards,
Rick


rewrite_subject not working

2004-11-19 Thread Lior Marantenboim
Hello everyone

I'm using SpamAssassin 3.0 and for some reason, rewrite_subject is not
working. It's set on 1 in my local.cf (I'm starting it with spamd).

Any clues?

Thanks

Lior Marantenboim
SysAdmin
bumeran.com



Re: script error?

2004-11-19 Thread Stefan Suurmeijer
tried it but didn't work. There's probably some statement in a startup 
script that is no longer available in v3.01, but I have no idea 
what. Anyone?

Noel K Hall II wrote:
First, shutdown the running spamd process (if it is), and any process that
calls the use of spamassassin.
Then from the CPAN shell  use 'force install Mail::SpamAssassin'
Then restart everything and see what happens.
-Noel
-Original Message-
From: Stefan Suurmeijer [mailto:[EMAIL PROTECTED] 
Sent: Thursday, November 18, 2004 6:52 PM
To: users@spamassassin.apache.org
Subject: script error?

Hi list,
this is probably some easy configuration issue, but it's now almost 1am and
I just spent the last 4 hours upgrading my system so by now I wouldn't see a
solution if it hit me in the face.
I have everything working, except for spamassassin. I had it working
earlier, but I decided to get the newest version of SA from CPAN and now
when I try to start spamd I get the following error:
ERROR!  spamassassin script is v3.00, but using modules v3.01!
which is great in not telling me anything useful about what is wrong. 
Can anybody point me in the right direction? Any help would be GREATLY
appreciated

TIA
Stefan
 



Re: sa-learn problem

2004-11-19 Thread alan premselaar
Bob Mortimer wrote:
I've run into a problem with sa-learn. I'm running SA 3.0.1, and since the 
RPMs always seem to lag behind the binaries it's compiled. I call 
spamassassin using amavsd. I think the problem has arisen since upgrading my 
Mandrake system from 10.0 to 10.1, it looks like 10.1 uses Perl 5.8.5, I 
think 10.0 was on 5.8.3

when I try to sa-learn my ham/spam I get the following:
/usr/bin/sa-learn: /usr/bin/perl5.8.3: bad interpreter: No such file or 
directory

I've tried recompiling spamassassin but it seems to have made no difference. 
ISTR I ran into a similar problem when the SURBL stuff came out and it 
involved moving some files around somewhere in 
the /usr/lib/perl5/vendor_perl/5.8.* tree but I seem to have run out of 
talent in trying to figure it out ;-)

Any help appreciated.
 
Regards,

Bob
Bob,
  what is the actual sa-learn command you're using? I ran into this 
using Maildir format by specifying sa-learn -D --spam /path/to/maildir/cur/*

and received "/usr/bin/sa-learn: /usr/bin/perl: bad interpreter: too 
many arguments" as an error. I found that doing "find 
/path/to/maildir/cur/ -name "*" -exec sa-learn --spam {} ¥;" mostly 
worked but had some unexpected results that make me think I didn't need 
to specify the "*" after my path. (although perl choked if i didn't)

this is on a system that I am running Perl 5.8.0 on (although it seems 
to be working now).  I had maybe 1300+ mail files in the directory.

anyways, it seems to me like either the file you're telling it to read 
is either not there, or it's been moved while it was open.

hope this helps
alan


sa-learn problem

2004-11-19 Thread Bob Mortimer
I've run into a problem with sa-learn. I'm running SA 3.0.1, and since the 
RPMs always seem to lag behind the binaries it's compiled. I call 
spamassassin using amavsd. I think the problem has arisen since upgrading my 
Mandrake system from 10.0 to 10.1, it looks like 10.1 uses Perl 5.8.5, I 
think 10.0 was on 5.8.3

when I try to sa-learn my ham/spam I get the following:
/usr/bin/sa-learn: /usr/bin/perl5.8.3: bad interpreter: No such file or 
directory

I've tried recompiling spamassassin but it seems to have made no difference. 
ISTR I ran into a similar problem when the SURBL stuff came out and it 
involved moving some files around somewhere in 
the /usr/lib/perl5/vendor_perl/5.8.* tree but I seem to have run out of 
talent in trying to figure it out ;-)

Any help appreciated.
 
Regards,

Bob


Re: more spamassassin + bayes + postgres stuff

2004-11-19 Thread Michael Parker
[ Lots of Good Stuff ]

Yes please, can you dump all of this into the bug so we have a good
record.

Thanks,
Michael


pgpy9B04ISWOI.pgp
Description: PGP signature


Re: more spamassassin + bayes + postgres stuff

2004-11-19 Thread Rupa Schomaker
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1

Oh, forgot something.  The patch doesn't create an index on
bayes_seen(msgid) -- probably should.

- --
 -Rupa

-BEGIN PGP SIGNATURE-
Version: GnuPG v1.2.5 (MingW32)
Comment: Using GnuPG with Thunderbird - http://enigmail.mozdev.org

iD8DBQFBnYYfL3Aub+krmycRAu6YAKCY2gDfJyqm6Fq3F4I0+u0ruFhI4gCePYyd
Dj6IuC9ax2E2gWYx3DwTln0=
=WvDb
-END PGP SIGNATURE-



Re: more spamassassin + bayes + postgres stuff

2004-11-19 Thread Rupa Schomaker
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1


On 11/18/2004 3:38 PM, Michael Parker wrote:
> On Thu, Nov 18, 2004 at 06:53:19AM -0800, Rupa Schomaker wrote:
> 
>>Some questions:
>>
>>Is bytea really necessary?  If I follow the path of the patch, the bytea
>>change was done prior to adding the index.  Since the tokens are binary
>>data it is probably more correct through, especially if one has a
>>encoding other than SQL_ASCII set for the DB...
> 
> 
> Yes, as far as I can tell from the documentation.  The fact that we're
> storing the binary value makes it necessary.  If I'm misinformed, then
> feel free to point out where in the documentation.

My understanding is that isn't necessary but it is more fragile (subject
to the database encoding and the client encoding).  This was discussed
recently on one of the postgres groups... Looking:


Message-ID: <[EMAIL PROTECTED]>

===
From: Tom Lane ([EMAIL PROTECTED])
Subject: Re: [ADMIN] evil characters #bfef cause dump failure
Date: 2004-11-16 12:19:06 PST

[snip]
BTW, SQL_ASCII is not so much an encoding as the absence of any encoding
choice; it just passes 8-bit data with no interpretation.  So it's not
*that* unreasonable a default.  You can store UTF8 data in it without
any problem, you just won't have the niceties like detection of bad
character sequences.

   regards, tom lane
===

Leave it as bytea...

>>What do you use to benchmark changes?  I'm willing to experiment but
>>would like to have some reproducable results for ya...
> 
> 
> It's not really ready for real world consumption and time has been
> short for getting it ready.  You can read a little about it here:
> http://wiki.apache.org/spamassassin/BayesBenchmark
> 
> Hopefully, I'll get some free time soon and get it into the SA tree.

I'll take a look at it when I get a chance.

Some more testing/observations with sa-learn only.  BTW: do you want me
to move this discussion to the ticket in bugzilla?  Or we can wait 'till
I/we have a summary...

General notes:

1) Why not a unique index that mimics the primary key (though do it in
token,id order not id,token)?  Won't matter in my case (since I run as
one user) and probably doen't matter at all unless running with lots 'n
lots of users...

2) bayes_seen.msgid should be type 'text' -- sa-learn (and others) don't
truncate to 200.

3) I also get differences in the backup file.

- -rw-r--r--  1 rupa users 13047214 Nov 18 13:23 backup_dbm.txt
- -rw-r--r--  1 rupa users 13047202 Nov 18 17:16 backup_new.txt

An actual diff is probably meaningless since I doubt order is guaranteed
between a dbm and sql.  I did the diff and quickly gave up.  I suppose
the data could be ordered from both sources and then compared?

Some 'benchmarks' of sa-learn.  Single run:

bayes_seen: 202863 rows
bayes_token: 150842 rows

System is:
model name  : AMD Athlon(tm) XP 2600+
MemTotal:  1031916 kB
debian unstable

With a fairly large workload from a memory standpoint but CPU generally
fairly idle.

Postgres hasn't been tuned "much" -- have to reset the stats in postgres
and do some analysis...

1) Shipped config with msgid='text' on my backup file:

real24m35.663s

2) Shipped config with indices added:

real32m33.931s

Ekk!  Analyze; delete; rerun:

Still 30min.

hrmmm..

But I know it runs better in normal operation.  Oh well *shrug* must be
the index update even though the check constraint doesn't need a table scan.

3) Patch (2004-10-31 18:53) applied, re-create tables:

real14m29.793s

Analyze, delete, rerun:

15m.

A bit better.

BTW: Using dbm the full restore takes 23s...

Time to add some small amount of stats to sa-learn (or underlying) to
see where we're spending time...  Added some more timing points and
dbg() output to SQL.pm.  Needs Time::HiRes which is bundled in perl
5.8.x but is an optional add-on for earlier stuff.

Ok, with my large set:

Token inserts start at around 1-2s per 1000 and rises to 7-8s per 1000.

Seen inserts start at around 1s per 1000 and stay there.

I can think of ways to optimize sa-learn (do it all in one TX rather
than 1TX per insert), assume an insert rather than using the generic
query then insert path for _put_token() but the restore is only done
once anyway and the changes would require some invasive changes rather
than just re-using existing logic  Not worth it.

It is however a reasonable test of the insert/update logic of learning a
single message (whether auto-learn or manual).  Doesn't test the query
side though...

> 
> Michael

- --
 -Rupa

-BEGIN PGP SIGNATURE-
Version: GnuPG v1.2.5 (MingW32)
Comment: Using GnuPG with Thunderbird - http://enigmail.mozdev.org

iD8DBQFBnYS/L3Aub+krmycRAuioAJ9bh224fxsAvUTX9liLQ1pf/wYIVACgxBDQ
SllANDuelO8OWEwqOWZ9FsM=
=1cIx
-END PGP SIGNATURE-



Re: Kill spamd spawns new processes

2004-11-19 Thread Matt Kettler
At 06:57 PM 11/18/2004, Rick Macdougall wrote:
Pssh, kill -15 is for wimps :)
Yeah, true.. real sysadmins just fix their servers by throwing the main 
circuit breaker in the breaker panel :)

(and of course, REAL sysamdins don't use UPS systems either. After all, 
what user is going to be at his computer accessing the servers by the dim 
glow of the emergency lighting???)

:)


RE: script error?

2004-11-19 Thread Noel K Hall II
First, shutdown the running spamd process (if it is), and any process that
calls the use of spamassassin.

Then from the CPAN shell  use 'force install Mail::SpamAssassin'

Then restart everything and see what happens.

-Noel

-Original Message-
From: Stefan Suurmeijer [mailto:[EMAIL PROTECTED] 
Sent: Thursday, November 18, 2004 6:52 PM
To: users@spamassassin.apache.org
Subject: script error?

Hi list,

this is probably some easy configuration issue, but it's now almost 1am and
I just spent the last 4 hours upgrading my system so by now I wouldn't see a
solution if it hit me in the face.

I have everything working, except for spamassassin. I had it working
earlier, but I decided to get the newest version of SA from CPAN and now
when I try to start spamd I get the following error:

ERROR!  spamassassin script is v3.00, but using modules v3.01!

which is great in not telling me anything useful about what is wrong. 
Can anybody point me in the right direction? Any help would be GREATLY
appreciated

TIA
Stefan




RE: spamd logging with wrong timestamp?

2004-11-19 Thread Dimitry Peisakhov
Thanks for the advice Michael. I have a question though.
Services from init.d start up under root, as far as i know. Root currently
has no TZ set, but all other services log correctly. If i set TZ in the
spamd init.d script, that would set TZ for root, wouldnt it? (TZ is
user-specific, i htink). Do you think this will still work? Would it impact
any other time-related things? 
I still find it strange that spamd is the only service that is logging and
ignoring the system time.

Dimitry

-Original Message-
From: Michael Barnes [mailto:[EMAIL PROTECTED]
Sent: Friday, 19 November 2004 2:14 AM
To: users@spamassassin.apache.org
Subject: Re: spamd logging with wrong timestamp?



I've had servers log with incorrect time before and it was due to the
fact that I was in a different timezone than my server and I set my TZ
environment variable to reflect my timezone and not the timezone that
the server was located.  So, when I restarted the server, it inheireted
my TZ variable and started logging with that time and not the system
time.

If you might think that this is an issue, you could put the correct TZ
environment info in your spamd startup/shutdown script and that way it
will always be correct (or at least consistant).

Mike

On Thu, Nov 18, 2004 at 02:46:22PM +1100, Dimitry Peisakhov wrote:
> Mike, adding that switch to syslog didnt seem to work. I found some
> other posts through google which claim that stopping syslogd and spamd
> and starting them again fixes this up, but it hasnt worked for me.
>
> If anyone else has any ideas, they would be greately appreciated.
> 
> Thanks,
> Regards, Dimitry
> 
> -Original Message-
> From: Mike Kercher [mailto:[EMAIL PROTECTED]
> Sent: Thursday, 18 November 2004 12:04 AM
> To: users@spamassassin.apache.org
> Subject: RE: spamd logging with wrong timestamp?
> 
> 
> Dimitry Peisakhov wrote:
> > Hi guys,
> > 
> >I've recently discovered that my spamd is writing to the logs with
> > the incorrect timestamp. It looks like its using GMT to timestamp
> > instead of the actual time on the box (11hr difference). I fixed this
> > previously by restarting the service, but its not doing the trick
> > now.. Anyone have ideas about this? There doesnt seem to be any
> > switches for spamd to control timestamps or timezone config. 
> > 
> > thanks,
> > Regards,
> > 
> > Dimitry Peisakhov
> > Systems Administrator
> > 
> > HENRY WALKER ELTIN
> > 02 8875 4721
> > [EMAIL PROTECTED]
> 
> I had a similar problem a couple of weeks ago with a machine that had an
> older OS on it.  sendmail was logging the correct timestamp but
MailScanner
> was logging about 2 hours behind.  The way I resolved it was to add the
'-r'
> switch to my syslog initscript.
> 
> Mike

-- 
/-\
| Michael Barnes <[EMAIL PROTECTED]> |
| UNIX Systems Administrator  |
| College of William and Mary |
| Phone: (757) 879-3930   |
\-/


RE: How can I bring CPU down where Spamd takes %60 of all CPU?

2004-11-19 Thread LOGS \(Tunc Eresen\)
 

-Original Message-
From: Jeremy Rumpf [mailto:[EMAIL PROTECTED] 
Sent: 18 November 2004 17:20
To: LOGS (Tunc Eresen); users@spamassassin.apache.org
Subject: Re: How can I bring CPU down where Spamd takes %60 of all CPU?

On Thursday 18 November 2004 04:20 am, LOGS (Tunc Eresen) wrote:
>  Is there a way of reducing or caping CPU usage spamd by issuing 
> commands or making changes in config?
>
> Regards
> Tunc

If you must, nice it down to a lower priority. In your startup script add a
nice adjustment to the spamd launch:

nice -n15 spamd .

or "man nice" form more details.

HTH,
Jeremy 
--


Thanks but I don't know how to insert this line into which script, Since I
am novice for all help would be most appreciated,

Tunc