from:"Rosenbaum, Larry M."

RE: DKIM Checks

2011-05-18 Thread Rosenbaum, Larry M.

 From: Matt [mailto:lm7...@gmail.com]
 Sent: Wednesday, May 18, 2011 11:32 AM
 To: users
 Subject: DKIM Checks

 I am running spamassassin-3.2.5-1.el5 on 64 bit CentOS.

 sa-update -D seems to indicate that the DKIM libraries are installed.
 ... 
 May 18 10:25:02.683 [15134] dbg: diag: [...] module installed:
 Mail::DKIM, version 0.39
 ...
 Looking at the X-Spam-Report on various messages and I never see that
 its looked at.  I see that SPF is checked and scored.  Any idea why
 its not checking the DKIM signatures?

Check the file v312.pre and see if the loadplugin line for DKIM is commented 
out.  If it is, uncomment it.

RE: Fake MX

2010-12-13 Thread Rosenbaum, Larry M.

 From: Bob Proulx [mailto:b...@proulx.com]
 Subject: Re: Fake MX

   [...] but that is distinct from being a tarpit, which is what
   I'm trying to clarify.

  A discussion around the definition of tarpit, and why tarbaby might be a
  suboptimal, though catchy, name?

 For the record a tarbaby:

   http://en.wikipedia.org/wiki/Tar_baby

 is something different from a tarpit:

   http://en.wikipedia.org/wiki/Tarpit_%28networking%29

 Please, let's use the correct terminology.  They really are pretty far
 from being interchangeable.

I wonder if the OP was really referring to a honeypot?

RE: email address forgery

2010-11-12 Thread Rosenbaum, Larry M.

Are there domains that have actually defined SPF record type records?  I 
haven’t been able to find any, but it could be the fault of the tools I’m using.

L

From: Noel Butler [mailto:noel.but...@ausics.net]
Sent: Thursday, November 11, 2010 5:14 PM
To: users@spamassassin.apache.org
Subject: Re: email address forgery

On Thu, 2010-11-11 at 10:07 -0500, Rob McEwen wrote:

On 11/11/2010 9:11 AM, Jeremy Van Rooyen wrote:

 Can anybody explain to me how to do this and how would I be able to

 test it?

Jeremy,

I really like to use the following wizard to generate my SPF strings:

http://www.openspf.org/

Scroll down to the section that says Deploying SPF, enter the domain

name, and click GO. Then, on the next page, fine tune the answers to

the various questions before submitting the info to generate your SPF

string. Finally, go into your DNS server and, for that domain, add that

string as a TXT record.

*and* as an  SPF  record type, the TXT method is deprecated, but for time being 
it's good to use it since there are a lot, and I mean a  LOT of outdated DNS 
servers around that do not support it even today, yes, the fault of the DNS 
server admin for running antiquated rubbish, but, there's just no telling some 
people to get with the times.

RE: DOS_OE_TO_MX

2010-09-29 Thread Rosenbaum, Larry M.



 -Original Message-
 From: njjrdell [mailto:nruggi...@dellmagazines.net]
 Sent: Wednesday, September 29, 2010 11:32 AM
 To: users@spamassassin.apache.org
 Subject: Re: DOS_OE_TO_MX
 
 
 I'm pretty sure she would not send a GTUBE. Here is another from her
 
 Sep 28 08:35:26 nsmail spamd[207]: prefork: child states: II\n
 Sep 28 08:35:55 nsmail spamd[287]: spamd: connection from localhost
 [127.0.0.1] at port 50098\n
 Sep 28 08:35:55 nsmail spamd[287]: spamd: checking message
 000b01cb5f6e$b1bbfe80$6629a...@traci for (unknown):500\n
 Sep 28 08:35:55 nsmail spamd[287]: spamd: identified spam (4006.3/5.0) for
 (unknown):500 in 1.0 seconds, 142218 bytes.\n
 Sep 28 08:35:55 nsmail spamd[287]: spamd: result: Y 4006 -
 AWL,BAYES_50,DATE_IN_FUTURE_12_24,DOS_OE_TO_MX
 scantime=1.0,size=142218,user=(unknown),uid=500,required_score=5.0,rhost=l
 ocalhost,raddr=127.0.0.1,rport=50098,mid=000b01cb5f6e$b1bbfe80$6629a...@t
 raci,bayes=0.483846,autolearn=no\n
 
 
 I never seen anything with such a score of 4006. DOS_OE_TO_MX is the rule
 that is consistent, so I was hoping to find out where it is to make sure
 nothing is scored wrong

score DOS_OE_TO_MX 2.602 3.086 2.265 2.523

RE: DOS_OE_TO_MX

2010-09-29 Thread Rosenbaum, Larry M.

 From: njjrdell [mailto:nruggi...@dellmagazines.net]
 Sent: Wednesday, September 29, 2010 12:05 PM
 To: users@spamassassin.apache.org
 Subject: RE: DOS_OE_TO_MX

 also, won't whitelisting her address open her up for spoofing?

AWL has nothing to do with whitelist_from and other similar options.  It's more 
of a score averager.
http://wiki.apache.org/spamassassin/AutoWhitelist

 thanks for the scores. Now would that just go into
 /usr/local/share/spamassassin/50_scores.cf?
 and why would that score be missing.

It's not missing.  It is in
/var/lib/spamassassin/3.003001/updates_spamassassin_org/50_scores.cf
or some similar directory. To find your config directory path, try this:

spamassassin -D config --lint

 Rosenbaum, Larry M. wrote:

  -Original Message-
  From: njjrdell [mailto:nruggi...@dellmagazines.net]
  Sent: Wednesday, September 29, 2010 11:32 AM
  To: users@spamassassin.apache.org
  Subject: Re: DOS_OE_TO_MX

  I'm pretty sure she would not send a GTUBE. Here is another from her

  Sep 28 08:35:26 nsmail spamd[207]: prefork: child states: II\n
  Sep 28 08:35:55 nsmail spamd[287]: spamd: connection from localhost
  [127.0.0.1] at port 50098\n
  Sep 28 08:35:55 nsmail spamd[287]: spamd: checking message
  000b01cb5f6e$b1bbfe80$6629a...@traci for (unknown):500\n
  Sep 28 08:35:55 nsmail spamd[287]: spamd: identified spam (4006.3/5.0)
  for
  (unknown):500 in 1.0 seconds, 142218 bytes.\n
  Sep 28 08:35:55 nsmail spamd[287]: spamd: result: Y 4006 -
  AWL,BAYES_50,DATE_IN_FUTURE_12_24,DOS_OE_TO_MX

 scantime=1.0,size=142218,user=(unknown),uid=500,required_score=5.0,rhost=l

 ocalhost,raddr=127.0.0.1,rport=50098,mid=000b01cb5f6e$b1bbfe80$6629a...@t
  raci,bayes=0.483846,autolearn=no\n

  I never seen anything with such a score of 4006. DOS_OE_TO_MX is the
 rule
  that is consistent, so I was hoping to find out where it is to make
 sure
  nothing is scored wrong

  score DOS_OE_TO_MX 2.602 3.086 2.265 2.523

 --
 View this message in context: http://old.nabble.com/DOS_OE_TO_MX-
 tp29839497p29840133.html
 Sent from the SpamAssassin - Users mailing list archive at Nabble.com.

Moving from Solaris to Red Hat

2010-08-11 Thread Rosenbaum, Larry M.

We are currently running SA v3.3.1 on Solaris 9 and Solaris 10 and are planning 
to move to Red Hat.  I don't have much experience with Red Hat (or Linux in 
general).  Could you point me to some tips and documentation about installing 
and running SA on Red Hat?

FYI, on Solaris I install by downloading the Mail-SpamAssassin-3.3.1.tar.bz2 
file and running the build procedure, after installing the required Perl 
modules.

Thanks,
Larry

RE: Trouble whitelisting domain users with whitelist_from_rcvd

2010-07-28 Thread Rosenbaum, Larry M.

 What is the best way to completely whitelist all internal emails so that
 there is no danger of any internal emails being blacklisted

The best way is to not feed internal emails to SpamAssassin.

compiling: Illegal octal digit '9' ignored...

2010-07-23 Thread Rosenbaum, Larry M.

I'm seeing warnings like this when I run sa-compile:

Illegal octal digit '9' ignored at 
/usr/local/lib/perl5/site_perl/5.10.1/Mail/SpamAssassin/Plugin/BodyRuleBaseExtractor.pm
 line 1083, $fh line 5645.
Illegal octal digit '9' ignored at 
/usr/local/lib/perl5/site_perl/5.10.1/Mail/SpamAssassin/Plugin/BodyRuleBaseExtractor.pm
 line 1083, $fh line 6449.
Illegal octal digit '9' ignored at 
/usr/local/lib/perl5/site_perl/5.10.1/Mail/SpamAssassin/Plugin/BodyRuleBaseExtractor.pm
 line 1083, $fh line 7229.

I've done some digging and it looks like the rule2xs() function is trying to 
parse stuff like this:

r \2512009 microsoft | unsubscribe | more newsletters | privac:FB_SOFTTABS 
__FB_BCs __SEEK_YRQYH9,[l=1] __SUBSCRIPTION_INFO,[l=1]
r copyright \2512009 by nacha - the electronic payments 
associ:LOTTO_AGENT,[l=1] T_LOTTO_DEPT,[l=1] __ATM_CARD,[l=1] __FEES,[l=1] 
__LOTTO_WIN_01,[l=1] __SEEK_VZ7OQ6,[l=1] __YOUR_FUND,[l=1]

I think these are coming from lines like this:

sought_rules_yerp_org/20_sought.cf:body __SEEK_YRQYH9  /\x{a9}2009 Microsoft \| 
Unsubscribe \| More Newsletters \| Privacy/
sought_rules_yerp_org/20_sought.cf:body __SEEK_VZ7OQ6  /Copyright \x{a9}2009 by 
NACHA - The Electronic Payments Association/

So the year occurs right after the octal escape code for the copyright symbol, 
and the fixup_re code is assuming the 2009 is part of the octal escape.

I suspect this error prevents the rule from being used.  Is there a fix for 
this?

System info:
SpamAssassin version 3.3.1
  running on Perl version 5.10.1
SunOS email 5.9 Generic_118558-39 sun4u sparc SUNW,Sun-Fire-V210

RE: new PDF Launch malware exploit (with sample)

2010-04-29 Thread Rosenbaum, Larry M.

 From: d.h...@yournetplus.com [mailto:d.h...@yournetplus.com]
 Sent: Wednesday, April 28, 2010 2:29 PM
 To: users@spamassassin.apache.org
 Subject: RE: new PDF Launch malware exploit (with sample)

 Quoting Rosenbaum, Larry M. rosenbau...@ornl.gov:

  Please don't send live malware samples to the list.

 Um... The OP did not send malware to the list. A link was supplied to
 the original message. You must have a scanner set up to follow links.
 That isn't a good idea, in my opinion.

There was some code in the message, right after the Here's just the nifty 
Launch part paragraph.  Perhaps it's not dangerous in a text message, but 
Forefront didn't like it anyway.

  -Original Message-
  From: Chip M. [mailto:sa_c...@iowahoneypot.com]
  Sent: Wednesday, April 28, 2010 2:01 PM
  To: users@spamassassin.apache.org
  Subject: new PDF Launch malware exploit (with sample)

  FILE QUARANTINED

  Microsoft Forefront Security for Exchange Server removed a file
 since
  it was found to be infected.
  File name: Body of Message
  Virus name: TrojanDropper:Win32/Pidrop.A

RE: new PDF Launch malware exploit (with sample)

2010-04-28 Thread Rosenbaum, Larry M.

Please don't send live malware samples to the list.

 -Original Message-
 From: Chip M. [mailto:sa_c...@iowahoneypot.com]
 Sent: Wednesday, April 28, 2010 2:01 PM
 To: users@spamassassin.apache.org
 Subject: new PDF Launch malware exploit (with sample)

 FILE QUARANTINED

 Microsoft Forefront Security for Exchange Server removed a file since
 it was found to be infected.
 File name: Body of Message
 Virus name: TrojanDropper:Win32/Pidrop.A

RE: More freemail URI spam

2010-04-20 Thread Rosenbaum, Larry M.

 Generally speaking, anything deemed worthwhile is added to SA proper
 (unless there's a licensing question).  The exceptions come from
 automated rules (like Sought, MBL, SARE 2tld, and Khop-sc-neighbors),

90_2tld.cf has been replaced by the official rule file 20_aux_tlds.cf.  From 
the comments in that file:

# This file replaces the SARE http://www.rulesemporium.com/rules/90_2tld.cf
# which will be deprecated as from 2010-05-01

RE: 90_sare_freemail.cf.sare.sa-update.dostech.net

2010-03-09 Thread Rosenbaum, Larry M.

 From: Yet Another Ninja [mailto:sa-l...@alexb.ch]

 On 3/4/2010 7:34 PM, Rosenbaum, Larry M. wrote:

  From: Karsten Bräckelmann [mailto:guent...@rudersport.de]

  On Thu, 2010-03-04 at 00:12 +0100, Yet Another Ninja wrote:
  On 3/3/2010 10:09 PM, Karsten Bräckelmann wrote:
  On Wed, 2010-03-03 at 15:38 -0500, Rosenbaum, Larry M. wrote:
  Is there still a reason for this update channel?

  90_sare_freemail.cf.sare.sa-update.dostech.net

  Or is it now built in to SA v3.3.0?
^
  20_freemail.cf and 20_freemail_domains.cf ?
  90_sare_freemail.cf is still supported by for ppl who haven't
  upgraded
  to SA 3.3.x
  Thanks for that addition and confirmation of status. :)

  The original question and hence my answer was specifically about
 3.3.x,
  though, and whether it still is needed from external sources with
 that
  version.

  I'm doing the same additions to 20_freemail_domains.cf

  Later this year, 90_sare_freemail.cf, will become unsupported.

  Anybody using SA 3.3.x should drop 90_sare_freemail.cf usage.

  Thanks, but I'm confused, as there are domains in 90_sare_freemail.cf
 that are not currently in 20_freemail_domains.cf.

 Hi Larry...

 Never got around to do the diff... your msg triggered :-)
 Unless I borked it, it should now included the missing from
 90_sare_freemail.cf

I still don't see the domains in 20_freemail_domains.cf.

Thanks, Larry

RE: 90_sare_freemail.cf.sare.sa-update.dostech.net

2010-03-04 Thread Rosenbaum, Larry M.

 -Original Message-
 From: Karsten Bräckelmann [mailto:guent...@rudersport.de]
 Sent: Wednesday, March 03, 2010 6:19 PM
 To: users@spamassassin.apache.org
 Subject: Re: 90_sare_freemail.cf.sare.sa-update.dostech.net

 On Thu, 2010-03-04 at 00:12 +0100, Yet Another Ninja wrote:
  On 3/3/2010 10:09 PM, Karsten Bräckelmann wrote:
   On Wed, 2010-03-03 at 15:38 -0500, Rosenbaum, Larry M. wrote:
   Is there still a reason for this update channel?

   90_sare_freemail.cf.sare.sa-update.dostech.net

   Or is it now built in to SA v3.3.0?
   ^
   20_freemail.cf and 20_freemail_domains.cf ?

  90_sare_freemail.cf is still supported by for ppl who haven't
 upgraded
  to SA 3.3.x

 Thanks for that addition and confirmation of status. :)

 The original question and hence my answer was specifically about 3.3.x,
 though, and whether it still is needed from external sources with that
 version.

  I'm doing the same additions to 20_freemail_domains.cf

  Later this year, 90_sare_freemail.cf, will become unsupported.

  Anybody using SA 3.3.x should drop 90_sare_freemail.cf usage.

Thanks, but I'm confused, as there are domains in 90_sare_freemail.cf that are 
not currently in 20_freemail_domains.cf.

L

90_sare_freemail.cf.sare.sa-update.dostech.net

2010-03-03 Thread Rosenbaum, Larry M.

Is there still a reason for this update channel?

90_sare_freemail.cf.sare.sa-update.dostech.net

Or is it now built in to SA v3.3.0?

90_sare_freemail.cf.sare.sa-update.dostech.net

2010-02-01 Thread Rosenbaum, Larry M.

Is there still a reason for this update channel?

90_sare_freemail.cf.sare.sa-update.dostech.net

Or is it now built in to SA v3.3.0?

spamd: respawning server - why?

2010-01-15 Thread Rosenbaum, Larry M.

Yesterday one of our servers started having problems.  I found the following 
messages in the syslog file:

Jan 14 14:12:38 localhost spamd[20926]: spamd: respawning server at 
/usr/local/bin/spamd line 1080.
Jan 14 14:12:38 localhost spamd[20927]: spamd: respawning server at 
/usr/local/bin/spamd line 1080.
Jan 14 14:13:45 localhost spamd[21038]: spamd: respawning server at 
/usr/local/bin/spamd line 1080.
Jan 14 14:13:45 localhost spamd[21056]: spamd: respawning server at 
/usr/local/bin/spamd line 1080.
Jan 14 14:13:45 localhost spamd[21057]: spamd: respawning server at 
/usr/local/bin/spamd line 1080.
Jan 14 15:17:46 localhost spamd[21726]: spamd: respawning server at 
/usr/local/bin/spamd line 1080.
..etc..

What causes this to happen?  A reboot fixed the problem, but I want to make 
sure it doesn't happen again.

SunOS ornl50 5.9 Generic_118558-39 sun4u sparc SUNW,Sun-Fire-V210
SpamAssassin Server version 3.2.5
  running on Perl 5.8.8
  with zlib support (Compress::Zlib 2.011)

Spamd startup locale question

2010-01-13 Thread Rosenbaum, Larry M.

SpamAssassin Server version 3.2.5
  running on Perl 5.8.8
  with zlib support (Compress::Zlib 2.011)
SunOS ornl72 5.9 Generic_122300-07 sun4u sparc SUNW,Sun-Fire-V240

What causes the following error message when restarting spamd?

perl: warning: Setting locale failed.
perl: warning: Please check that your locale settings:
LC_ALL = (unset),
LANG = en_US
are supported and installed on your system.
perl: warning: Falling back to the standard locale (C).

This happens on some, but not all, of our systems running spamd.  All the 
startup files contain

LANG=en_US
export LANG

Thanks,
Larry

RE: spamassassin bug

2010-01-11 Thread Rosenbaum, Larry M.

 check: no loaded plugin implements 'check_main': cannot scan! at
 /usr/lib/perl5/vendor_perl/5.10.0/Mail/SpamAssassin/PerMsgStatus.pm
 line
 164.
 
What plugin do I need to have loaded to resolve this error?

It looks like you are missing the v320.pre file, which contains

loadplugin Mail::SpamAssassin::Plugin::Check

along with several other important loadplugin lines.

RE: About upgrading

2010-01-10 Thread Rosenbaum, Larry M.

 -Original Message-
 From: Bill Landry [mailto:b...@inetmsg.com]
 Sent: Sunday, January 10, 2010 12:42 PM
 To: users@spamassassin.apache.org
 Subject: Re: About upgrading

 LuKreme wrote:
  On 9-Jan-2010, at 21:23, Rosenbaum, Larry M. wrote:

  It's the number of seconds since the epoch (Jan 1, 1970).  One easy
 way to convert it to a readable time is

  # perl -e 'print scalar localtime 1263044805, \n'
  Sat Jan  9 08:46:45 2010

 Or even simpler:

 perl -le 'print scalar localtime 1263049538'
 Sat Jan  9 05:46:45 2010

   % date -r 1263044805
  Sat Jan  9 06:46:45 MST 2010

 On Linux based systems:

 date -d @1263044805
 Sat Jan  9 05:46:45 PST 2010

 I like this output better than the perl output because it also includes
 the timezone.

Excellent.  Is there one that works on Solaris (other than the Perl version)?

RE: About upgrading

2010-01-09 Thread Rosenbaum, Larry M.

--Original Message-
 From: Alex [mailto:mysqlstud...@gmail.com]
 Sent: Saturday, January 09, 2010 9:13 PM
 To: SA Mailing list
 Subject: Re: About upgrading

 Hi,

    sa-learn --dump magic gives:
        0.000          0          3          0  non-token data: bayes
 db version
        0.000          0      57538          0  non-token data: nspam
        0.000          0      74876          0  non-token data: nham
        0.000          0     166338          0  non-token data: ntokens
        0.000          0 1257478501          0  non-token data: oldest
 atime
        0.000          0 1263049426          0  non-token data: newest
 atime
        0.000          0 1263049538          0  non-token data: last
 journal sync atime
        0.000          0 1263044805          0  non-token data: last
 expiry atime
        0.000          0    5529600          0  non-token data: last
 expire atime delta
        0.000          0       1868          0  non-token data: last
 expire reduction count

  Your database has 166338 tokens which is larger than the default
  bayes_expiry_max_db_size 15.  The last expiration ran this
 morning
  at 8:46.  You could try letting the bayes database get larger and
 turn
  off bayes_auto_expire.  If you turn off bayes_auto_expire you'll have
  to add something to cron to periodically expire tokens.
  bayes_auto_expire is fine for lower volumes of email, but can get in
  the way with higher volumes.

 Can I ask how you calculate the actual time from that number? I
 suspect it's the epoch minus some division of 24hrs, but a quick
 search wasn't fruitful.

It's the number of seconds since the epoch (Jan 1, 1970).  One easy way to 
convert it to a readable time is

# perl -e 'print scalar localtime 1263044805, \n'
Sat Jan  9 08:46:45 2010

Solaris 10 requires --syslog-socket=native

2009-12-30 Thread Rosenbaum, Larry M.

I have just recently installed SA v3.3.0-rc1 on Solaris 10.  I have discovered 
that in order for syslog logging to work, I have to start spamd with the switch 
--syslog-socket=native.  It won't work if I set it to unix or inet or if 
I omit the switch entirely.  As this is my first time running SpamAssassin on 
Solaris 10, I don't know if this discovery also applies to older SpamAssassin 
versions, but I suspect it does.  I suggest the documentation be changed to 
reflect this, since currently it does not even mention native as a legitimate 
option.

In the long term, perhaps native can be made the default, or have the code 
just not call setlogsock() if the --syslog-socket switch is absent.  (I don't 
know if this is feasible with older versions of Sys::Syslog.)

This is what the current docs say:

--syslog-socket=*type*
Specify how spamd should send messages to syslogd. The options are
unix, inet or none. The default is to try unix first,
falling back to inet if perl detects errors in its unix support.

Some platforms, or versions of perl, are shipped with dysfunctional
versions of the Sys::Syslog package which do not support some socket
types, so you may need to set this. If you get error messages
regarding __PATH_LOG or similar from spamd, try changing this
setting.

Recommended version of re2c?

2009-12-21 Thread Rosenbaum, Larry M.

What is the recommended version of re2c to use with SpamAssassin v3.3.0?  What 
about with v3.2.5?

Thanks, Larry

RE: ANNOUNCE: Apache SpamAssassin 3.3.0-beta1 available

2009-12-09 Thread Rosenbaum, Larry M.

 From: Mark Martinec [mailto:mark.martinec...@ijs.si]
 
  Thanks for testing! Which version of a perl module Time::HiRes
  do you have installed? See what is reported by:
$ perl -MTime::HiRes -le 'print Time::HiRes-VERSION'

1.9719

This is the version that came with Perl 5.10.1, and is the same version that's 
on CPAN.

  Could you please try upgrading this module if yours is rather old,
  and see if that helps.
 
 P.S., does the following change to t/timeout.t on your system
 make any difference in test results?
 
 --- timeout.t   2009-12-09 03:29:12.0 +0100
 +++ timeout.t   2009-12-09 03:29:19.0 +0100
 @@ -23,3 +23,3 @@
  use strict;
 -use Time::HiRes qw(time sleep);
 +use Time::HiRes qw(time sleep alarm);

That didn't help.

BTW, has the clamav interface changed?  I'm getting errors from this statement:

$permsgstatus-{main}-{conf}-{headers_spam}-{Virus} = $header;

Dec  9 13:46:03.893 [27455] warn:  (Not a HASH reference at 
/etc/mail/spamassassin/ornl_clamav.pm line 36.
Dec  9 13:46:03.893 [27455] warn: )

If I'm analyzing correctly, $permsgstatus-{main}-{conf}-{headers_spam} used 
to be a hash ref, and now it's an array ref.

RE: ANNOUNCE: Apache SpamAssassin 3.3.0-beta1 available

2009-12-08 Thread Rosenbaum, Larry M.

 From: Warren Togami [mailto:wtog...@redhat.com]
 Subject: ANNOUNCE: Apache SpamAssassin 3.3.0-beta1 available
 ...
   - if module Digest::SHA is not available, a module Digest::SHA1
 will be used, but at least one of them must be installed;
 a DKIM plugin requires Digest::SHA (the older Digest::SHA1 does not
 support sha256 hashes), so in practice the Digest::SHA is required

It appears that Net::DNS requires Digest::HMAC_MD5 and that Digest::HMAC_MD5 
requires Digest::SHA1.  So that for full functionality, both SHA and SHA1 are 
needed.

RE: ANNOUNCE: Apache SpamAssassin 3.3.0-beta1 available

2009-12-08 Thread Rosenbaum, Larry M.

SpamAssassin version 3.3.0-beta1
  running on Perl version 5.10.1
  Solaris 9 Sparc

I am getting the following errors in make test:

t/timeout.t ... 5/27 # Failed test 5 in t/timeout.t at line 
63
t/timeout.t ... 7/27 # Failed test 7 in t/timeout.t at line 
71
t/timeout.t ... 9/27 # Failed test 9 in t/timeout.t at line 
79
t/timeout.t ... 11/27 # Failed test 11 in t/timeout.t at 
line 87
t/timeout.t ... 13/27 # Failed test 13 in t/timeout.t at 
line 95
t/timeout.t ... 16/27 # Failed test 16 in t/timeout.t at 
line 108
# Failed test 17 in t/timeout.t at line 109
# Failed test 18 in t/timeout.t at line 110
t/timeout.t ... 22/27 # Failed test 22 in t/timeout.t at 
line 122
# Failed test 24 in t/timeout.t at line 124
t/timeout.t ... 25/27 # Failed test 25 in t/timeout.t at 
line 129
# Failed test 27 in t/timeout.t at line 131
t/timeout.t ... Failed 12/27 subtests

Thanks, Larry

 -Original Message-
 From: Warren Togami [mailto:wtog...@redhat.com]
 Sent: Sunday, December 06, 2009 10:01 PM
 To: SpamAssassin Users List; Development discussions related to Fedora
 Core
 Subject: ANNOUNCE: Apache SpamAssassin 3.3.0-beta1 available
 
 Apache SpamAssassin 3.3.0-beta1 is now available for testing.

DCC problems

2009-07-07 Thread Rosenbaum, Larry M.

Recently I've started seeing message like this in our log files:

Jul  7 13:22:48 ornl73 dccifd[21907]: [ID 702911 mail.notice] no working DCC 
serversdcc1.dcc-servers.net dcc2.dcc-servers.net dcc3.dcc-servers.net ... at 
64.124.52.232 208.201.249.2
Jul  7 13:22:48 ornl73 dccifd[21907]: [ID 702911 mail.error] no working DCC 
serversdcc1.dcc-servers.net dcc2.dcc-servers.net dcc3.dcc-servers.net ... at 
64.124.52.232 208.201.249.2
Jul  7 13:22:52 ornl73 last message repeated 4 times
Jul  7 13:22:55 ornl73 dccifd[21907]: [ID 702911 mail.error] continue not 
asking DCC 2045 seconds after failure
Jul  7 13:23:00 ornl73 dccifd[21907]: [ID 702911 mail.error] continue not 
asking DCC 2040 seconds after failure

I don't see any messages on the DCC home page
http://www.rhyolite.com/dcc/
to indicate any problems with the DCC servers.  Does anybody have any ideas 
what's wrong?

Thanks,
Larry

RE: DCC problems

2009-07-07 Thread Rosenbaum, Larry M.

Thanks.  It was a firewall issue.

From: Michael Scheidell [mailto:scheid...@secnap.net]
Sent: Tuesday, July 07, 2009 2:54 PM
To: Rosenbaum, Larry M.
Cc: users@spamassassin.apache.org
Subject: Re: DCC problems

Rosenbaum, Larry M. wrote:

Recently I've started seeing message like this in our log files:

Jul  7 13:22:48 ornl73 dccifd[21907]: [ID 702911 mail.notice] no working DCC 
serversdcc1.dcc-servers.net dcc2.dcc-servers.net dcc3.dcc-servers.net ... at 
64.124.52.232 208.201.249.2

Jul  7 13:22:48 ornl73 dccifd[21907]: [ID 702911 mail.error] no working DCC 
serversdcc1.dcc-servers.net dcc2.dcc-servers.net dcc3.dcc-servers.net ... at 
64.124.52.232 208.201.249.2

Jul  7 13:22:52 ornl73 last message repeated 4 times

Jul  7 13:22:55 ornl73 dccifd[21907]: [ID 702911 mail.error] continue not 
asking DCC 2045 seconds after failure

Jul  7 13:23:00 ornl73 dccifd[21907]: [ID 702911 mail.error] continue not 
asking DCC 2040 seconds after failure

Try doing a cdcc rtt
then find out if someone monitoring the ornl.gov firewalls mucked with the udp 
settings.
if you are using the public servers and are doing more then 100,000 'opts' a 
day, you might have gotten rate limited.

oh, and if you are inside of a freebsd jail, you need version .111 or better of 
DCC.
--
Michael Scheidell, CTO
Phone: 561-999-5000, x 1259
 | SECNAP Network Security Corporation
* Certified SNORT Integrator
* 2008-9 Hot Company Award Winner, World Executive Alliance
* Five-Star Partner Program 2009, VARBusiness
* Best Anti-Spam Product 2008, Network Products Guide
* King of Spam Filters, SC Magazine 2008

This email has been scanned and certified safe by SpammerTrap(r).
For Information please see 
www.secnap.com/products/spammertrap/http://www.secnap.com/products/spammertrap/

RE: Plugin extracting text from docs

2009-07-06 Thread Rosenbaum, Larry M.

 From: Jonas Eckerman [mailto:jonas_li...@frukt.org]
 
 Rosenbaum, Larry M. wrote:
 
  It appears that pdftohtml is only available as a Windows executable
 (on Sourceforge).
 
 If you want a precompiled executable it seems Windows is the only
 platform, but AFAICS the source code is also available at
 http://sourceforge.net/projects/pdftohtml/files/

I have found the Xpdf package, which pdftohtml is based on, has a pdftotext 
command line utility.  If you build it with the --without-x option, you get 
just the command line utilities without the X-windows stuff, which eliminates 
the need to install a bunch of font software.

Bayes expiration logic

2009-07-06 Thread Rosenbaum, Larry M.

Has anybody considered revising the Bayes expiration logic?  Maybe it's just 
our data that's weird, but the built-in expiration logic doesn't seem to work 
very well for us.  Here are my observations:

There's no point in checking anything older than oldest_atime.  For this value 
and older, zero tokens will be expired.  The current estimation pass logic goes 
back 256 days, even if the oldest atime is one week and the calculations have 
already started returning zeroes.

If your target corresponds to a delta of more than a few days, you're unlikely 
to get very close to it because the estimation pass logic uses exponentially 
increasing intervals.  There could be a big difference between 8 days and 16 
days for delta.

The initial guesstimate algorithm can choose a delta that's older than the 
oldest atime, which will result in the dreaded expired 0E0 tokens.  
Conversely, it can choose a delta so new that far too many tokens are expired.  
You're guaranteed to have at least 100,000 tokens left, but that's not good 
enough if you have set the max DB size to a million or more.

I suggest using a binary search or perhaps linear interpolation.  The starting 
endpoints would be

1) Oldest atime.  We already know it will expire zero tokens.

2) 12 hours ago.  Calculate the number of tokens expired for this value.  If it 
expires too few, then use this as your delta (or quit if it expires less than 
1000).  If it expires too many, you have your two endpoints to begin the 
search.  You can decide when to quit by closeness to the target, size of the 
interval, or number of iterations, or some combination.

The only problem I've seen is that the token age distribution is nonlinear 
enough that there are some cases where linear interpolation doesn't converge 
very well, and I don't know the best way to introduce fudge factors to get 
around this.

RE: Plugin extracting text from docs

2009-07-02 Thread Rosenbaum, Larry M.

 And, please tell me of problems.
 
 pdftohtml is imho not found in gentoo, but pdf2html is maybe the same ?

It appears that pdftohtml is only available as a Windows executable (on 
Sourceforge).  I need something that will run on Solaris.

RE: Plugin extracting text from docs (was: new spam using large images)

2009-07-01 Thread Rosenbaum, Larry M.

We can use antiword to render text from MSWord files, and unrtf to render text 
from RTF files.  What is the best tool to render text from PDF files?

(We are running Solaris 9)

L

 -Original Message-
 From: Jonas Eckerman [mailto:jonas_li...@frukt.org]
 Sent: Wednesday, June 24, 2009 1:34 PM
 To: users@spamassassin.apache.org
 Subject: Plugin extracting text from docs (was: new spam using large
 images)
 
 Jason Haar wrote:
 
  Speaking of image/rtf/word attachment spam; is there any work going
 on
  to standardize this so that the textual output of such attachments
 could
  be fed back into SA?
 
 Just as a note:
 
 I'm currently working on a modular plugin for extracting text and add
 it
 to SA message parts.
 
 The plugin can use either external tools or it's own simple plugin
 modules. How to extract text from parts is configurable, and based on
 mime types and file names, so new formats can be added by simply
 configuring for new external tolls or creating a new plugin module.
 
 My *far* from finished module currently manages to extract text from
 Word documents (using antiword), OpenXML text documents (using a simple
 plugin) and RTF (using unrtf).
 
 I haven't tested where and how the extracted text is available to
 SpamAssassin yet (as noted, it's *far* from finished), but I am using
set_rendered method as in the example, so it should work. ;-)
 
 Regards
 /Jonas
 --
 Jonas Eckerman
 Fruktträdet  Förbundet Sveriges Dövblinda
 http://www.fsdb.org/
 http://www.frukt.org/
 http://whatever.frukt.org/

RE: SORBS bites the dust

2009-06-23 Thread Rosenbaum, Larry M.



 -Original Message-
 From: Matus UHLAR - fantomas [mailto:uh...@fantomas.sk]
  IMPORTANT: If sorbs does not get picked-up by a new host, will SA
  developers be ready to roll-out an SA update to remove the sorbs
 rules, so
  that we don't suffer a bunch of timeouts? Or how does that work?
 
 On 23.06.09 09:29, Jeff Moss wrote:
  WHAT?  Sorbs and Spamhaus are polar opposites.  Spamhaus is a great
  organization while SORBS is a POS that helped give all blacklists a
 bad name.
 
 sorbs makes good job, although there are some whiners not understanding
 the
 stuff...
 
  I don't know if SpamAssassin has ever used it.
 
 it still does:
 
 50_scores.cf:score RCVD_IN_SORBS_BLOCK 0 # n=1 n=2 n=3
 50_scores.cf:score RCVD_IN_SORBS_DUL 0 1.615 0 0.877 # n=0 n=2
 50_scores.cf:score RCVD_IN_SORBS_HTTP 0 0.001 0 0.001 # n=0 n=2
 50_scores.cf:score RCVD_IN_SORBS_MISC 0 0.001 0 0.353 # n=0 n=2
 50_scores.cf:score RCVD_IN_SORBS_SMTP 0 # n=0 n=1 n=2 n=3
 50_scores.cf:score RCVD_IN_SORBS_SOCKS 0 0.182 0 0.801 # n=0 n=2
 50_scores.cf:score RCVD_IN_SORBS_WEB 0 1.117 0 0.619 # n=0 n=2
 50_scores.cf:score RCVD_IN_SORBS_ZOMBIE 0 # n=0 n=1 n=2 n=3

Notice that the SORBS spam sources list (the one that charged a delisting 
fee) is not used.

 --
 Matus UHLAR - fantomas, uh...@fantomas.sk ; http://www.fantomas.sk/
 Warning: I wish NOT to receive e-mail advertising to this address.
 Varovanie: na tuto adresu chcem NEDOSTAVAT akukolvek reklamnu postu.
 Remember half the people you know are below average.

RE: new spam using large images

2009-06-19 Thread Rosenbaum, Larry M.

 From: felic...@kluge.net On Behalf Of Theo Van Dinter

 On Fri, Jun 19, 2009 at 3:04 AM, Jason Haarjason.h...@trimble.co.nz
 wrote:
  Speaking of image/rtf/word attachment spam; is there any work going
 on
  to standardize this so that the textual output of such attachments
 could
  be fed back into SA?
 
 That functionality already exists (has for almost 3 years, actually),
 but as in the past (list archives) the documentation hasn't improved
 for it. :(
 
 Here's my last(?) post about it which has some sample code and
 everything:
 
 http://www.nabble.com/Re:-PDFText-Plugin-for-PDF-file-scoring---not-
 for-PDF-images-p11595641.html

Thanks for the sample code.  Once you get the $p object from 
$msg-find_parts(), how do you extract the contents of the message part to run 
it through antiword or whatever?

L

419 scams in .doc and .rtf attachments

2009-06-16 Thread Rosenbaum, Larry M.

We get a significant number of 419 scam letters where the actual spam text is 
in a Word (.doc or .rtf) or PDF attachment.  Example:

http://pastebin.com/m4a161daa

It would be really great if there was an SA plugin to extract the text from the 
attachment and then feed the text to the regular SA body rules.  Has anybody 
looked at that possibility?

Thanks, Larry

Do I need to adjust bayes_expiry_max_db_size?

2009-03-19 Thread Rosenbaum, Larry M.

We are running
SpamAssassin version 3.2.5
  running on Perl version 5.8.8
  Solaris 9 Sparc
with the MySQL Bayes store and autolearning.  We are using
bayes_expiry_max_db_size  100
Expiry is done manually once a day.  Here is a typical output from expiry:

Thu Mar 19 00:12:00 EDT 2009 Forcing Bayes expiry run
[2541] dbg: bayes: using username: root
[2541] dbg: bayes: database connection established
[2541] dbg: bayes: found bayes db version 3
[2541] dbg: bayes: Using userid: 217
[2541] dbg: bayes: bayes journal sync starting
[2541] dbg: bayes: bayes journal sync completed
[2541] dbg: bayes: expiry starting
[2541] dbg: bayes: expiry check keep size, 0.75 * max: 75
[2541] dbg: bayes: token count: 1792961, final goal reduction size: 1042961
[2541] dbg: bayes: first pass? current: 1237435937, Last: 1237349670, atime: 
86400, count: 669993, newdelta: 55502, ratio: 1.55667447271837, period: 43200
[2541] dbg: bayes: can't use estimation method for expiry, unexpected result, 
calculating optimal atime delta (first pass)
[2541] dbg: bayes: expiry max exponent: 9
[2541] dbg: bayes: atime token reduction
[2541] dbg: bayes:  ===
[2541] dbg: bayes: 43200 1144230
[2541] dbg: bayes: 86400 732048
[2541] dbg: bayes: 172800 0
[2541] dbg: bayes: 345600 0
[2541] dbg: bayes: 691200 0
[2541] dbg: bayes: 1382400 0
[2541] dbg: bayes: 2764800 0
[2541] dbg: bayes: 5529600 0
[2541] dbg: bayes: 11059200 0
[2541] dbg: bayes: 22118400 0
[2541] dbg: bayes: first pass decided on 86400 for atime delta
[2541] dbg: bayes: expiry completed
expired old bayes database entries in 172 seconds
1060954 entries kept, 732007 deleted
token frequency: 1-occurrence tokens: 53.44%
token frequency: less than 8 occurrences: 28.22%
Thu Mar 19 00:15:09 EDT 2009 Done

This is telling me that there are no tokens more than 2 days old.  Is this good 
or bad?  Should I be increasing the max DB size?

Thanks, Larry

Note: Here is the --dump magic output:

0.000  0  3  0  non-token data: bayes db version
0.000  07894739  0  non-token data: nspam
0.000  0   10477619  0  non-token data: nham
0.000  01428534  0  non-token data: ntokens
0.000  0 1237349612  0  non-token data: oldest atime
0.000  0 1237479369  0  non-token data: newest atime
0.000  0  0  0  non-token data: last journal sync atime
0.000  0 1237436073  0  non-token data: last expiry atime
0.000  0  86400  0  non-token data: last expire atime delta
0.000  0 732007  0  non-token data: last expire reduction 
count

RE: Hostname in X-Spam-Checker-Version Header

2009-03-17 Thread Rosenbaum, Larry M.

 From: netz-haut - stephan seitz [mailto:s.se...@netz-haut.de]
 Sent: Tuesday, March 17, 2009 1:15 PM
 To: users@spamassassin.apache.org
 Subject: Hostname in X-Spam-Checker-Version Header

 Hi there,

 as shown in the manual, the X-Spam-Checker-Version header is not
 configurable for some reason. Is there some configuration magic to
 change just the hostname of this header? We're running a bunch of
 multi-homed mail servers and I just want to glue the shown name to the
 official hostname, which is also located locally (just another
 interface in the same hosts).

Try report_hostname in local.cf.

freemail_re error

2009-02-27 Thread Rosenbaum, Larry M.

What is the significance of this message in the spamd log?

Feb 27 12:56:25 localhost spamd[222]: config: dup unknown type freemail_re, 
Regexp

(Somebody asked a similar question in November, but I didn't see an answer.)

Thanks,
Larry

RE: misc_10.cf

2009-02-10 Thread Rosenbaum, Larry M.

The information on the download page should be corrected to point to the right 
file.  Or perhaps removed entirely: when you build from source, the build 
procedure prompts you for site contact information and puts it into the 
10_default_prefs.cf file.  It is also put into the sa-update script, so that 
the substitution is made whenever sa-update downloads a new version of 
10_default_prefs.cf.  (I don't know if this happens for CPAN or package 
installs.)

 -Original Message-
 From: Matt Kettler [mailto:mkettler...@verizon.net]
 Sent: Tuesday, February 10, 2009 1:16 AM
 To: RobertH
 Cc: users@spamassassin.apache.org
 Subject: Re: misc_10.cf
 
 RobertH wrote:
 
 
 
  Um, that's a file that comes with SA, and it is *NOT* user editable.
  Therefore, it's not an example, it is a standard config file
  that generates the default settings that you later over-ride
  with your local.cf.
 
  The 3.2.5 installation tarball will install the version of
  this file that is appropriate for 3.2.5, and sa-update may update
 it.
 
 
 
 
  matt,
 
  i am not seeing that file anywhere in my install and i am quite
 capable of
  using the locate command etc...
 
 Ahh, I forgot, 10_misc.cf has been renamed to 10_default_prefs.cf. My
 bad.
 
 Here's the 3.2 version.
 
 http://svn.apache.org/repos/asf/spamassassin/branches/3.2/rules/10_defa
 ult_prefs.cf
 
 It should, by default, be in /usr/share/spamassassin, along with the
 other files that create the default ruleset.
 
  Updated ones created by sa-update would be in /var/lib/spamassassin.
 
 You may want to template off that (see below)
 
  i am fairly certain i hand generated and installed via rpm generated
 by
 
  rpm -tb sa-tarballname.whateveritwas.somethingsomething
 
  something like that.
 
  on a centos aka redhat clone
 
  the misc_10.cf file looks pretty editable to me in some respects.
 
 If it looks editable, please note it contains this text near the top:
 
 # Please don't modify this file as your changes will be overwritten
 with
 # the next update. Use @@LOCAL_RULES_DIR@@/local.cf instead.
 # See 'perldoc Mail::SpamAssassin::Conf' for details.
 
 (see below for more clarification)
 
 
 
  i wouldnt have even have asked if i had not gone to
 
  spamassassin.apache.org and then clicked on downloads and on that
 page it
  says
 
  System Administrators
  Please create a local copy of the report_template text in a file
 named
  something like /etc/mail/spamassassin/10_local_report.cf,
 Ok, *that* you can do. You can, at the /etc/mail/spamassassin/ level
 create a file, with any name, that has the report_template parts of the
 file and edit that. This is, of course, creating a copy in your site
 rules dir, which is OK.
 
 I was trying to steer you away from the very common mistake of editing
 the base config files in /usr/share/spamassassin, as they get
 over-ridden, or obliterated, by sa-update runs. Editing files at the
 /usr/share/spamassassin or /var/lib/spamassassin level will just result
 in your changes being lost on the next sa-update run. Hence the
 warning.

RE: sa-update damages existing SA installation

2008-12-22 Thread Rosenbaum, Larry M.

 From: Daryl C. W. O'Shea [mailto:spamassas...@dostech.ca]
 Sent: Saturday, December 20, 2008 2:48 AM

 On 19/12/2008 5:40 AM, Marcin Krol wrote:
  Daryl C. W. O'Shea wrote:
  do it all at once.  See my SARE sa-update page for details:
 
  http://daryl.dostech.ca/sa-update/sare/sare-sa-update-howto.txt
 
  Are SARE rules still being updated a bit at least / are they still
 working?

 The only one really being updated is 90_2tld.cf:

What do I need to put in my sa-update channel file to get updates for 
90_2tld.cf?

(I can't get to the howto web page above.)

Backup command for AWL?

2008-12-05 Thread Rosenbaum, Larry M.

The Bayes database can be backed up and restored with sa-learn 
--backup/--restore.  Is there any similar way to back up and restore a 
MySQL-based AWL database?  The check_whitelist command is only good for DBM 
files.

RE: Backup command for AWL?

2008-12-05 Thread Rosenbaum, Larry M.

 From: Theo Van Dinter [mailto:[EMAIL PROTECTED]

 On Fri, Dec 05, 2008 at 11:58:26AM -0500, Rosenbaum, Larry M. wrote:
  The Bayes database can be backed up and restored with sa-learn --
 backup/--restore.  Is there any similar way to back up and restore a
 MySQL-based AWL database?  The check_whitelist command is only good
 for DBM files.

 If you're using MySQL, why not just use the standard MySQL backup
 tools?
 ie: mysqldump, etc.

That's a possibility.  I'm currently doing that for AWL and Bayes, but I have a 
question.  Why does the output from mysqldump get bigger every day, while the 
output from sa-learn --backup stays at about the same size?  (The expiry job 
appears to be working properly.)  It's like the bayes_token table is getting 
larger but still holding the same amount of data.

RE: why is SA testing my server in DNSBLs?

2008-12-02 Thread Rosenbaum, Larry M.

 From: Brian J. Murrell [mailto:[EMAIL PROTECTED]

 Hi All,

 I was doing a bit of spamassassin -D testing with SA 3.2.4 and
 noticed
 that it's running my own mail server name through various DNSBL tests.

 Here are the headers of the particular message I am testing:

 From [EMAIL PROTECTED] Tue Dec  2 05:24:59 2008
 Return-Path: [EMAIL PROTECTED]

The checks it's doing below are all RHBL checks, so it's probably testing the 
Return-Path:.

 ...
 [29986] dbg: dns: launching DNS A query for
 linux.interlinx.bc.ca.rhsbl.ahbl.org. in background
 [29986] dbg: async: starting: DNSBL-A,
 dns:A:linux.interlinx.bc.ca.rhsbl.ahbl.org. (timeout 15.0s, min 3.0s)
 [29986] dbg: dns: checking A and MX for host linux.interlinx.bc.ca
 [29986] dbg: dns: launching DNS A query for linux.interlinx.bc.ca in
 background
 [29986] dbg: async: starting: NO_DNS_FOR_FROM, DNSBL-A,
 dns:A:linux.interlinx.bc.ca (timeout 15.0s, min 3.0s)
 [29986] dbg: dns: launching DNS MX query for linux.interlinx.bc.ca in
 background
 [29986] dbg: async: starting: NO_DNS_FOR_FROM, DNSBL-MX,
 dns:MX:linux.interlinx.bc.ca (timeout 15.0s, min 3.0s)
 ...
 [29986] dbg: dns: launching DNS A query for
 linux.interlinx.bc.ca.bl.open-whois.org. in background
 [29986] dbg: async: starting: DNSBL-A,
 dns:A:linux.interlinx.bc.ca.bl.open-whois.org. (timeout 15.0s, min
 3.0s)
 ...
 [29986] dbg: dns: launching DNS A query for
 linux.interlinx.bc.ca.fulldom.rfc-ignorant.org. in background
 [29986] dbg: async: starting: DNSBL-A,
 dns:A:linux.interlinx.bc.ca.fulldom.rfc-ignorant.org. (timeout 15.0s,
 min 3.0s)

RE: optional modules

2008-11-13 Thread Rosenbaum, Larry M.

 From: Stefan Jakobs [mailto:[EMAIL PROTECTED]


   INFO: SA version: 3.2.5, 3.002005, no optional modules:
   Sys::Hostname::Long
 What is the benefit of using this modul?

Sys::Hostname::Long is required by Mail::SPF::Query.  However, Mail::SPF::Query
has been superseded by Mail::SPF, which doesn't require Sys::Hostname::Long.

   Mail::SPF Mail::SPF::Server Mail::SPF::Request Mail::SPF::Mech
   Mail::SPF::Mech::A Mail::SPF::Mech::PTR Mail::SPF::Mech::All
   Mail::SPF::Mech::Exists Mail::SPF::Mech::IP4 Mail::SPF::Mech::IP6
   Mail::SPF::Mech::Include Mail::SPF::Mech::MX Mail::SPF::Mod
   Mail::SPF::Mod::Exp Mail::SPF::Mod::Redirect
 Mail::SPF::SenderIPAddrMech
   Mail::SPF::v1::Record Mail::SPF::v2::Record Mail::SPF::Query
 SPF, I know it.

If you have Mail::SPF, you don't need Mail::SPF::Query (although the 
SpamAssassin build process may complain about it being missing).

   NetAddr::IP NetAddr::IP::Util auto::NetAddr::IP::Util::inet_n2dx
   auto::NetAddr::IP::Util::ipv6_n2d
 What are these modules doing, what is the benefit?

Mail::SPF requires NetAddr::IP.

RE: SURBL Usage Policy change

2008-11-12 Thread Rosenbaum, Larry M.

Where is the price list?  I haven't been able to find it.

 -Original Message-
 From: Joseph Brennan [mailto:[EMAIL PROTECTED]
 Sent: Wednesday, November 12, 2008 12:25 PM
 To: users@spamassassin.apache.org
 Subject: Re: SURBL Usage Policy change

 Jeff Chan [EMAIL PROTECTED] wrote:

   Does anyone know how many non-profits have more than 1,000 users
  (i.e., users with mailboxes)?

 Most universities and colleges have many more than that.  An
 undergrad-only school that admits only about 200 a year would
 pass that number, counting faculty and staff and the summer
 overlap of graduated and admitted student accounts.

 Requiring large organizations to use rsync and charging for it
 makes a lot of sense.  How much, though... and we didn't budget
 this in when we estimated last spring, for the July-June fiscal
 year schools use...

 Joseph Brennan
 Columbia University Information Technology

bayes_token table too big?

2008-10-06 Thread Rosenbaum, Larry M.

SpamAssassin version 3.2.5, running on Perl version 5.8.8, Solaris 9
Using MySQL for Bayes database.

I'm wondering if our Bayes token database is too big, and why.

Based on some posts to this list, I decided to try converting our Bayes and AWL 
databases to InnoDB to improve performance.  So I copied the database to a 
non-production MySQL server and tried to convert it there.  It has taken 4 days 
to convert!  I'm thinking something must be wrong.

Here is the output I'm getting from our Bayes expire job:

Tue Sep 30 00:12:00 EDT 2008 Forcing Bayes expiry run
expired old bayes database entries in 193 seconds
104999743 entries kept, 147355 deleted
token frequency: 1-occurrence tokens: 0.12%
token frequency: less than 8 occurrences: 0.05%
Tue Sep 30 00:15:28 EDT 2008 Done

Wed Oct 1 00:12:00 EDT 2008 Forcing Bayes expiry run
expired old bayes database entries in 210 seconds
105000814 entries kept, 242825 deleted
token frequency: 1-occurrence tokens: 0.11%
token frequency: less than 8 occurrences: 0.06%
Wed Oct 1 00:15:47 EDT 2008 Done

Thu Oct 2 00:12:00 EDT 2008 Forcing Bayes expiry run
expired old bayes database entries in 206 seconds
105032264 entries kept, 239214 deleted
token frequency: 1-occurrence tokens: 0.13%
token frequency: less than 8 occurrences: 0.06%
Thu Oct 2 00:15:39 EDT 2008 Done

And here is the information from the local.cf file:

bayes_expiry_max_db_size  50

So the config file says 500 thousand tokens, but the database has 105 million 
entries.  Have I misunderstood something, or is expiry not working correctly?

RE: bayes_token table too big?

2008-10-06 Thread Rosenbaum, Larry M.

 From: Theo Van Dinter [mailto:[EMAIL PROTECTED]

 On Mon, Oct 06, 2008 at 03:42:53PM -0400, Rosenbaum, Larry M. wrote:
  And here is the information from the local.cf file:
 
  bayes_expiry_max_db_size  50
 
  So the config file says 500 thousand tokens, but the database has 105
 million entries.  Have I misunderstood something, or is expiry not
 working correctly?

 Do an expire run w/ -D bayes and show the expiry details.

Mon Oct 6 16:11:00 EDT 2008 Forcing Bayes expiry run
[25080] dbg: bayes: using username: root
[25080] dbg: bayes: database connection established
[25080] dbg: bayes: found bayes db version 3
[25080] dbg: bayes: Using userid: 1
[25080] dbg: bayes: bayes journal sync starting
[25080] dbg: bayes: bayes journal sync completed
[25080] dbg: bayes: expiry starting
[25080] dbg: bayes: expiry check keep size, 0.75 * max: 375000
[25080] dbg: bayes: token count: 105095925, final goal reduction size: 104720925
[25080] dbg: bayes: first pass? current: 1223323871, Last: 1223266468, atime: 
43200, count: 91425, newdelta: 37, ratio: 1145.42986054143, period: 43200
[25080] dbg: bayes: can't use estimation method for expiry, unexpected result, 
calculating optimal atime delta (first pass)
[25080] dbg: bayes: expiry max exponent: 9
[25080] dbg: bayes: atime token reduction
[25080] dbg: bayes:  ===
[25080] dbg: bayes: 43200 69517
[25080] dbg: bayes: 86400 16821
[25080] dbg: bayes: 172800 6
[25080] dbg: bayes: 345600 6
[25080] dbg: bayes: 691200 6
[25080] dbg: bayes: 1382400 6
[25080] dbg: bayes: 2764800 6
[25080] dbg: bayes: 5529600 5
[25080] dbg: bayes: 11059200 3
[25080] dbg: bayes: 22118400 3
[25080] dbg: bayes: first pass decided on 43200 for atime delta
[25080] dbg: bayes: expiry completed
expired old bayes database entries in 118 seconds
105026416 entries kept, 69509 deleted
token frequency: 1-occurrence tokens: 0.15%
token frequency: less than 8 occurrences: 0.05%
Mon Oct 6 16:13:09 EDT 2008 Done

RE: Skip scanning for large mails

2008-09-15 Thread Rosenbaum, Larry M.

 From: mouss [mailto:[EMAIL PROTECTED]

 The samples I looked at could easily be stopped otherwise (I don't
 usuall get a lot of lottery mail with a large .tif from a gmail
 address!!). but it's not worth the pain. if spammers start sending
 large
 messages, things will change...

We just received a 419 spam with a 642 KB JPG file.  It would be nice if 
SpamAssassin could at least look at the text of messages like that.

RE: Skip scanning for large mails

2008-09-15 Thread Rosenbaum, Larry M.

 From: mouss [mailto:[EMAIL PROTECTED]

 Rosenbaum, Larry M. wrote:
  From: mouss [mailto:[EMAIL PROTECTED]

  The samples I looked at could easily be stopped otherwise (I don't
  usuall get a lot of lottery mail with a large .tif from a gmail
  address!!). but it's not worth the pain. if spammers start sending
  large
  messages, things will change...

  We just received a 419 spam with a 642 KB JPG file.  It would be nice
 if SpamAssassin could at least look at the text of messages like that.

 Can you post a copy on a web page, just to see if it can be stopped
 with
 MTA checks.

http://pastebin.com/m479512c6

RE: Skip scanning for large mails

2008-09-15 Thread Rosenbaum, Larry M.

 From: Evan Platt [mailto:[EMAIL PROTECTED]

 Rosenbaum, Larry M. wrote:
 
  We just received a 419 spam with a 642 KB JPG file.  It would be nice
 if SpamAssassin could at least look at the text of messages like that.
 
 
 Wouldn't FuzzyOCR pick up on that?

Not if spamc never passes it to spamd because it's over 256K.

SpamAssassin slowdown

2008-09-09 Thread Rosenbaum, Larry M.

SpamAssassin version 3.2.5
  running on Perl version 5.8.8
Solaris 9 SPARC

For the past few days I have noticed SpamAssassin takes much longer to process 
messages between about 10:00am and 1:00pm EDT.  It doesn't appear to be a 
memory problem or a problem with our DNS server.  To try to figure it out, I 
got a message which  took 58 seconds to process and ran it through 
spamassassin -D, taking note of where the output paused for a long time.  The 
results are shown below.  Can anybody tell me what SpamAssassin is doing at 
that point?  It says compiled body tests, but we are not running a compiled 
configuration.

...
[13557] dbg: Botnet: RDNS is 'mail4.smartdeals-mail.com'
[13557] dbg: Botnet: HELO is 'mail4.smartdeals-mail.com'
[13557] dbg: Botnet: EnvelopeFrom is [EMAIL PROTECTED]
[13557] dbg: Botnet: mail domain is smartdeals-mail.com
[13557] dbg: Botnet: SOHO hit
[13557] dbg: rules: ran eval rule BOTNET_SOHO == got hit (1)
[13557] dbg: dkim: performing public key lookup and signature verification
[13557] warn: Use of uninitialized value in sprintf at
/usr/local/lib/perl5/site_perl/5.8.8/Mail/SpamAssassin/Logger.pm line 213.
[13557] dbg: dkim: signing identity: , d=smartdeals-mail.com, a=rsa-sha1, 
c=nofws
[13557] dbg: dkim: public key lookup or verification failed: cannot change 
identity on
Mail::DKIM::DkSignature at 
/usr/local/lib/perl5/site_perl/5.8.8/Mail/SpamAssassin/Plugin/DKIM.pm
line 366
[13557] dbg: dkim: policy: performing lookup
[13557] dbg: dkim: policy result neutral: v=spf1 mx ip4:69.56.11.40/29 -all
[13557] dbg: Botnet: checking NORDNS
[13557] dbg: Botnet: no trusted relays
[13557] dbg: Botnet: get_relay good RDNS
[13557] dbg: Botnet: IP is '69.56.11.43'
[13557] dbg: Botnet: RDNS is 'mail4.smartdeals-mail.com'
[13557] dbg: Botnet: HELO is 'mail4.smartdeals-mail.com'
[13557] dbg: Botnet: NORDNS miss
[13557] dbg: spf: def_whitelist_from_spf: [EMAIL PROTECTED] is not in
DEF_WHITELIST_FROM_SPF
[13557] dbg: FreeMail: replyto: skipping, envelope sender looks bulk
[13557] dbg: spf: whitelist_from_spf: [EMAIL PROTECTED] is not in user's
WHITELIST_FROM_SPF
[13557] dbg: async: select found no responses ready (t.o.=0.0)
[13557] dbg: async: completed in 0.913 s: URI-DNSBL, 
DNSBL:sbl.spamhaus.org.:197.37.56.69
[13557] dbg: async: completed in 0.912 s: URI-DNSBL, 
DNSBL:sbl.spamhaus.org.:1.15.56.69
[13557] dbg: async: completed in 0.917 s: URI-DNSBL, 
DNSBL:sbl.spamhaus.org.:7.37.42.70
[13557] dbg: async: completed in 0.915 s: URI-DNSBL, 
DNSBL:sbl.spamhaus.org.:42.142.25.69
[13557] dbg: async: completed in 0.920 s: URI-DNSBL, 
DNSBL:sbl.spamhaus.org.:225.145.64.69
[13557] dbg: async: completed in 0.914 s: URI-DNSBL, 
DNSBL:sbl.spamhaus.org.:248.184.52.216
[13557] dbg: async: completed in 0.919 s: URI-DNSBL, 
DNSBL:sbl.spamhaus.org.:200.92.251.63
[13557] dbg: async: completed in 0.918 s: URI-DNSBL, 
DNSBL:sbl.spamhaus.org.:4.32.56.69
[13557] dbg: dns: harvested completed queries
[13557] dbg: rules: running body tests; score so far=12.48

 - Here is where output paused for a long time 
---

[13557] dbg: rules: compiled body tests
[13557] dbg: rules: ran body rule __SARE_PHONE_NUM == got hit: 
866-315-8447
[13557] dbg: rules: ran body rule FB_GET_MEDS == got hit: get a Medic
[13557] dbg: rules: ran body rule __ORNL_LOTTO_BINGO_6 == got hit: email 
address
[13557] dbg: rules: ran body rule __HAS_ANY_EMAIL == got hit: [EMAIL 
PROTECTED]
[13557] dbg: rules: ran body rule __NONEMPTY_BODY == got hit: G
[13557] dbg: rules: ran body rule __KAM_LOTTO3 == got hit: claim
[13557] dbg: rules: ran body rule ACT_NOW_CAPS == got hit: Act Now
[13557] dbg: rules: ran body rule __DOS_LINK == got hit: link
[13557] dbg: rules: running uri tests; score so far=14.075
[13557] dbg: rules: compiled uri tests
[13557] dbg: rules: ran uri rule __DOS_HAS_ANY_URI == got hit: m
[13557] dbg: rules: ran uri rule __LOCAL_PP_NONPPURL == got hit:
http://www.onewaytoadventure.com;
[13557] dbg: pdfinfo: Identified 0 possible mime parts that need checked for 
PDF content
[13557] dbg: pdfinfo: set_tag called for PDFCOUNT 0
[13557] dbg: pdfinfo: set_tag called for PDFIMGCOUNT 0
[13557] dbg: rules: ran eval rule __TAG_EXISTS_BODY == got hit (1)
[13557] dbg: IXHASH: IxHash querying Server nospam.login-solutions.de
[13557] dbg: IXHASH: Computed hash-value de2b33469f305b6e93687e3eb710db4d via 
method 1
[13557] dbg: IXHASH: Now checking 
de2b33469f305b6e93687e3eb710db4d.nospam.login-solutions.de
...

RE: Spam flooding recent days

2008-07-22 Thread Rosenbaum, Larry M.

 From: Michał Jęczalik [mailto:[EMAIL PROTECTED]
 Subject: Spam flooding recent days

 Hello,

 I've noticed a huge increase of spam rate in past 2-3 weeks. Most of it
 are messages with some quite normal Subject:, often (but not
 neccesarily)
 referring to some fake event (i.e. some politician stabbed to death)
 and
 there's only a link, sometimes together with a single sentence, in the
 body

It's called tabloid spam.

http://redtape.msnbc.com/2008/07/no-presidential.html#posts

RE: Upgrade SpamAssassin failing

2008-04-01 Thread Rosenbaum, Larry M.

https://issues.apache.org/SpamAssassin/show_bug.cgi?id=5591


 -Original Message-
 From: Asif Iqbal [mailto:[EMAIL PROTECTED]
 Sent: Tuesday, April 01, 2008 3:59 PM
 To: users@spamassassin.apache.org
 Subject: Re: Upgrade SpamAssassin failing

 Still looking for some suggestion on this


 On Sun, Mar 30, 2008 at 11:59 AM, Asif Iqbal [EMAIL PROTECTED] wrote:
  I have tried to upgrade spamassassin twice with no avail. Current
   version running is 3.2.3 on Solaris 8
 
   cpan install Mail::SpamAssassin
   Running install for module Mail::SpamAssassin
   Running make for J/JM/JMASON/Mail-SpamAssassin-3.2.4.tar.gz
Is already unwrapped into directory
   /usr/local/cpan/build/Mail-SpamAssassin-3.2.4
Has already been processed within this session
   Running make test
   cp sa-compile blib/script/sa-compile
   /usr/bin/perl -MExtUtils::MY -e MY-fixin(shift) blib/script/sa-
 compile
   cp spamc/spamc blib/script/spamc
   /usr/bin/perl -MExtUtils::MY -e MY-fixin(shift)
 blib/script/spamc
   cp sa-learn blib/script/sa-learn
   /usr/bin/perl -MExtUtils::MY -e MY-fixin(shift) blib/script/sa-
 learn
   cp spamassassin blib/script/spamassassin
   /usr/bin/perl -MExtUtils::MY -e MY-fixin(shift)
 blib/script/spamassassin
   cp spamd/spamd blib/script/spamd
   /usr/bin/perl -MExtUtils::MY -e MY-fixin(shift)
 blib/script/spamd
   /usr/bin/perl build/mkrules --exit_on_no_src --src rulesrc --out
 rules
   --manifest MANIFEST --manifestskip MANIFEST.SKIP
   no source directory found: exiting
   /usr/bin/perl build/preprocessor  -Mvars -DVERSION=3.002004
   -DPREFIX=/usr/local -
 DDEF_RULES_DIR=/usr/local/share/spamassassin
   -DLOCAL_RULES_DIR=/etc/mail/spamassassin
   -DLOCAL_STATE_DIR=/var/lib/spamassassin
   -DINSTALLSITELIB=/usr/local/lib/perl5/site_perl/5.8.0
   -DCONTACT_ADDRESS=the administrator of that system -Msharpbang
   -Mconditional -DPERL_BIN=/usr/bin/perl -DPERL_WARN=
   -DPERL_TAINT= -m755 -isa-update.raw -osa-update
   cp sa-update blib/script/sa-update
   /usr/bin/perl -MExtUtils::MY -e MY-fixin(shift) blib/script/sa-
 update
   PERL_DL_NONLAZY=1 /usr/bin/perl -MExtUtils::Command::MM -e
   test_harness(0, 'blib/lib', 'blib/arch') t/*.t
   t/basic_lintok
   t/basic_obj_api.ok
   t/bayesdbm..ok
   t/bayesdbm_flockok
   t/bayessdbm.ok
   t/bayessdbm_seen_delete.ok
   t/bayessql..skipped
  all skipped: no reason given
   t/blacklist_autolearn...ok
   t/body_mod..ok
   t/check_implemented.ok
   t/cidrs.ok
   t/config_errs...skipped
  all skipped: no reason given
   t/config_text...ok
   t/cpp_comments_in_spamc.ok
   t/date..ok
   t/db_awl_path...ok
   t/db_based_whitelistok
   t/db_based_whitelist_ipsok
   t/dcc...skipped
  all skipped: no reason given
   t/debug.ok
   t/desc_wrap.ok
   t/dkim..skipped
  all skipped: no reason given
   t/dnsbl.skipped
  all skipped: no reason given
   t/dnsbl_sc_meta.skipped
  all skipped: no reason given
   t/duplicatesok
   t/get_all_headers...ok
   t/get_headers...ok
   t/gtube.ok
   t/hashcash..ok
   t/html_colors...ok
   t/html_obfu.ok
   t/html_utf8.skipped
  all skipped: no reason given
   t/ifversion.ok
   t/ip_addrs..ok
   t/lang_lint.ok
   t/lang_pl_tests.couldn't set locale correctly
   t/lang_pl_tests.ok
   t/line_endings..ok
   t/lint_nocreate_prefs...ok
   t/memory_cycles.skipped
  all skipped: no reason given
   t/meta..ok
   t/metadata..ok
   t/mimeheaderok
   t/mimeparse.ok
   t/missing_hb_separator..ok
   t/mkrules...ok
   t/nonspam...ok
   t/pluginok
   t/plugin_file...ok
   t/plugin_priorities.ok
   t/prefs_include.ok
   t/prioritiesok
   t/razor2skipped
  all skipped: no reason given
   t/rcvd_parser...ok
   t/re_base_extractionok 1/115
   100% Completed  18.86 rules/sec in 00m00s
 
   100% Completed 2909.18 bases/sec in 00m00s
   t/re_base_extractionok 7/115
   100% Completed  21.38 rules/sec in 00m00s
 
   100% Completed 1984.06 bases/sec in 00m00s
   t/re_base_extractionok 10/115
   100% Completed

RE: AWL scores high after receiving spam from myself?

2008-02-22 Thread Rosenbaum, Larry M.

 From: Andreas Ntaflos [mailto:[EMAIL PROTECTED]
 To spamassassin this spam appears to come from myself. It scored a low
 AWL but
 over 16 points all in all so the next message received from
 [EMAIL PROTECTED] would certainly get high AWL score.

 My questions are these: did I get this right? Is that really what seems
 to be
 happening? If so, how do I handle such a scenario? When it is so easy
 to
 forge header fields does it even make sense to have an AWL that assigns
 scores based on where the mail *appears* to be coming from?

The AWL classifies its history by both return address and IP.  It sounds like 
in your case it is using the wrong IP, which may indicate problems with your 
trust path.  Please see

http://wiki.apache.org/spamassassin/TrustPath

RE: Rule for Russian character sets

2008-02-15 Thread Rosenbaum, Larry M.

 From: Karsten Bräckelmann [mailto:[EMAIL PROTECTED]

 I've pointed it out before. Just use ok_locales, which is all about
 these char sets. No REs, almost no thinking required, no headache. A
 single line, and you're done.

What's the best way to test the character set for use in a meta rule?  We don't 
want to reject all messages with the Russian (Cyrillic) character set, but we 
may want to use something like

if (character set is Russian)  (body contains 'xyzzy')

for instance.  How would we test the character set?

RE: sa-compile format the standard now?

2008-02-14 Thread Rosenbaum, Larry M.

 From: Bowie Bailey [mailto:[EMAIL PROTECTED]

 I would say that sa-compile is the preferred method due to its
 performance benefits.  There aren't many (any?) drawbacks to using it.

I don't use it here because it takes too long (over 20 minutes) to compile.
(This is with SA v3.2.4, which is a big improvement over v3.2.3)

L

RE: sa-compile format the standard now?

2008-02-14 Thread Rosenbaum, Larry M.

 From: Bowie Bailey [mailto:[EMAIL PROTECTED]

 Rosenbaum, Larry M. wrote:
   From: Bowie Bailey [mailto:[EMAIL PROTECTED]

   I would say that sa-compile is the preferred method due to its
   performance benefits.  There aren't many (any?) drawbacks to using
   it.

  I don't use it here because it takes too long (over 20 minutes) to
  compile. (This is with SA v3.2.4, which is a big improvement over
  v3.2.3)

 Why is this an issue?  You only have to recompile the rules if there is
 an update and this is only once every week or two even when things are
 active.

We have a lot of local rules that I fine-tune often, sometimes several times a 
day.

RE: Googlepages Livefilestore spams

2008-01-10 Thread Rosenbaum, Larry M.

Is it safe to use unbounded quantifiers like + and {2,} in uri rules?  I avoid 
them in regular body rules.

L

 -Original Message-
 From: Ben Lentz [mailto:[EMAIL PROTECTED]
 Sent: Wednesday, January 09, 2008 10:56 PM
 To: users@spamassassin.apache.org
 Subject: Re: Googlepages  Livefilestore spams

  but this URI redirection stuff isn't very friendly
  when used by a spammer.

  Ben, the key is the btnI param, which maps to the I'm feeling
 lucky
  button.
  This technique appeared last summer (I deployed my non-SA-based rule
 on
  03-Jul-2007).

 Thank you, this is very valuable. I wonder if Google will ever consider
 turning it off, since it's being abused.

 For now, I'm going with:

 uri GOOG_REDIR_SLASH
 m{^https?://(?:\w+\.)*google\.(com|co\.uk|tw)/{2,}search}
 score   GOOG_REDIR_SLASH1.0
 describeGOOG_REDIR_SLASHGoogle URL has extra slashes
 after domain
 uri GOOG_REDIR_LUCKY
 m{^https?://(?:\w+\.)*google\.(com|co\.uk|tw)/+search.*btnI}
 score   GOOG_REDIR_LUCKY3.0
 describeGOOG_REDIR_LUCKYGoogle URL uses I'm Feeling
 Lucky for blind redirect
 uri GOOG_PAGES
 m{^https?://(?:\w+\.)*googlepages\.(com|co\.uk|tw)}
 score   GOOG_PAGES  2.0
 describeGOOG_PAGES  URL hosted at GooglePages

 ...seems pretty safe.

Mismatched URLs revisited

2007-12-06 Thread Rosenbaum, Larry M.

Some time ago (and more than once) there have been discussions on this list 
about email containing hyperlinks where the link text is a URL that doesn't 
match the URL in the link HREF, and the pros and cons of testing for and 
scoring these mismatched links.  My management has raised this issue.  My 
memory is hazy on what the final opinions were - it seems like this was 
initially discouraged, but later discussions may have been less discouraging.  
Could somebody point me to the threads where this is discussed?  Also, does 
SpamAssassin currently contain any rules for this kind of testing, or are there 
third-party rules that do this?

Thanks, Larry

whitelist_from_rcvd with numeric IP?

2007-10-30 Thread Rosenbaum, Larry M.

The documentation for whitelist_from_rcvd shows examples like this:

whitelist_from_rcvd [EMAIL PROTECTED]  example.com

What if the sending server has no rDNS?  Is there a way to use this feature 
with a numeric IP instead of a rDNS domain?  If so, what is the syntax?

Thanks, Larry

spamd children killed but don't die?

2007-08-24 Thread Rosenbaum, Larry M.

This morning on one of our servers, spamd was having problems.  There
were 8 spamd children running, but top showed only two of them were
using any CPU time even though there was a backlog of messages to be
processed.  The log file included lines like this:

Aug 24 08:49:22 localhost spamd[21051]: prefork: child states: BBBK
Aug 24 08:49:23 localhost spamd[21051]: prefork: child states: BBBK
Aug 24 08:49:32 localhost spamd[21051]: prefork: child states: BBBK
Aug 24 08:49:41 localhost spamd[21051]: prefork: child states: BBBK
Aug 24 08:49:49 localhost spamd[21051]: prefork: child states: BBBK
Aug 24 08:49:51 localhost spamd[21051]: prefork: child states: BBBK
Aug 24 08:49:59 localhost spamd[21051]: prefork: child states: BBBK
Aug 24 08:50:00 localhost spamd[21051]: prefork: child states: BBBK
Aug 24 08:50:04 localhost spamd[21051]: prefork: child states: BBBK
Aug 24 08:50:06 localhost spamd[21051]: prefork: child states: BBBK
Aug 24 08:50:12 localhost spamd[21051]: prefork: child states: BBBK

which I think means 3 children busy, 5 children waiting to die.  This
(the multiple K children) had been going on for a few hours, which
prevented new children from being spawned to handle the load.
Restarting spamd via kill -HUP restored normal operation.

Why were the killed processes not dying?

System information:

SunOS email 5.9 Generic_118558-39 sun4u sparc SUNW,Sun-Fire-V210
SpamAssassin Server version 3.2.3
  running on Perl 5.8.8
  with SSL support (IO::Socket::SSL 0.97)
  with zlib support (Compress::Zlib 1.41)

Process information (combination of top and ps):

Fri Aug 24 08:56:52 2007  last pid: 23996;  load averages:  0.55,  0.54,
0.52
192 processes: 190 sleeping, 2 on cpu
CPU states: 88.4% idle,  4.2% user,  3.4% kernel,  3.9% iowait,  0.0%
swap
Memory: 2048M real, 342M free, 1323M swap in use, 6109M swap free

USER   PID   PPID   STIME TIMESTATESIZE RES
CPU
spamd23740  2105108:55:39 0:17sleep 72M 59M
11.40%
spamd20459  2105108:16:13 7:08cpu/1 77M 66M
6.27%
root 21051  109:47:26 2:17sleep 66M 58M
0.03%
spamd27830  2105103:39:24 5:28sleep 81M 70M
0.00%
spamd27926  2105103:39:37 0:26sleep 76M 63M
0.00%
spamd14411  2105100:53:09 0:11sleep 70M 51M
0.00%
spamd22780  2105102:37:46 0:06sleep 71M 57M
0.00%
spamd22775  2105102:37:32 0:04sleep 70M 56M
0.00%
spamd22776  2105102:37:32 0:01sleep 68M 54M
0.00%

spamd startup command:

ulimit -n 256
spamd -d -u spamd -r $pidfile -x -m 8 --syslog=local2
--syslog-socket=inet -i -A $me,$em1,$em2,$em3,$em4

RE: warning - score undef for rule 'MISSING_SUBJECT'...

2007-08-15 Thread Rosenbaum, Larry M.

 From: Leon Kolchinsky [mailto:[EMAIL PROTECTED]
 
  The first time I run sa-update after a v3.2.3 install, I get the
  following warnings:
 
  rules: score undef for rule 'MISSING_SUBJECT' in ''
'MISSING_SUBJECT'
 at
  ...
 
 The score MISSING_SUBJECT is removed from 3.1.x and 3.2.x now.
 You could check your local.cf (or in some .pre file) for this score
and
 remove it.

That's not what grep says:

email# grep MISSING_SUBJECT
/var/lib/spamassassin/3.002003/updates_spamassassin_org/*
/var/lib/spamassassin/3.002003/updates_spamassassin_org/20_head_tests.cf
:meta MISSING_SUBJECT   !__HAS_SUBJECT
/var/lib/spamassassin/3.002003/updates_spamassassin_org/20_head_tests.cf
:describe MISSING_SUBJECT   Missing Subject: header
/var/lib/spamassassin/3.002003/updates_spamassassin_org/30_text_de.cf:la
ng de describe MISSING_SUBJECT Betreff (Subject) fehlt
/var/lib/spamassassin/3.002003/updates_spamassassin_org/50_scores.cf:sco
re MISSING_SUBJECT 2.307 1.285 2.476 1.762

email# grep MISSING_SUBJECT /usr/local/share/spamassassin/*.cf
/usr/local/share/spamassassin/20_head_tests.cf:meta MISSING_SUBJECT
!__HAS_SUBJECT
/usr/local/share/spamassassin/20_head_tests.cf:describe MISSING_SUBJECT
Missing Subject: header
/usr/local/share/spamassassin/30_text_de.cf:lang de describe
MISSING_SUBJECT Betreff (Subject) fehlt
/usr/local/share/spamassassin/50_scores.cf:score MISSING_SUBJECT 2.307
1.285 2.476 1.762

email# grep MISSING_SUBJECT /etc/mail/spamassassin/*.cf
email#

warning - score undef for rule 'MISSING_SUBJECT'...

2007-08-14 Thread Rosenbaum, Larry M.

The first time I run sa-update after a v3.2.3 install, I get the
following warnings:

rules: score undef for rule 'MISSING_SUBJECT' in '' 'MISSING_SUBJECT' at
/usr/local/lib/perl5/site_perl/5.8.8/Mail/SpamAssassin/PerMsgStatus.pm
line 2140.
rules: score undef for rule 'EMPTY_MESSAGE' in '' 'EMPTY_MESSAGE' at
/usr/local/lib/perl5/site_perl/5.8.8/Mail/SpamAssassin/PerMsgStatus.pm
line 2140.
rules: score undef for rule 'NO_RECEIVED' in '' 'NO_RECEIVED' at
/usr/local/lib/perl5/site_perl/5.8.8/Mail/SpamAssassin/PerMsgStatus.pm
line 2140.
rules: score undef for rule 'MISSING_SUBJECT' in '' 'MISSING_SUBJECT' at
/usr/local/lib/perl5/site_perl/5.8.8/Mail/SpamAssassin/PerMsgStatus.pm
line 2140.
rules: score undef for rule 'EMPTY_MESSAGE' in '' 'EMPTY_MESSAGE' at
/usr/local/lib/perl5/site_perl/5.8.8/Mail/SpamAssassin/PerMsgStatus.pm
line 2140.
rules: score undef for rule 'NO_RECEIVED' in '' 'NO_RECEIVED' at
/usr/local/lib/perl5/site_perl/5.8.8/Mail/SpamAssassin/PerMsgStatus.pm
line 2140.
...
(repeated several times)

The update succeeds anyway.  What causes these warnings?

Thanks, Larry

RE: ANNOUNCE: Apache SpamAssassin 3.2.3 available

2007-08-09 Thread Rosenbaum, Larry M.

I don't see it.  How long does it take to get to the mirrors?

 -Original Message-
 From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Justin Mason
 Sent: Thursday, August 09, 2007 3:39 PM
 To: users@SpamAssassin.apache.org; [EMAIL PROTECTED];
 [EMAIL PROTECTED]
 Subject: ANNOUNCE: Apache SpamAssassin 3.2.3 available
 
 Apache SpamAssassin 3.2.3 is now available!  This is a maintenance
 release of the 3.2.x branch.
 
 Downloads are available from:
   http://spamassassin.apache.org/downloads.cgi
 
 The release file will also be available via CPAN in the near future.
 
   md5sum of archive files:
   e9a5fd94dead0fca3f26fb3feb0c8e57  Mail-SpamAssassin-3.2.3.tar.bz2
   2e356b70b9458b44a828c19f6e816521  Mail-SpamAssassin-3.2.3.tar.gz
   6ea8ef7f37e4b305217fa8074dd2219e  Mail-SpamAssassin-3.2.3.zip
 
   sha1sum of archive files:
   53199e0218d2f93043fcdca4db3f164f1f9f7cbc  Mail-SpamAssassin-
 3.2.3.tar.bz2
   93337a5cf6cc6f4980307c08ad65575fa08d1f54  Mail-SpamAssassin-
 3.2.3.tar.gz
   0eca91718518547323f43b5473d1362032edb592
Mail-SpamAssassin-3.2.3.zip

3.2.3 spamd_hup test failed

2007-08-09 Thread Rosenbaum, Larry M.

SpamAssassin v3.2.3, Perl 5.8.8, Solaris 9

 

What would cause this error?

 

t/spamd_hup.ok 1/110# Failed test 5 in t/spamd_hup.t
at line 40

#  t/spamd_hup.t line 40 is:   ok (-e $pid_file) or warn $pid_file does
not exist post restart;

log/spamd.pid does not exist post restart at t/spamd_hup.t line 40.

Could not open pid file log/spamd.pid: No such file or directory

Exiting subroutine via next at t/SATest.pm line 844.

Could not open pid file log/spamd.pid: No such file or directory

Exiting subroutine via next at t/SATest.pm line 844.

Could not open pid file log/spamd.pid: No such file or directory

Exiting subroutine via next at t/SATest.pm line 844.

Could not open pid file log/spamd.pid: No such file or directory

Exiting subroutine via next at t/SATest.pm line 844.

Could not open pid file log/spamd.pid: No such file or directory

Exiting subroutine via next at t/SATest.pm line 844.

Could not open pid file log/spamd.pid: No such file or directory

Exiting subroutine via next at t/SATest.pm line 844.

Could not open pid file log/spamd.pid: No such file or directory

Exiting subroutine via next at t/SATest.pm line 844.

Could not open pid file log/spamd.pid: No such file or directory

Exiting subroutine via next at t/SATest.pm line 844.

Could not open pid file log/spamd.pid: No such file or directory

Exiting subroutine via next at t/SATest.pm line 844.

Could not open pid file log/spamd.pid: No such file or directory

Exiting subroutine via next at t/SATest.pm line 844.

t/spamd_hup.FAILED tests 5, 7-110

Failed 105/110 tests, 4.55% okay

 

The same test on a different, supposedly identical system passed.  Also
it passed when I ran it manually with prove -v t/spamd_hup.t.

iXhash zone transfer?

2007-08-07 Thread Rosenbaum, Larry M.

Is it possible to get zone transfers of the iXhash data?

What is bug 5548

2007-08-03 Thread Rosenbaum, Larry M.

In the SA 3.2.2 announcement was this item:

- bug 5548: Certain mail input can take a long time to scan with 100%
CPU
  utilisation, due to backtracking in a rule's regexp. fix

but when I went to look up the bug, I didn't have permission.  Could
somebody give me more detail on this bug?  In particular, we had a
problem with v3.2.1 using a lot of CPU until we put the use bytes;
hack (* see footnote) into Message.pm and I would like to know if 3.2.2
fixes this problem.

* Note that the CPU usage was a problem even after we upgraded Perl to
5.8.8 to get rid of the UTF bug and associated log file messages.

Microsoft Antigen Spam Manager

2007-07-06 Thread Rosenbaum, Larry M.

I have been asked to look into Microsoft Antigen Spam Manager (and MS
Forefront for Exchange Server).  Does anybody have any information (or
can point me to independent reviews) as to how good this product is and
how it compares with SpamAssassin?

 

Thanks ...

RE: URIBL_BLACK matching on messages with no URLs in them...

2007-07-02 Thread Rosenbaum, Larry M.

 From: Jo Rhett [mailto:[EMAIL PROTECTED]
 
  SA doesn't just look for full URLs, it looks for things that could
be
  hostnames ala copy www.example.com into your browser.
 
 This is fairly nonfunctional.  I've been chasing around all sorts of
 FPs that seem to hit pretty much every message that comes to me with
 source code inside it, and you've probably nailed every one of them
 on the head.  I didn't realize that they were related at the SA level.
 
 I need to completely disable this over-opportunistic behavior.  90%
 of my e-mails have either system output, or are concerning code
 segments or router interfaces, etc, etc.  I need these mails to get
 through.
 
 At the very least, common collisions like script.pl need to be
 disabled.

uridnsbl_skip_domain script.pl

RE: SA 3.2.1 not using SQL for bayes

2007-06-20 Thread Rosenbaum, Larry M.

 From: Hamie [mailto:[EMAIL PROTECTED]
 
 Small problem with SA 3.2.1... I'm using a mysql database. The DB
works
 fine for amavisd-new, and SA AWL. e.g.
 
 [438288] dbg: auto-whitelist: sql-based connected to
 DBI:mysql:sadb:ixx:
 [438288] dbg: auto-whitelist: sql-based using username: vscan01
 [438288] dbg: auto-whitelist: sql-based get_addr_entry: found existing
 entry for [EMAIL PROTECTED]|ip=xx.xx
 [438288] dbg: auto-whitelist: sql-based [EMAIL PROTECTED]|ip=195.53
 scores 21/8.128
 [438288] dbg: auto-whitelist: AWL active, pre-score: -1.498, autolearn
 score: -1.498, mean: 0.387047619047619, IP: xx.xx.xx.xx
 [438288] dbg: auto-whitelist: sql-based add_score: new count: 22, new
 totscore: 6.63 for [EMAIL PROTECTED]|ip=xx.xx
 [438288] dbg: auto-whitelist: sql-based finish: disconnected from
 DBI:mysql:sadb:
 
 
 but the SA bayes SQL spits out the error
 
 [463326] dbg: plugin: loading Mail::SpamAssassin::BayesStore::SQL from
 @INC
 [463326] dbg: bayes: invalid config, must set bayes_sql_dsn config
 variable
 plugin: failed to create instance of plugin
 Mail::SpamAssassin::BayesStore::SQL:
 
 
 Which would be fine... Except my local.cf file (Which also defines the
 config for AWL) says
 
 bayes_sql_dsn  DBI:mysql:sadb:xxx:
 bayes_sql_username spamassassin
 bayes_sql_password fred
 bayes_sql_override_usernamevscan01
 bayes_store_module Mail::SpamAssassin::BayesStore::SQL

I think you want this:

bayes_store_module  Mail::SpamAssassin::BayesStore::MySQL

RE: make test dnsbl tests sporadically fail

2007-06-15 Thread Rosenbaum, Larry M.

I installed both patches and still get errors in some of the dnsbl
tests.  Here is a possibly relevant section of t/log/d.dns/1 from a
system where the test succeeded:

 

[27718] dbg: check: running tests for priority: 500

[27718] dbg: async: select found 1 socks ready

[27718] dbg: uridnsbl: query for uribl-example-b.com took 4 seconds to
look up (multi.surbl.org.:uribl-example-b.com)

[27718] dbg: uridnsbl: query for uribl-example-a.com took 4 seconds to
look up (multi.uribl.com.:uribl-example-a.com)

...

[27718] dbg: uridnsbl: query for uribl-example-a.com took 4 seconds to
look up (bl.open-whois.org.:uribl-example-a.com)

[27718] dbg: async: queries completed: 73 started: 0

[27718] dbg: async: queries active: at Fri Jun 15 11:42:27 2007

[27718] dbg: dns: success for 0 of 73 queries

[27718] dbg: rules: running head tests; score so far=18.85

 

And here is the corresponding log where the tests failed:

 

[10362] dbg: check: running tests for priority: 500

[10362] dbg: async: select found no socks ready

[10362] dbg: uridnsbl: query for uribl-example-b.com took 2 seconds to
look up (multi.surbl.org.:uribl-example-b.com)

[10362] dbg: uridnsbl: query for uribl-example-a.com took 2 seconds to
look up (multi.uribl.com.:uribl-example-a.com)

...

[10362] dbg: uridnsbl: query for uribl-example-a.com took 2 seconds to
look up (bl.open-whois.org.:uribl-example-a.com)

[10362] dbg: async: queries completed: 44 started: 0

[10362] dbg: async: queries active: DNSBL-A=10 DNSBL-TXT=19 at Fri Jun
15 11:59:06 2007

[10362] dbg: async: select found no socks ready

[10362] dbg: async: queries completed: 0 started: 0

[10362] dbg: async: queries active: DNSBL-A=10 DNSBL-TXT=19 at Fri Jun
15 11:59:07 2007

[10362] dbg: async: select found no socks ready

...

[10362] dbg: async: queries completed: 0 started: 0

[10362] dbg: async: queries active: DNSBL-A=10 DNSBL-TXT=19 at Fri Jun
15 11:59:27 2007

[10362] dbg: async: escaping: must have lost requests

[10362] dbg: async: aborting remaining lookups

[10362] dbg: dns: success for 44 of 73 queries

[10362] dbg: rules: running head tests; score so far=14.85

 

So what is going on, and why aren't my socks ready? (Sounds like a
laundry problem...)

 

BTW, looking up 134.88.73.210.sb.dnsbltest.spamassassin.org (one of
the failed lookups) from the command line returns a successful answer
immediately.

 

 

From: Randal, Phil [mailto:[EMAIL PROTECTED] 
Sent: Thursday, June 14, 2007 3:41 AM
To: users@spamassassin.apache.org
Subject: RE: make test dnsbl tests sporadically fail

 

Possibly related to

 

  http://issues.apache.org/SpamAssassin/show_bug.cgi?id=5511

 

as discussed in the DNS tests getting aborted thread?

 

Cheers,

 

Phil

--
Phil Randal
Network Engineer
Herefordshire Council
Hereford, UK 

 

 





From: Rosenbaum, Larry M. [mailto:[EMAIL PROTECTED] 
Sent: 13 June 2007 22:01
To: users@spamassassin.apache.org
Subject: make test dnsbl tests sporadically fail

When I run make test for v3.2.1, why do some of the dnsbl
tests sporadically fail?  For instance:

 

t/dnsbl.Not found: P_2 =
dns:134.88.73.210.dnsbltest.spamassassin.org [127.0.0.4]

# Failed test 1 in t/SATest.pm at line 635

Not found: P_7 =
dns:134.88.73.210.sb.dnsbltest.spamassassin.org?type=TXT

# Failed test 2 in t/SATest.pm at line 635 fail #2

Not found: P_4 =
dns:14.35.17.212.dnsbltest.spamassassin.org [127.0.0.1]

t/dnsbl.NOK 1# Failed test 3 in t/SATest.pm
at line 635 fail #3

Not found: P_3 =
dns:18.13.119.61.dnsbltest.spamassassin.org [127.0.0.12]

# Failed test 4 in t/SATest.pm at line 635 fail #4

Not found: P_5 =
dns:226.149.120.193.dnsbltest.spamassassin.org [127.0.0.1]

# Failed test 5 in t/SATest.pm at line 635 fail #5

t/dnsbl.NOK 2   Not found: P_1 =
dns:98.3.137.144.dnsbltest.spamassassin.org [127.0.0.2]

# Failed test 6 in t/SATest.pm at line 635 fail #6

Not found: P_6 =
dns:example.com.dnsbltest.spamassassin.org [127.0.0.2]

# Failed test 7 in t/SATest.pm at line 635 fail #7

Not found: P_15 =  DNSBL_RHS

t/dnsbl.NOK 3# Failed test 8 in t/SATest.pm
at line 635 fail #8

Not found: P_17 =  DNSBL_SB_FLOAT

t/dnsbl.NOK 4# Failed test 9 in t/SATest.pm
at line 635 fail #9

Not found: P_18 =  DNSBL_SB_STR

# Failed test 10 in t/SATest.pm at line 635 fail #10

Not found: P_16 =  DNSBL_SB_TIME

# Failed test 11 in t/SATest.pm at line 635 fail #11

t/dnsbl.NOK 5   Not found: P_10 =
DNSBL_TEST_DYNAMIC

# Failed test 12 in t/SATest.pm at line 635 fail #12

Not found: P_12

v3.2.1 gives spamd: handle_user unable to find user:

2007-06-13 Thread Rosenbaum, Larry M.

SpamAssassin Server version 3.2.1

  running on Perl 5.8.8

  with zlib support (Compress::Zlib 2.004)

 

I've started seeing the spamd: handle_user unable to find user:
message in the spamd log file.  This was not happening in v3.2.0.   We
are starting spamd with this command:

 

spamd -d -u spamd -r $pidfile -x -m 12 --syslog=local2
--syslog-socket=inet -i -A $me,$em1,$em2,$em3,$em4

 

We are not using any kind of per-user configuration or per-user Bayes or
anything like that, and I don't expect the username that is running
spamc (on another machine) to exist on the spamd machine.   What options
do I need to specify to suppress this error?  Is spamd doing anything it
shouldn't do because of this error?

 

The change in behavior seems to be related to this change to the
got_user_header() function:

 

@@ -1886,9 +1911,12 @@

   handle_user_setuid_with_ldap($current_user);

   $setuid_to_user = 1;# as above

 }

+else {

+  handle_user_setuid_basic($current_user);

+}

   }

   else {

-handle_user($current_user);

+handle_user_setuid_basic($current_user);

 if ( $opt{'sql-config'} ) {

   unless ( handle_user_sql($current_user) ) {

 service_unavailable_error(Error fetching user preferences via
SQL);

make test dnsbl tests sporadically fail

2007-06-13 Thread Rosenbaum, Larry M.

When I run make test for v3.2.1, why do some of the dnsbl tests
sporadically fail?  For instance:

 

t/dnsbl.Not found: P_2 =
dns:134.88.73.210.dnsbltest.spamassassin.org [127.0.0.4]

# Failed test 1 in t/SATest.pm at line 635

Not found: P_7 =
dns:134.88.73.210.sb.dnsbltest.spamassassin.org?type=TXT

# Failed test 2 in t/SATest.pm at line 635 fail #2

Not found: P_4 =  dns:14.35.17.212.dnsbltest.spamassassin.org
[127.0.0.1]

t/dnsbl.NOK 1# Failed test 3 in t/SATest.pm at line
635 fail #3

Not found: P_3 =  dns:18.13.119.61.dnsbltest.spamassassin.org
[127.0.0.12]

# Failed test 4 in t/SATest.pm at line 635 fail #4

Not found: P_5 =
dns:226.149.120.193.dnsbltest.spamassassin.org [127.0.0.1]

# Failed test 5 in t/SATest.pm at line 635 fail #5

t/dnsbl.NOK 2   Not found: P_1 =
dns:98.3.137.144.dnsbltest.spamassassin.org [127.0.0.2]

# Failed test 6 in t/SATest.pm at line 635 fail #6

Not found: P_6 =  dns:example.com.dnsbltest.spamassassin.org
[127.0.0.2]

# Failed test 7 in t/SATest.pm at line 635 fail #7

Not found: P_15 =  DNSBL_RHS

t/dnsbl.NOK 3# Failed test 8 in t/SATest.pm at line
635 fail #8

Not found: P_17 =  DNSBL_SB_FLOAT

t/dnsbl.NOK 4# Failed test 9 in t/SATest.pm at line
635 fail #9

Not found: P_18 =  DNSBL_SB_STR

# Failed test 10 in t/SATest.pm at line 635 fail #10

Not found: P_16 =  DNSBL_SB_TIME

# Failed test 11 in t/SATest.pm at line 635 fail #11

t/dnsbl.NOK 5   Not found: P_10 =
DNSBL_TEST_DYNAMIC

# Failed test 12 in t/SATest.pm at line 635 fail #12

Not found: P_12 =  DNSBL_TEST_RELAY

# Failed test 13 in t/SATest.pm at line 635 fail #13

t/dnsbl.NOK 6   Not found: P_11 =
DNSBL_TEST_SPAM

# Failed test 14 in t/SATest.pm at line 635 fail #14

Not found: P_8 =  DNSBL_TEST_TOP

# Failed test 15 in t/SATest.pm at line 635 fail #15

Not found: P_9 =  DNSBL_TEST_WHITELIST

t/dnsbl.NOK 7# Failed test 16 in t/SATest.pm at line
635 fail #16

Not found: P_14 =  DNSBL_TXT_RE

# Failed test 17 in t/SATest.pm at line 635 fail #17

Not found: P_13 =  DNSBL_TXT_TOP

t/dnsbl.NOK 8# Failed test 18 in t/SATest.pm at line
635 fail #18

t/dnsbl.NOK 9Output can be examined in: log/d.dns/1

t/dnsbl.FAILED tests 1-18

Failed 18/23 tests, 21.74% okay

 

If I run t/dnsbl.t later, a smaller number of the subtests fail.  If I
repeat it later, a different set of dnsbl subtests fail.

 

There is nothing obviously wrong with the DNS server.  What causes this
problem?

RE: Error on startup after upgrade to 3.2.1:CompiledRegexps

2007-06-13 Thread Rosenbaum, Larry M.

 From: Steven W. Orr [mailto:[EMAIL PROTECTED]
 
 [EMAIL PROTECTED] ~]# rpm -Uvh
 /usr/src/redhat/RPMS/i386/spamassassin-3.2.1-1.i386.rpm
 /usr/src/redhat/RPMS/i386/perl-Mail-SpamAssassin-3.2.1-1.i386.rpm
 Preparing...
###
 [100%]
 1:perl-Mail-SpamAssassin
 ### [ 50%]
 2:spamassassin
 ### [100%]
 Stopping spamd: [  OK  ]
 Starting spamd: [13775] error: Can't locate
 Mail/SpamAssassin/CompiledRegexps/body_0.pm in @INC (@INC contains:
...
 
 Am I concerned? Where is CompiledRegexps supposed to be and why is it
 not
 there?

I think it means you have Rule2XSBody uncommented in v320.pre but you
don't have a compiled ruleset.  Perhaps you ran sa-compile under v3.2.0
but you haven't run it under v3.2.1.

RE: ANNOUNCE: Apache SpamAssassin 3.2.1 available

2007-06-12 Thread Rosenbaum, Larry M.

 From: Duncan Hill [mailto:[EMAIL PROTECTED]
 
 On Tue, June 12, 2007 13:33, Justin Mason wrote:
  Daniel J McDonald writes:
  So, you can't build the RPM as root.
 
 
  I just added all of the various groups to my user, set up a user
 build
  directory tree, compiled it under my username and it tested fine,
at
 least to the point
  that it normally bombs
 
  aha, that's it alright.  could you open a bug to get that fixed?
 
 One argument is that you shouldn't be building RPMs as root since you
 don't know if the
 spec file is perfect and won't break outside of the semi-jail that RPM
 sets up.  I've
 had at least one such RPM build when working with qmail, and it went
 and buggered up the
 build server by actually installing itself outside of the jail when
 building.  Doing it
 as a regular reason will limit that kind of activity.

Very interesting, but I ran into this problem on a Solaris system and I
wasn't trying to build an RPM.  I was just trying to build SA from
source with the usual

perl Makefile.PL
make
make test (this step gave errors when run as root)

Does the same logic apply when RPMs are not involved?

RE: sa-compile and SARE

2007-06-08 Thread Rosenbaum, Larry M.

 From: Doc Schneider [mailto:[EMAIL PROTECTED]
 
 Rosenbaum, Larry M. wrote:
 
  Does this fix the performance problems I was having, or does it just
 fix
  the UTF errors showing in the logs with Perl  5.8.8 ?
 
 You might try it and see if it helps with the performance. Since it
 does
 fix the UTF-8 issue it won't be doing as much logging and grinding.
let
 us know if it helps.

Performance is still bad unless I use bytes.

RE: sa-compile and SARE

2007-05-22 Thread Rosenbaum, Larry M.

 -Original Message-
 From: Loren Wilton [mailto:[EMAIL PROTECTED]
 Sent: Tuesday, May 22, 2007 1:36 PM
 To: users@spamassassin.apache.org
 Subject: Re: sa-compile and SARE

  So, can we now remove the use bytes; line?

 Should be able to.  If this doesn't work we have some more files that
need
 to be updated, and would like to know what they might be.

Does this fix the performance problems I was having, or does it just fix
the UTF errors showing in the logs with Perl  5.8.8 ?

RE: Spamassassin 3.2.0

2007-05-21 Thread Rosenbaum, Larry M.

 From: Christopher X. Candreva [mailto:[EMAIL PROTECTED]
 
 On Sat, 19 May 2007, Ming Hou wrote:
 
  My issue Mail::DKIM and Mail::DomainKeys are required
 Crypt::OpenSSL::Random
  and Crypt::OpenSSL::RSA.  But, I could not get
Crypt::OpenSSL::Random to
 be
  built successfully because it always complained the following
messages:
 
  Note (probably harmless): No library found for -lssl
  Note (probably harmless): No library found for -lcrypto
 
 Edit Makefile.PL and add -lssl and -lcrypto to the LIBS line. I also
had
 to
 add an include line for the SSL headers.
 
 Then rerun Makefile.PL

Or you can try the following:

perl Makefile.PL INC='-I/usr/local/ssl/include' 
  LIBS='-L/usr/local/ssl/lib -lssl -lcrypto'

(all on one line)

RE: possible conflict in a SA setup between .pre and local.cf issue

2007-05-21 Thread Rosenbaum, Larry M.

 From: Abba Communications [mailto:[EMAIL PROTECTED]
 
 I think I just noticed a conflict. Not sure if I made the mistake or
not,
 yet I probably did.
 
 In my init.pre I have
 
 loadplugin Mail::SpamAssassin::Plugin::URIDNSBL
 
 uncommented
 
 and in my
 
 local.cf I have purposely set the config below.
 
 skip_rbl_checks 1
 
 do these settings conflict?
 
 I believe they do, correct?

No.

skip_rbl_checks 1 does not turn off the URI DNSBL checks.

spamc -H favors one host (v3.2.0)

2007-05-14 Thread Rosenbaum, Larry M.

We have just upgraded from v3.1.8 to v3.2.0.  We invoke spamc as
follows:

 

spamc -H -E -t 180 -s 20 -d spamd.ornl.gov

 

# nslookup spamd.ornl.gov

 

Name:spamd.ornl.gov

Addresses:  160.91.4.92, 160.91.1.172

 

This used to connect equally to the two hosts, but now it makes almost
all the connections to one host (.92).  Has the host randomization logic
changed?  Is it broken?

RE: Poor performance with v3.2.0

2007-05-10 Thread Rosenbaum, Larry M.

 
 Took 10 mins on my 2.8gh 512mb ram, with a bunch of sares rules.
 
 You using .12.0 of re2c?

Yes.

I think most of the time is spent in the rule extraction steps and the
gcc compiles, and not in the re2c steps.  (gcc is v3.4.6)

  Yes, you are right, after use warnings;. I ran SA3.2 on my site
with
  use bytes; added, no problem so far. But it seems SA developers
did
 not
  mention this, they might have their reasons (break normalize_charset
for
  one reason).
 
 Yes, exactly -- breaking one of the major 3.2.0 features is not a good
 thing. :(

Where can I find documentation on what normalize_charset does?

RE: Poor performance with v3.2.0

2007-05-10 Thread Rosenbaum, Larry M.

 From: Loren Wilton [mailto:[EMAIL PROTECTED]
 Subject: Re: Poor performance with v3.2.0

 It would be interesting on some system experiencing this slowdown to
put
 'use bytes' back into SA and see what happens with the performance.
This
 wouldn't be any sort of a solution, but it would be an interesting
data
 point.

Interesting indeed.  I added use bytes and performance is much
improved.  It's approximately back to where it was with v3.1.8.  So what
does this all mean?

In case it matters, here's the output of perl -V:

Summary of my perl5 (revision 5 version 8 subversion 8) configuration:
  Platform:
osname=solaris, osvers=2.9, archname=sun4-solaris
uname='sunos email 5.9 generic_118558-39 sun4u sparc
sunw,sun-fire-v210 '
config_args='-Dcc=gcc -d'
hint=recommended, useposix=true, d_sigaction=define
usethreads=undef use5005threads=undef useithreads=undef
usemultiplicity=undef
useperlio=define d_sfio=undef uselargefiles=define usesocks=undef
use64bitint=undef use64bitall=undef uselongdouble=undef
usemymalloc=n, bincompat5005=undef
  Compiler:
cc='gcc', ccflags ='-fno-strict-aliasing -pipe
-Wdeclaration-after-statement -I/usr/local/include -D_LARGEFILE_SOURCE
-D_FILE_OFFSET_BITS=64',
optimize='-O',
cppflags='-fno-strict-aliasing -pipe -Wdeclaration-after-statement
-I/usr/local/include'
ccversion='', gccversion='3.4.6', gccosandvers='solaris2.9'
intsize=4, longsize=4, ptrsize=4, doublesize=8, byteorder=4321
d_longlong=define, longlongsize=8, d_longdbl=define, longdblsize=16
ivtype='long', ivsize=4, nvtype='double', nvsize=8, Off_t='off_t',
lseeksize=8
alignbytes=8, prototype=define
  Linker and Libraries:
ld='gcc', ldflags =' -L/usr/local/lib '
libpth=/usr/local/lib /usr/lib /usr/ccs/lib
libs=-lsocket -lnsl -ldl -lm -lc
perllibs=-lsocket -lnsl -ldl -lm -lc
libc=/lib/libc.so, so=so, useshrplib=false, libperl=libperl.a
gnulibc_version=''
  Dynamic Linking:
dlsrc=dl_dlopen.xs, dlext=so, d_dlsymun=undef, ccdlflags=' '
cccdlflags='-fPIC', lddlflags='-G -L/usr/local/lib'

Characteristics of this binary (from libperl):
  Compile-time options: PERL_MALLOC_WRAP USE_LARGE_FILES USE_PERLIO
  Built under solaris
  Compiled at May  4 2007 15:28:54
  @INC:
/usr/local/lib/perl5/5.8.8/sun4-solaris
/usr/local/lib/perl5/5.8.8
/usr/local/lib/perl5/site_perl/5.8.8/sun4-solaris
/usr/local/lib/perl5/site_perl/5.8.8
/usr/local/lib/perl5/site_perl/5.8.7/sun4-solaris
/usr/local/lib/perl5/site_perl/5.8.7
/usr/local/lib/perl5/site_perl/5.8.5/sun4-solaris
/usr/local/lib/perl5/site_perl/5.8.5
/usr/local/lib/perl5/site_perl
.

RE: Poor performance with v3.2.0

2007-05-10 Thread Rosenbaum, Larry M.

 From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]
 Did you have a massive volume of Malformed UTF-8 warning messages in
the
 syslog output?

No, I upgraded Perl to v5.8.8, which got rid of the warning messages but
there was still a performance problem.  Adding use bytes seems to have
fixed the performance problem.

 I have a theory that this would indeed cause major slowdowns, since
 every warning message has to be transmitted via UDP to the syslogd
 daemon, who then writes it synchronously to disk.  That is a pretty
 slow operation, and causes I/O.

RE: Poor performance with v3.2.0

2007-05-10 Thread Rosenbaum, Larry M.



 -Original Message-
 From: Doc Schneider [mailto:[EMAIL PROTECTED]

 If he is getting the UTF-8 error, this would indeed be odd, since he
is
 using perl-5.8.8 which supposedly handles those regexps which causes
the
 error.
 
 What SARE rules are you running Larry?

/usr/local/spamassassin/70_sare_adult.cf
/usr/local/spamassassin/70_sare_bayes_poison_nxm.cf
/usr/local/spamassassin/70_sare_evilnum0.cf
/usr/local/spamassassin/70_sare_evilnum1.cf
/usr/local/spamassassin/70_sare_evilnum2.cf
/usr/local/spamassassin/70_sare_genlsubj0.cf
/usr/local/spamassassin/70_sare_genlsubj1.cf
/usr/local/spamassassin/70_sare_header0.cf
/usr/local/spamassassin/70_sare_header1.cf
/usr/local/spamassassin/70_sare_html0.cf
/usr/local/spamassassin/70_sare_html1.cf
/usr/local/spamassassin/70_sare_obfu.cf
/usr/local/spamassassin/70_sare_oem.cf
/usr/local/spamassassin/70_sare_random.cf
/usr/local/spamassassin/70_sare_specific.cf
/usr/local/spamassassin/70_sare_spoof.cf
/usr/local/spamassassin/70_sare_stocks.cf
/usr/local/spamassassin/70_sare_unsub.cf
/usr/local/spamassassin/70_sare_uri0.cf
/usr/local/spamassassin/70_sare_uri1.cf
/usr/local/spamassassin/70_sare_whitelist_rcvd.cf
/usr/local/spamassassin/70_sare_whitelist_spf.cf
/usr/local/spamassassin/70_zmi_german.cf
/usr/local/spamassassin/72_sare_bml_post25x.cf
/usr/local/spamassassin/72_sare_redirect_post3.0.0.cf
/usr/local/spamassassin/99_sare_fraud_post25x.cf

also clamav, Botnet, and FuzzyOcr, and some local rules.

RE: Poor performance with v3.2.0

2007-05-09 Thread Rosenbaum, Larry M.

 Bayes auto expiries (taking to long and getting killed)?  I think 
 that's a 600 second timeout.

We're not using auto-expiry.  Bayes expiry is being done with a batch
job.

 It would be interesting on some system experiencing this slowdown to
put
 'use bytes' back into SA and see what happens with the performance.
This
 wouldn't be any sort of a solution, but it would be an interesting
data
 point.

We have Perl v5.8.8, which doesn't have the UTF8 bug.  Would it still be
worth trying this experiment?  If so, where do I put use bytes?

 And, God Blues you Theo!  sa-compile (we have it working on FREEBSD
for
 the non-sares rules) brings that performance back to 3.18 levels with
 all the new rules.

sa-compile took 3 hours to run.  (System is a SunFire v210 with 2
processors and 2 GB ram.)

Poor performance with v3.2.0

2007-05-08 Thread Rosenbaum, Larry M.

Running SpamAssassin v3.2.0 on Solaris 9, perl v5.8.8.

 

I am getting really poor performance with v3.2.0 compared with v3.1.8.
Average scan time per message is doubled (or worse).  I'm also seeing
messages like this in the log file (although not all the time):

 

May  7 17:10:20 localhost spamd[19457]: plugin: eval failed: child
processing timeout at /usr/local/bin/spamd line 1259, GEN390 line 377.

 

May  7 17:11:24 localhost spamd[19447]: plugin: eval failed: child
processing timeout at /usr/local/bin/spamd line 1259, GEN464 line
1097.

 

Is there any way to tell what the timeout was waiting for, or why
performance has gotten worse?  I haven't changed the rules.

 

Thanks, Larry

RE: Justa a small nag from 3.2.0...

2007-05-04 Thread Rosenbaum, Larry M.

 From: Luis Hernán Otegui [mailto:[EMAIL PROTECTED]
 
 Hi, list, I have a cron job running in order to learn from each user's
 HAM and SPAM subdirs. Whenever it runs, it complains like this:
 
 Subroutine FuzzyOcr::O_NONBLOCK redefined at
 /usr/share/perl/5.8/Exporter.pm line 65.
  at /usr/lib/perl/5.8/POSIX.pm line 19
 
 Any hints on how to avoid this nag?

Edit /usr/local/spamassassin/FuzzyOcr.pm and make the following change:

 use POSIX;
---
 use POSIX qw(SIGTERM);

RE: ANNOUNCE: Apache SpamAssassin 3.2.0-rc3 PRERELEASE available!

2007-04-24 Thread Rosenbaum, Larry M.

 From: Justin Mason [mailto:[EMAIL PROTECTED]
 Subject: ANNOUNCE: Apache SpamAssassin 3.2.0-rc3 PRERELEASE available!

 Apache SpamAssassin 3.2.0-rc3 is now available!  This is a
*PRERELEASE*,
 not the full release of 3.2.0.

I have found the following (non-fatal) issues with SA v3.2.0-rc3:

1) Now that spamc is linked with zlib, it doesn't link correctly on
Solaris 8 where libz.so is in /usr/local/lib.  Attempting to run spamc
gives the following error:

ld.so.1: ../spamc/spamc: fatal: libz.so: open failed: No such file or
directory

A workaround is to use the following make command

# LD_RUN_PATH=/usr/local/lib make

but it would be better if the configuration code supplied the right link
options.

2) I tried to compile the rules to see how much speed increase I could
get, but it didn't work.  After running sa-compile and uncommenting the
Rule2XSBody plugin, I got the following error:

# spamassassin --lint
ld.so.1: /usr/local/bin/spamassassin: fatal: relocation error: file
/var/lib/spamassassin/compiled/3.002000/auto/Mail/SpamAssassin/CompiledR
egexps/body_0/body_0.so: symbol
Mail_SpamAssassin_CompiledRegexps_body_0_scan17: referenced symbol not
found
Killed

This is with rc2 on Solaris 8.  I haven't tried it with rc3 yet because
it takes about 3 hours to compile on our development system.

3) I noticed the following rules were removed:

SUBJECT_DRUG_GAP_P
HTML_NONELEMENT_50_60
HTML_NONELEMENT_70_80
HTML_NONELEMENT_90_100
HTML_FONT_INVISIBLE
HTML_BACKHAIR_8
RISK_FREE
PORN_URL_SEX

I was wondering why some of the HTML_NONELEMENT_nn_nn rules were removed
and some were not.

  * bug 5236: Support Mail::SPF replacement for Mail::SPF::Query.

Makefile.PL complains if Mail::SPF::Query is missing, even though it
knows that Mail::SPF is preferred.

RE: ANNOUNCE: Apache SpamAssassin 3.2.0-rc3 PRERELEASE available! - UTF error

2007-04-24 Thread Rosenbaum, Larry M.

Testing v3.2.0-rc3 on Solaris 8
I'm finding zillions of the following errors in the log file:

Apr 24 10:13:27 emaildev.ornl.gov spamd[12593]: Malformed UTF-8
character (unexpected non-continuation by
te 0x00, immediately after start byte 0xce) in pattern match (m//) at
/etc/mail/spamassassin/70_sare_obfu
.cf, rule __SARE_OBFU_VISIT1, line 1, GEN174 line 390.
Apr 24 10:13:27 emaildev.ornl.gov last message repeated 1 time
Apr 24 10:13:28 emaildev.ornl.gov spamd[12593]: Malformed UTF-8
character (unexpected non-continuation by
te 0x00, immediately after start byte 0xc4) in pattern match (m//) at
/etc/mail/spamassassin/70_sare_obfu
.cf, rule __SARE_OBFU_VISIT1, line 1, GEN174 line 390.
Apr 24 10:13:28 emaildev.ornl.gov last message repeated 3 times
...
Apr 24 10:39:13 emaildev.ornl.gov spamd[12592]: Malformed UTF-8
character (unexpected non-continuation byte 0x00, immediately after
start byte 0xce) in pattern match (m//) at
/etc/mail/spamassassin/72_sare_bml_post25x.cf, rule SARE_OBFUAUCTION,
line 1, GEN192 line 832.
Apr 24 10:39:13 emaildev.ornl.gov last message repeated 1 time
Apr 24 10:39:13 emaildev.ornl.gov spamd[12592]: Malformed UTF-8
character (unexpected non-continuation byte 0x00, immediately after
start byte 0xd0) in pattern match (m//) at
/etc/mail/spamassassin/72_sare_bml_post25x.cf, rule SARE_OBFUAUCTION,
line 1, GEN192 line 832.

I don't see this error in v3.1.8.  What is causing this error, and how
do I fix it?

Thanks, Larry

RE: ANNOUNCE: Apache SpamAssassin 3.2.0-rc3 PRERELEASE available!

2007-04-24 Thread Rosenbaum, Larry M.

  2) I tried to compile the rules to see how much speed increase I
could
  get, but it didn't work.  After running sa-compile and uncommenting
the
  Rule2XSBody plugin, I got the following error:
 
  # spamassassin --lint
  ld.so.1: /usr/local/bin/spamassassin: fatal: relocation error: file
 
/var/lib/spamassassin/compiled/3.002000/auto/Mail/SpamAssassin/CompiledR
  egexps/body_0/body_0.so: symbol
  Mail_SpamAssassin_CompiledRegexps_body_0_scan17: referenced symbol
not
  found
  Killed
 
  This is with rc2 on Solaris 8.  I haven't tried it with rc3 yet
because
  it takes about 3 hours to compile on our development system.
 
 Again, sounds like some kind of -R related issue.  However, this
*should*
 work since this module is simply built using the ExtUtils::MakeMaker
 support for building XS modules.  Are you able to build XS modules
 (e.g. HTML::Parser) from source ok on that system?

I have no trouble compiling other XS Perl modules.

It doesn't seem to be a problem finding the dynamic libraries:

# ldd /var/lib/spamassassin/compiled/3.002000/auto/Mail/
SpamAssassin/CompiledRegexps/body_0/body_0.so
libc.so.1 = /usr/lib/libc.so.1
libdl.so.1 =/usr/lib/libdl.so.1
/usr/platform/SUNW,Ultra-5_10/lib/libc_psr.so.1

It's more a matter of finding the symbols:

# ldd -r /var/lib/spamassassin/compiled/3.002000/auto/Mail/
SpamAssassin/CompiledRegexps/body_0/body_0.so
libc.so.1 = /usr/lib/libc.so.1
libdl.so.1 =/usr/lib/libdl.so.1
/usr/platform/SUNW,Ultra-5_10/lib/libc_psr.so.1
symbol not found: main
(/var/lib/spamassassin/compiled/3.002000/auto/Mail/SpamAssassin/Compiled
Regexps/body_0/body_0.so)
symbol not found: PL_markstack_ptr
(/var/lib/spamassassin/compiled/3.002000/auto/Mail/SpamAssassin/Compiled
Regexps/body_0/body_0.so)
symbol not found: PL_stack_sp
(/var/lib/spamassassin/compiled/3.002000/auto/Mail/SpamAssassin/Compiled
Regexps/body_0/body_0.so)
symbol not found: PL_sv_yes
(/var/lib/spamassassin/compiled/3.002000/auto/Mail/SpamAssassin/Compiled
Regexps/body_0/body_0.so)
symbol not found: PL_stack_base
(/var/lib/spamassassin/compiled/3.002000/auto/Mail/SpamAssassin/Compiled
Regexps/body_0/body_0.so)
symbol not found: Perl_newSVpvn_share
(/var/lib/spamassassin/compiled/3.002000/auto/Mail/SpamAssassin/Compiled
Regexps/body_0/body_0.so)
symbol not found: Perl_av_push
(/var/lib/spamassassin/compiled/3.002000/auto/Mail/SpamAssassin/Compiled
Regexps/body_0/body_0.so)
symbol not found: Perl_sv_2pvutf8
(/var/lib/spamassassin/compiled/3.002000/auto/Mail/SpamAssassin/Compiled
Regexps/body_0/body_0.so)
symbol not found: Perl_newAV
(/var/lib/spamassassin/compiled/3.002000/auto/Mail/SpamAssassin/Compiled
Regexps/body_0/body_0.so)
symbol not found: Perl_sv_2mortal
(/var/lib/spamassassin/compiled/3.002000/auto/Mail/SpamAssassin/Compiled
Regexps/body_0/body_0.so)
symbol not found:
Mail_SpamAssassin_CompiledRegexps_body_0_scan17
(/var/lib/spamassassin/compiled/3.002000/auto/Mail/SpamAssassin/Compiled
Regexps/body_0/body_0.so)
symbol not found: Perl_croak
(/var/lib/spamassassin/compiled/3.002000/auto/Mail/SpamAssassin/Compiled
Regexps/body_0/body_0.so)
symbol not found: Perl_newRV
(/var/lib/spamassassin/compiled/3.002000/auto/Mail/SpamAssassin/Compiled
Regexps/body_0/body_0.so)
symbol not found: Perl_form
(/var/lib/spamassassin/compiled/3.002000/auto/Mail/SpamAssassin/Compiled
Regexps/body_0/body_0.so)
symbol not found: Perl_get_sv
(/var/lib/spamassassin/compiled/3.002000/auto/Mail/SpamAssassin/Compiled
Regexps/body_0/body_0.so)
symbol not found: Perl_sv_2pv_flags
(/var/lib/spamassassin/compiled/3.002000/auto/Mail/SpamAssassin/Compiled
Regexps/body_0/body_0.so)
symbol not found: Perl_newXS
(/var/lib/spamassassin/compiled/3.002000/auto/Mail/SpamAssassin/Compiled
Regexps/body_0/body_0.so)

Bug 5420 - RE: 3.2.0-rc2?

2007-04-17 Thread Rosenbaum, Larry M.

Any chance of fixing Bug #5420 before the final release?

http://issues.apache.org/SpamAssassin/show_bug.cgi?id=5420

 -Original Message-
 From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]
 Sent: Tuesday, April 17, 2007 12:05 PM
 To: [EMAIL PROTECTED]
 Cc: [EMAIL PROTECTED]; users@SpamAssassin.apache.org
 Subject: Re: 3.2.0-rc2?

 How's this working out?  Any good/bad reports?

 --j.

 jm writes:
  Apache SpamAssassin 3.2.0-rc2 is now available!  This is a
*PRERELEASE*,
  not the full release of 3.2.0.

  Downloads are available from:
 http://people.apache.org/~jm/devel/

  Downloading
  ---

http://people.apache.org/~jm/devel/Mail-SpamAssassin-3.2.0-rc2.tar.bz2

http://people.apache.org/~jm/devel/Mail-SpamAssassin-3.2.0-rc2.tar.gz
http://people.apache.org/~jm/devel/Mail-SpamAssassin-3.2.0-rc2.zip

  md5sum of archive files:
fcc0242642826191a58d45bf5777e3b2
Mail-SpamAssassin-3.2.0-rc2.tar.bz2
c9cc09334b04bc76f08e22c1aee6d07e
Mail-SpamAssassin-3.2.0-rc2.tar.gz
08f7a46d124e7abe50493ec4ddad7609  Mail-SpamAssassin-3.2.0-rc2.zip

  sha1sum of archive files:
c592640242ef8f7b93a99235f836a0d33cadfa10  Mail-SpamAssassin-3.2.0-
 rc2.tar.bz2
d11350d3d418f75682b06098aeb438696faa688d  Mail-SpamAssassin-3.2.0-
 rc2.tar.gz
43dd35eef6482cbd3176472e1ef0055eb4694dfe  Mail-SpamAssassin-3.2.0-
 rc2.zip

  The release files also have a .asc accompanying them.  The file
serves
  as an external GPG signature for the given release file.  The
signing
  key is available via the wwwkeys.pgp.net key server, as well as
  http://spamassassin.apache.org/released/GPG-SIGNING-KEY

  The key information is:

  pub 1024D/265FA05B 2003-06-09 SpamAssassin Signing Key
 [EMAIL PROTECTED]
  Key fingerprint = 26C9 00A4 6DD4 0CD5 AD24  F6D7 DEE0 1987 265F
A05B

  See the INSTALL and UPGRADE files in the distribution for important
  installation notes.

  Summary of major changes since 3.1.8

  Changes to the core code:

   * new behavior for trusted_networks/internal_networks: the 127.*
 network is now always considered trusted and internal, regardless of
 configuration.

   * bug 3109: short-circuiting of 'definite ham' or 'definite spam'
 messages based on individual short-circuit rules using the
'shortcircuit'
 setting, by Dallas Engelken dallase /at/ uribl.com.

   * bug 5305: implement 'msa_networks', for ISPs to specify their
Mail
 Submission Agents, and extend network trust accordingly.

   * bug 4636: Add support for charset normalization, so rules can be
 written in UTF-8 to match text in other charsets.

   * sa-compile: compilation of SpamAssassin rules into a fast
parallel-
 matching DFA, implemented in native code.

   * tflags multiple: allow writing of rules that count multiple
hits in
 a single message.

   * bug 4363: if a message uses CRLF for line endings, we should use
it
 as well, otherwise stay with LF as usual; important for Windows users.

   * bug 4515: content preview was omitting first paragraph when no
 Subject: header was present.

   * The third-party modules used by sa-update are now required by the
 SpamAssassin package, instead of being optional.

   * Bug 5165: 'sa-update --checkonly' added to check for updates
without
 applying them; thanks to anomie /at/ users.sourceforge.net

   * Bugs 4606, 4609: Adjust MIME parsing limits for nested
multipart/*
 and message/rfc822 MIME parts.

   * bug 5295: add 'whitelist_auth', to whitelist addresses that send
mail
 using sender-authorization systems like SPF, Domain Keys, and DKIM

   * Removed dependency on Text::Wrap CPAN module.

   * Received header parsing updates/fixes/additions.

  Spamc / spamd:

   * bug 4603: Mail::SpamAssassin::Spamd::Apache2 -- mod_perl2 module,
 implementing spamd as a mod_perl module, contributed as a Google
Summer of
 Code project by Radoslaw Zielinski.

   * bug 3991: spamd can now listen on UNIX domain, TCP, and SSL
sockets
 simultaneously.  Command-line semantics extended slightly, although
fully
 backwards compatibly; add the --ssl-port switch to allow TCP and SSL
 listening at the same time.

   * bug 3466: do Bayes expiration, if required, after results have
been
 passed back to the client from spamd; this helps avoid client
timeouts.

   * more complete IPv6 support.

   * spamc: Add '-K' switch, to ping spamd.

   * spamc: add '-z' switch, which compresses mails to be scanned
using
 zlib compression; very useful for long-distance use of spamc over the
 internet.

   * bug 5296: spamc '--headers' switch, which scans messages and
 transmits back just rewritten headers.  This is more
bandwidth-efficient
 than the normal mode of scanning, but only works for 'report_safe 0'.

   * Bump spamd's protocol version to 1.4, to reflect new HEADERS verb
 used for '--headers'.

  Mail::SpamAssassin modules and API:

   * bug 4589: allow M::SA::Message to use IO::File objects to read in
 message (same as GLOB).

RE: Simple hosting user: can I install my own SA?

2007-02-02 Thread Rosenbaum, Larry M.

 From: Andy Figueroa [mailto:[EMAIL PROTECTED]
 
 Woo Hoo - that's a good trick and good intel.  Thinking ... Won't it
 still try to use /usr/share/spamassassin for rules and
 /var/lib/spamassassin for updates and /etc/mail/spamasassin for
 configuration and so forth?  Can't do that without root access.  I'm
 thinking out-loud here ...  Thanks.

You can get around that by building it with something like

perl Makefile.PL PREFIX=/path/to/yourhomedir

RE: use or not use awl

2007-01-22 Thread Rosenbaum, Larry M.

 From: Dave Koontz [mailto:[EMAIL PROTECTED]
 Not neccessarily. Put your awl on a sql database and add a timestamp
 column
 to the awl table, which gets automagically a new timestamp by the dbms
 each
 time a record is updated. The timestamp column type in Mysql is such
a
 type.
 
 show create table awl:
 
 CREATE TABLE `awl` (
   `username` varchar(100) collate latin1_german1_ci NOT NULL default
'',
   `email` varchar(200) collate latin1_german1_ci NOT NULL default '',
   `ip` varchar(10) collate latin1_german1_ci NOT NULL default '',
   `count` int(11) default '0',
   `totscore` float default '0',
   `timestamp` timestamp NOT NULL default CURRENT_TIMESTAMP on update
 CURRENT_TIMESTAMP,
   PRIMARY KEY  (`username`,`email`,`ip`)
 ) ENGINE=InnoDB DEFAULT CHARSET=latin1 COLLATE=latin1_german1_ci
 
 Then you can easily expire by date with a cron job, for example expire
all
 that was not updated for the last 30 days:
 
 delete from awl where timestamp  now() - interval 30 day

Can you tell me how to do something similar for the bayes_seen table and
MySQL?

Thanks, Larry

BODY rule fails with double-spaced text

2006-12-14 Thread Rosenbaum, Larry M.

The doc for BODY rules says All HTML tags and line breaks will be
removed before matching.  I was also told on this list that multiple
whitespace was compressed to single space characters.  So if I have text
like this:

 

xyzzy

abcde

 

and the following rules:

 

bodyT_LMRTESTB1 /xyzzy abcde/

bodyT_LMRTESTB2 /xyzzy\s{1,4}abcde/

 

then both rules will match.  However, if the text is double-spaced like
this:

 

xyzzy

 

abcde

 

then *neither* rule will match, even though I would have expected them
both to still match.  Is this a designed feature or a bug?

RE: spam

2006-12-05 Thread Rosenbaum, Larry M.

Has anybody come up with a rule for these yet?  I tried the following:

body ORNL_B0RKEN1 /^\d{3,5}\n{1,3}$/s
describe ORNL_B0RKEN1 B0rken spamware, message just contains a short
number
scoreORNL_B0RKEN1 1

This matches the spam message, but it also matches messages where the
number is followed by a blank line and more text, which is a false
positive.  If I replace body with full, then it doesn't match the
spam message.  I have also tried the following variations:

-  using /s, /m, or neither switch
-  using the ^ and $ anchors or the \A and \Z anchors
-  using \n, \s, or neither (i.e. the pattern /^\d{3,5}$/)

In all cases I got the same results.  What am I missing?

Thanks, Larry

 -Original Message-
 From: Nigel Frankcom [mailto:[EMAIL PROTECTED]
 Posted At: Monday, December 04, 2006 8:02 PM
 Posted To: sa-users
 Conversation: spam
 Subject: Re: spam
 
 On Mon, 04 Dec 2006 16:35:33 -0800, Evan Platt
 [EMAIL PROTECTED] wrote:
 
 At 04:24 PM 12/4/2006, you wrote:
 On Mon, 4 Dec 2006 16:11:28 -0800 (PST), san [EMAIL PROTECTED]
 wrote:
 
  
  Hi,
  
  Am recieving a spam mails which is just having number on the body
just
 like
  1265 or 2196...
  
  any thoughts how to stop this kind of spam..
  
  thanks
  san
 
 Ditto
 
 How in the hell does one write a  rule for this sh*?
 
 Maybe a rule if the message body is less than X characters?
 
 I mean unless you expect lots of legitimate mail that says
 Hello.
 
 
 Good point; thanks.
 
 Though I think I'll do one that picks only numerals. That said I'm
 pretty sure there's a sare rule that covers this sort of thing
 though I could easily be wrong; it wouldn't be the 1st time :-D
 
 KR
 
 Nigel

RE: new Botnet plugin version soon

2006-11-30 Thread Rosenbaum, Larry M.

 From: Dennis Davis [mailto:[EMAIL PROTECTED]
 ...
 
  Question 2: someone asked why my module is Botnet instead of
  Mail::SpamAssassin::Plugin::Botnet.  The answer is: when I
  first started this (and this is/was my first SA Plugin authoring
  attempt), I tried that and it didn't work.  If someone wants to
  look at it, and figure out how to make that work
 
 I prefer to have all the SpamAssassin plugins grouped together where
 the default install puts them.  This is in the directory:
 
 /usr/local/libdata/perl5/site_perl/Mail/SpamAssassin/Plugin/

I would prefer to use the xxx/site_perl/Mail/SpamAssassin/Plugin for
plugins that are packaged with SpamAssassin, and that any added-in
plugins that I install separately go into /etc/mail/spamassassin.  I
also see no advantage to moving the loadplugin statement into the
init.pre file unless there are rules in other .cf files that depend on
the plugin.  In other words, it's fine the way it is.

RE: sa-update and 'doesnotexist'

2006-10-10 Thread Rosenbaum, Larry M.

 From: Ben Lentz [mailto:[EMAIL PROTECTED]
 
 Greetings, List!
 I just upgraded from sa 3.1.3 to sa 3.1.6 and am having some weird
 problems with sa-update that I've never seen before. It would seem
that
 my sys rules/default rules directory (/usr/share/spamassassin) is not
 being loaded by sa-update's internal lint test, but that my site rules
 directory (/etc/mail/spamassassin) is. And because by site rules
 directory has references to things in my sys rules directory,
sa-update
 refuses to run because it thinks my rules are borked. Really, it's
just
 not reading what's there.

This is a known bug in 3.1.6.  It is fixed in 3.1.7, which was just
released today.

RE: spamd Sys-Hostname-Long-1.4 error

2006-10-06 Thread Rosenbaum, Larry M.

In Long.pm, go to line 91 and change this

 my $tmp = `hostname` . '.' . `domainname`;

to this

 my $tmph = `hostname`;
 my $tmpd = `domainname`;
 my $tmp = $tmph.$tmpd;

 -Original Message-
 From: John Goubeaux [mailto:[EMAIL PROTECTED]
 Posted At: Friday, October 06, 2006 12:54 PM
 Posted To: sa-users
 Conversation: spamd Sys-Hostname-Long-1.4 error
 Subject: spamd Sys-Hostname-Long-1.4 error
 
 
 I am getting the following error generated from spamd, apparently due
 to the fact that the  sys::hostname::long module behaves differently
 on different OSs  in this case Solaris 8.
 While i realize this is not specific to spmassassin and a perl /OS
 issue  was wondering if anyone running SA on Solaris has seen and
 been able to eliminate this error?
 
 SpamAssassin version 3.1.5
running on Perl version 5.8.5
 
 Oct  5 09:28:21 kady spamd[870]: Insecure dependency in `` while
 running with -T switch at
 /usr/local/perl-5.8.5/lib/site_perl/5.8.5/Sys/Hostname/Long.pm line
 91, GEN26746 line 77.
 
 any clues are appreciated !   -john
 
 --
 John Goubeaux
 Systems Administrator
 Gevirtz Graduate School of Education
 UC Santa Barbara
 Phelps Hall 3534
 805 893-8190

RE: 2 different scores?

2006-10-06 Thread Rosenbaum, Larry M.

 From: Evan Platt [mailto:[EMAIL PROTECTED]
... 
 I changed my procmailrc to;
 
 :0fw: spamassassin.lock
 | spamd -L -c -s 512000

Shouldn't that be spamc?

RE: sa-learn and Caught spams

2006-09-27 Thread Rosenbaum, Larry M.

 From: Mike Woods [mailto:[EMAIL PROTECTED]
 
 The internet is a great place for raising more questions than it
answers
 :D
 
 Given all the opinions I think I will move the caught spam's into the
 learning cycle however i'm also going to make sure that each spam is
 only ever fed through the system once, this wont be a problem since I
 already make use of their checksums to avoid duplicating files and I
had
 intended to use it to remove old spam anyway.

If you look at the X-Spam-Status header, it will tell you if the message
was already autolearned:

X-Spam-Status: Yes, score=19.5 required=5.0 tests=...
  (list of tests)...
autolearn=spam version=3.1.5

RE: URIBL false matches

2006-09-07 Thread Rosenbaum, Larry M.

 From: Mark G. Thomas [mailto:[EMAIL PROTECTED]
 
 Hi,
 
 I have a problem with incorrect URIBL hits on incoming forwarded
messages
 that have been mangled by Lotus Notes.
 
 I have a customer with the domain name Yimaging.com.
 (Not really Y).
 
 ng.com is on the URIBL blacklist.  I think for awhile it has been
 removed, but it's there again now.
 ...
 Is there some
 easy way I can exclude just the one domain name ng.com from being
looked
 up at all, but otherwise still use the URIBL?

uridnsbl_skip_domain ng.com

1 2 >

1 - 100 of 138 matches

Mail list logo