Re: DomainKeys in SA

2006-03-26 Thread Philip Prindeville
Benny Pedersen wrote:

>>>I had set up Postfix to check incoming mails for DK sigs but when I did
>>>that, I was no longer able to DISCARD emails sent by known spammers and spam 
>>>networks.  That was not
>>>acceptable, so SpamAssassin provides me with another way of doing it, 
>>>without breaking any Postfix
>>>functionality.
>>>  
>>>
>>I don't like to reject spam, I prefer to DISCARD so that the innocent
>>bystanders that own the addresses the spammers forged are not bothered with 
>>the bounces.
>>
>>
>
>http://www.t29.dk/header_check_notes.php
>
>think about it one more time :-)
>  
>

I concur with Benny.

Knowledge is power, and you're usually better off knowing than not
knowing (what you don't know can hurt you).

You're also assuming that most people are clueless, or can't be bothered
to defend themselves.

What if someone really doesn't like having someone else impersonate him,
and decides to go after such parties with a vengeance?  They need all the
data they can get to present in court.

If the person, on the other hand, really doesn't want to know or doesn't
care,
then let him make the extra effort to filter the bouncers (at his peril).

-Philip




Re: URIDNSBL.pm invalid bitwise or?

2006-03-26 Thread Jason Parsons


On Mar 23, 2006, at 06:15 , Justin Mason wrote:

I agree -- that should be removed.  Could you open a new bugzilla  
bug about this?


Submitted bug ID 4842.  Attached a proposed patch to the bug.

Thank you.
 - Jason Parsons




RE: Perl Modules?

2006-03-26 Thread Bradley Walker
I have the CentOS 3.2 system going and noticed something when doing research
on the perl modules that SA uses.  Currently I have Digest-SHA1 installed
that came with CentOS and it's version 2.01-15.1.  However when checking my
control panel software (www.directadmin.com) I saw that when running the
script file to update SpamAssassin to 3.1.1, they had downloaded and
appearantly installed Digest-SHA1 version 2.10.  Likewise I was unable to
figure out what version of libhtml-parser-perl I have installed.

Anyway, would any of this be leading to problems of STILL getting the 421
BSMTP timeout errors with Exim/SpamAssassin?  These problems appear as soon
as I start SpamAssassin with messages becoming frozen containing:

The following text was generated during the delivery attempt:

*
-- @teazles.com --

An error was detected while processing a file of BSMTP input.
The error message was:

  421 Lost incoming connection

The SMTP transaction started in line 0.
The error was detected in line 3.
0 previous messages were successfully processed.
The rest of the batch was abandoned.
421 Lost incoming connection
Transaction started in line 0
Error detected in line 3
*

Within 5-10 minutes after starting SpamAssassin I had 8 frozen messages and
they continued to add up the longer I let SA run.  I had to kill the process
and comment the SA parts in the exim.conf file to at least allow email to
get through to my clients.

This has me baffled, and quite honestly frustrated to no end as to why it's
refusing to work correctly when it has in the past.

Brad


-Original Message-
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] On
Behalf Of Matt Kettler
Sent: Sunday, March 26, 2006 12:53 PM
To: [EMAIL PROTECTED]
Cc: users@spamassassin.apache.org
Subject: Re: Perl Modules?

Bradley Walker wrote:
> What Perl modules does SA require?  I'm looking into this as being a 
> possible cause of my 421 SMTP timeout errors.

You didn't specify a version of SA. So assuming SA 3.1

http://svn.apache.org/repos/asf/spamassassin/branches/3.1/INSTALL

Read the section about half-way down titled "Required Perl Modules"



Re: On RelayCountry, IP::Country, etc.

2006-03-26 Thread Philip Prindeville
Kai Schaetzl wrote:

>Philip Prindeville wrote on Fri, 24 Mar 2006 20:46:21 -0700:
>
>  
>
>>IP::Country
>>
>>
>
>Is that getting used for Plugin::RelayCountry or how can one utilize it?
>
>Kai
>  
>

It is used by RelayCountry (hence the subject line), but it can also be
used in
MimeDefang in the filter_relay() section to figure out where the connection
is coming from.

-Philip



Re: Bayes store recommendation

2006-03-26 Thread Matt Kettler
Jakob Hirsch wrote:
> Quoting Matt Kettler:
> 
>> I personally use sdbm on my production servers. I chose it for better speed 
>> than
>> dbm, without the extra ram overhead of mysql.
> 
> Thanks for your input.
> 
> So I backuped my bayes db (around 10mib) and restored it with the sqlite
> module, which took several hours. Using SDBM it was a matter of 20
> seconds (so I can confirm what Michael wrote).
> 
> I don't think SQLite itself is _that_ slow (in fact, I don't think it's
> slow at all), it's most probably a matter of optimization, but the
> additional SQL layer will surely not speed up things and SDBM seems to
> be quite fast. If time permits, I'll do a benchmark run, anyway, and
> post it here.

Yeah, one thing to be aware of is that insertion seems to be the weak point of
the SQL mode. Even with the best settings SQL is slower than even DBM in phase
1, which is all learning.

I'm a little surprised at your result however.. Such a huge difference implies
that SQLite isn't doing nearly as well as Mysql/pgsql. The worst-case phase 1
difference was 436% slower for SQL than SDBM, but your results are on the order
of 18000% slower..

Where SQL seems to shine is in force-expire (phase 3), and it does well in scan
(phase 2 and 5). It also seems to do poorly in the forget (phase 4).

At first you might think force expire and forget should be delete based, and
thus similar, but in reality force-expire is read/delete intensive, but forget
is update intensive, with few records deleted.

Based on that it appears:
SQL does better than DBM at reading SA bayes records
SQL does better than DBM at deleting SA bayes records
SQL does worse than DBM at inserts.
SQL does worse than DBM at updates.

Whereas SDBM does better than DBM in all tests.


Re: Bayes store recommendation

2006-03-26 Thread Jakob Hirsch
Quoting Matt Kettler:

> I personally use sdbm on my production servers. I chose it for better speed 
> than
> dbm, without the extra ram overhead of mysql.

Thanks for your input.

So I backuped my bayes db (around 10mib) and restored it with the sqlite
module, which took several hours. Using SDBM it was a matter of 20
seconds (so I can confirm what Michael wrote).

I don't think SQLite itself is _that_ slow (in fact, I don't think it's
slow at all), it's most probably a matter of optimization, but the
additional SQL layer will surely not speed up things and SDBM seems to
be quite fast. If time permits, I'll do a benchmark run, anyway, and
post it here.



Regards,
Jakob


Re: On RelayCountry, IP::Country, etc.

2006-03-26 Thread Matt Kettler
Kai Schaetzl wrote:
> Philip Prindeville wrote on Fri, 24 Mar 2006 20:46:21 -0700:
> 
>> IP::Country
> 
> Is that getting used for Plugin::RelayCountry or how can one utilize it?
> 

Yes, Plugin::RelayCountry uses IP::Country, specifically IP::Country::Fast.




Re: DomainKeys in SA

2006-03-26 Thread Benny Pedersen
>> I had set up Postfix to check incoming mails for DK sigs but when I did
>> that, I was no longer able to DISCARD emails sent by known spammers and spam 
>> networks.  That was not
>> acceptable, so SpamAssassin provides me with another way of doing it, 
>> without breaking any Postfix
>> functionality.
>
> I don't like to reject spam, I prefer to DISCARD so that the innocent
> bystanders that own the addresses the spammers forged are not bothered with 
> the bounces.

http://www.t29.dk/header_check_notes.php

think about it one more time :-)




Re: Perl Modules?

2006-03-26 Thread Matt Kettler
Bradley Walker wrote:
> What Perl modules does SA require?  I'm looking into this as being a
> possible cause of my 421 SMTP timeout errors.

You didn't specify a version of SA. So assuming SA 3.1

http://svn.apache.org/repos/asf/spamassassin/branches/3.1/INSTALL

Read the section about half-way down titled "Required Perl Modules"


Re: Bayes store recommendation

2006-03-26 Thread Michael Parker
Jakob Hirsch wrote:
> Hi everyone,
> 
> I've searched a little through the archive but found not much about that.
> Right now I'm using DBM for bayes. I know the officially recommended
> storage is a SQL DB (though I don't know why). I have a mysql server
> running on my mail system and tried using it for bayes a while ago (with
> SA 3.1.0) but didn't see lower scan times. For a local RDBMS I think
> SQLite is a very good thing, but there was a list post where somebody
> said ist not suitable for SA (though README.bayes mentions it).
> According to the bayes benchmark in the wiki
> (http://wiki.apache.org/spamassassin/BayesBenchmarkResults), SDBM is the
> fastest store (though there are no sqlite numbers), but I wonder why
> it's not mentioned much. I'll probably run the bayes benchmark myself
> (with sqlite, sdbm and dbm), but somebody has probably some experience
> to share.

You're welcome to try SQLite, but all testing I did was too slow to even
wait for results (multi-day runs vs multi-hour for other slow systems).

The bayes storage module is such that you can modify the various queries
to work better with whatever engine you would like (see Pg
modifications).  If a SQLite expert wants to take up the challenge and
provide patches I'll happily review them for inclusion in the code.

Michael


Re: Bayes store recommendation

2006-03-26 Thread Matt Kettler
Jakob Hirsch wrote:
> Hi everyone,
> 
> I've searched a little through the archive but found not much about that.
> Right now I'm using DBM for bayes. I know the officially recommended
> storage is a SQL DB (though I don't know why). I have a mysql server
> running on my mail system and tried using it for bayes a while ago (with
> SA 3.1.0) but didn't see lower scan times. For a local RDBMS I think
> SQLite is a very good thing, but there was a list post where somebody
> said ist not suitable for SA (though README.bayes mentions it).
> According to the bayes benchmark in the wiki
> (http://wiki.apache.org/spamassassin/BayesBenchmarkResults), SDBM is the
> fastest store (though there are no sqlite numbers), but I wonder why
> it's not mentioned much. I'll probably run the bayes benchmark myself
> (with sqlite, sdbm and dbm), but somebody has probably some experience
> to share.

I personally use sdbm on my production servers. I chose it for better speed than
dbm, without the extra ram overhead of mysql.

I posted some questions about this in December, with no real response:
http://mail-archives.apache.org/mod_mbox/spamassassin-users/200512.mbox/[EMAIL 
PROTECTED]

I'd be interested in hearing any results you get from using sqlite...

As for SDBM, I'll warn you that if you're not using SA 3.1.1, there's a bug in
sa-learn --restore that affects sdbm.

http://issues.apache.org/SpamAssassin/show_bug.cgi?id=4670

That said, I've been running bayes with SDBM in production since December, using
SA 3.1.0, and am fairly happy with it.


Re: Perl Modules?

2006-03-26 Thread Matthias Fuhrmann
On Sun, 26 Mar 2006, Bradley Walker wrote:

hI,

> What Perl modules does SA require?  I'm looking into this as being a
> possible cause of my 421 SMTP timeout errors.

have a look here:
http://spamassassin.apache.org/dist/INSTALL

regards,
Matthias


Re: On RelayCountry, IP::Country, etc.

2006-03-26 Thread Kai Schaetzl
Philip Prindeville wrote on Fri, 24 Mar 2006 20:46:21 -0700:

> IP::Country

Is that getting used for Plugin::RelayCountry or how can one utilize it?

Kai

-- 
Kai Schätzl, Berlin, Germany
Get your web at Conactive Internet Services: http://www.conactive.com





Perl Modules?

2006-03-26 Thread Bradley Walker



What Perl modules 
does SA require?  I'm looking into this as being a possible cause of my 421 
SMTP timeout errors.


Bayes store recommendation

2006-03-26 Thread Jakob Hirsch
Hi everyone,

I've searched a little through the archive but found not much about that.
Right now I'm using DBM for bayes. I know the officially recommended
storage is a SQL DB (though I don't know why). I have a mysql server
running on my mail system and tried using it for bayes a while ago (with
SA 3.1.0) but didn't see lower scan times. For a local RDBMS I think
SQLite is a very good thing, but there was a list post where somebody
said ist not suitable for SA (though README.bayes mentions it).
According to the bayes benchmark in the wiki
(http://wiki.apache.org/spamassassin/BayesBenchmarkResults), SDBM is the
fastest store (though there are no sqlite numbers), but I wonder why
it's not mentioned much. I'll probably run the bayes benchmark myself
(with sqlite, sdbm and dbm), but somebody has probably some experience
to share.



Regards,
Jakob