Re: Using SA code to extract URLs ?

2007-01-13 Thread Jeff Chan
On Friday, January 12, 2007, 6:10:32 PM, Michael Cocke wrote: I was told a while back that the best way to extract urls from emails was to use code from SpamAssassin. Ok - Now, I need to do just that. Any pointers? I've looked thru the code in SpamCopURI, but unless there are some docs

that's how they got my email address?

2007-01-13 Thread hamann . w
Hi, I recently received this ad (1 million opt-in addresses of german companies) I believe them that they collect information from a lot of sources (phone books, public registers, etc) but I doubt the victims on their list were given a chance to opt in. Maybe someone is interested to download

Re: that's how they got my email address?

2007-01-13 Thread vertito
I was thinking that they are making use of what google has been successful for, crawling bots that _crawls_ web sites for targeted data for retrieval like email address as for their case, but this time on a much different objective compared to google. Secondly, right after using zen.spamhaus,

RE: Rules always triggering.

2007-01-13 Thread Dave Koontz
Just a wild stab here, run a lint check on all your rules. I once fat fingered a rule in my local.cf file and got similar hit results as you are describing here. -Original Message- From: Daniel Staal [mailto:[EMAIL PROTECTED] Sent: Friday, January 12, 2007 9:05 PM To:

Re: Huge File Size

2007-01-13 Thread Benny Pedersen
On Fri, January 12, 2007 02:14, Matt Kettler wrote: form of expiry is one reason why I say the AWL isn't really ready for production use on any servers that have decent mail volume) if one entry is just deleted when will there be records with 2 ? awl is tricky but good, we have to live with

Re: Huge File Size

2007-01-13 Thread Benny Pedersen
On Fri, January 12, 2007 03:35, Christopher Jett wrote: OK - thanks. So, for example, it's safe to delete the bayes_seen file after it gets over a certain size? Is there a particular size after which performance degrades significantly? i remember that file based bayes is huge, where sql

Re: Huge File Size

2007-01-13 Thread Matt Kettler
Benny Pedersen wrote: On Fri, January 12, 2007 02:14, Matt Kettler wrote: form of expiry is one reason why I say the AWL isn't really ready for production use on any servers that have decent mail volume) if one entry is just deleted when will there be records with 2 ? I don't

RE: Rules always triggering.

2007-01-13 Thread Daniel Staal
--As of January 13, 2007 7:17:46 AM -0500, Dave Koontz is alleged to have said: Just a wild stab here, run a lint check on all your rules. I once fat fingered a rule in my local.cf file and got similar hit results as you are describing here. --As for the rest, it is mine. I fixed a couple

Re: Rules always triggering.

2007-01-13 Thread Matt Kettler
Daniel Staal wrote: --As of January 13, 2007 7:17:46 AM -0500, Dave Koontz is alleged to have said: Just a wild stab here, run a lint check on all your rules. I once fat fingered a rule in my local.cf file and got similar hit results as you are describing here. --As for the rest, it is

Re: Using SA code to extract URLs ?

2007-01-13 Thread Dallas Engelken
Michael W. Cocke wrote: div class=moz-text-flowed style=font-family: -moz-fixedI was told a while back that the best way to extract urls from emails was to use code from SpamAssassin. Ok - Now, I need to do just that. Any pointers? I've looked thru the code in SpamCopURI, but unless there

Re: Huge File Size

2007-01-13 Thread Gary V
Benny Pedersen wrote: On Fri, January 12, 2007 02:14, Matt Kettler wrote: form of expiry is one reason why I say the AWL isn't really ready for production use on any servers that have decent mail volume) if one entry is just deleted when will there be records with 2 ? I don't understand

Re: Huge File Size

2007-01-13 Thread Gary V
I *think* you're in agreement with what I just said. Using last-accessed time instead of hit-count makes substantially more sense. By moving AWL to SQL this can be accomplished. Here is a sample for MySQL: Add a new field: ALTER TABLE awl ADD lastupdate timestamp(14) NOT NULL; If you have a

comprehensive perl module site like cpan or other for SA needs ???

2007-01-13 Thread R Lists06
Greetings Seeking some list wisdom please? We have some well functioning boxes running SA out there Most run RHEL 4 or CentOS 4 I am wondering where to go to find out specifically for each perl module if we have the latest greatest and most stable version(s) etc Please note the sa-update

Re: Huge File Size

2007-01-13 Thread Gary V
I don't see why this method could not also be used for bayes_seen. I was not aware bayes_seen would grow forever so I am going to implement this on my own system next week. ALTER TABLE bayes_seen ADD lastupdate timestamp(14) NOT NULL; Then wait a few weeks before implementing: DELETE FROM

Install or enable net::ident

2007-01-13 Thread Andy Figueroa
I'm running 3.1.7 under Gentoo. It seems like I might want to install or enable Net::Ident, but I can't find anything about it in the configuration files, though it shows up in debugging output as dbg: diag: module not installed: Net::Ident ('require' failed) Then I installed Net-Ident and

Re: Install or enable net::ident

2007-01-13 Thread Theo Van Dinter
On Sat, Jan 13, 2007 at 02:40:54PM -0500, Andy Figueroa wrote: What else do I need to do? I don't seem to be able to find a clue on my own. It's only used in spamd, so man spamd for more information. -- Randomly Selected Tagline: There's nothing wrong with [Microsoft] systems until Back

Re: Install or enable net::ident

2007-01-13 Thread Andy Figueroa
Thank you for the reply. I certainly agree that Net::Ident is mentioned in the man for spamd. But, it doesn't help me enough. It did raise these questions: Is the spamd option --auth-ident the only thing Net::Ident is used for? What is identd (I haven't found one on my system)? Will

Re: Install or enable net::ident

2007-01-13 Thread Theo Van Dinter
On Sat, Jan 13, 2007 at 03:28:07PM -0500, Andy Figueroa wrote: Is the spamd option --auth-ident the only thing Net::Ident is used for? Yes. What is identd (I haven't found one on my system)? The idea is that a remote machine can ask who (user) is connecting to it. In this case, spamd can

Any rules to catch EXE's?

2007-01-13 Thread Robert Nicholson
At this time I'm forwarding mail that SA considers spam to my gmail account. The following bounces with SMTP error from remote mail server after end of data: host gmail-smtp-in.l.google.com [64.233.185.27]: 552 5.7.0 Illegal Attachment g5si5192165wra error None of the rules

Checking for PTR record

2007-01-13 Thread Peter Smith
Hi, I'm interested in filtering mail relayed from hosts which have no reverse dns (PTR) record. A lot of MTAs support rejection of mail from such hosts, but I feel this will reject too much genuine mail, so I'm looking to approach the problem via Spam Assassin - perhaps score 1 or 2 for such

Re: Checking for PTR record

2007-01-13 Thread René Berber
Peter Smith wrote: I'm interested in filtering mail relayed from hosts which have no reverse dns (PTR) record. A lot of MTAs support rejection of mail from such hosts, but I feel this will reject too much genuine mail, so I'm looking to approach the problem via Spam Assassin - perhaps score 1

Re: comprehensive perl module site like cpan or other for SA needs ???

2007-01-13 Thread John Andersen
On Saturday 13 January 2007 09:01, R Lists06 wrote: Also, please note the modules that are not active or found in the install below during the SA update Would installing them just be plug and play and they start working or do I search them out individually too and their activation configs

Re: Any rules to catch EXE's?

2007-01-13 Thread Matt Kettler
Robert Nicholson wrote: At this time I'm forwarding mail that SA considers spam to my gmail account. The following bounces with SMTP error from remote mail server after end of data: host gmail-smtp-in.l.google.com [64.233.185.27]: 552 5.7.0 Illegal Attachment g5si5192165wra

Re: Any rules to catch EXE's?

2007-01-13 Thread Michele Neylon :: Blacknight
Matt Kettler wrote: Robert Nicholson wrote: At this time I'm forwarding mail that SA considers spam to my gmail account. The following bounces with SMTP error from remote mail server after end of data: host gmail-smtp-in.l.google.com [64.233.185.27]: 552 5.7.0 Illegal Attachment

RE: comprehensive perl module site like cpan or other for SA needs ???

2007-01-13 Thread R Lists06
So how did you install in the first place? Yes, installing these would cause them to start working. I recommend installing using CPAN, as it is portable, reliable, picks up its pre-requsites very well and you are not dependent on some Distro specific packager. If you have the latest

Re: Any rules to catch EXE's?

2007-01-13 Thread Matt Kettler
Michele Neylon :: Blacknight wrote: Matt Kettler wrote: Robert Nicholson wrote: At this time I'm forwarding mail that SA considers spam to my gmail account. The following bounces with SMTP error from remote mail server after end of data: host gmail-smtp-in.l.google.com

Re: comprehensive perl module site like cpan or other for SA needs ???

2007-01-13 Thread David Morton
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 On Jan 13, 2007, at 8:01 PM, R Lists06 wrote: It is my experience that CPAN installs can or will tend to do things I do not want it to do (or cannot control) in a RPM environment among other things... Heh, my experience is just opposite,

Re: comprehensive perl module site like cpan or other for SA needs ???

2007-01-13 Thread Theo Van Dinter
On Sat, Jan 13, 2007 at 08:58:55PM -0600, David Morton wrote: It is my experience that CPAN installs can or will tend to do things I do not want it to do (or cannot control) in a RPM environment among other things... Heh, my experience is just opposite, RPM environment often

Re: Any rules to catch EXE's?

2007-01-13 Thread John D. Hardin
None of the rules indicate that it had any exe or zip attachment Why would they? SA is a spam filter, not a virus filter. You could try MailScanner (http://www.mailscanner.info) Or this if you already have a procmail infrastructure: