Re: [SAtalk] sa-learn says it learnt from 134 messages but check_bayes__db only reports 19 as nspam?

2003-09-07 Thread Peter Kiem
I just fed sa-learn with a maildir of spam emails and it said it learnt from them but bayes is reporting a significantly less number of learnt spams. OK, since noone could offer any ideas, just for kicks I renamed the bayes* files from the .spamassassin directory and fed it my ham and spam

Re: [SAtalk] I want SA to handle mailinglists. Need some creativity here.

2003-09-07 Thread Carlo Wood
On Sat, Sep 06, 2003 at 11:54:22PM -0500, Bill Polhemus wrote: What's the problem with just putting *:catch-all mailbox In your ALIASES file? That would catch all mail with an unknown To: field, not with unsubscribed 'From:' fields. I suggest to use procmail to filter your mail

Re: [SAdev] Re: [SAtalk] [RD] new rules for listwashing tokens,ROT-13 etc.

2003-09-07 Thread Yorkshire Dave
On Sat, 2003-09-06 at 21:03, Theo Van Dinter wrote: On Wed, Aug 27, 2003 at 01:29:52AM +0100, Yorkshire Dave wrote: 0= 0=Q A quick roundup. Can anyone fill any gaps? 2queers cipher -- in html comment plain = abcdefghijklmnopqrstuvwxyz0123456789-_.@ coded = dO,9F_ [EMAIL

[SAtalk] sa-learn

2003-09-07 Thread landy
i ran the follwoing command several times withing seconds, why did it learn from different messages every time? [EMAIL PROTECTED]:~ sa-learn --spam --dir evolution/local/Inbox/subfolders/spam/ Learned from 2 messages. [EMAIL PROTECTED]:~ sa-learn --spam --dir

[SAtalk] writing rules

2003-09-07 Thread skopel
1 How do I write my own rules? is there a tutorial someplace? 2 Spam changes with time, which means the tests should be updated. How to I get the current tests performed? ie can I simply copy the new .cf files from a newer version of spamassassin? or do I have to reinstall completely? or is

[SAtalk] Bug: SA classifies TheBat 2.0 mail as spam

2003-09-07 Thread Roman Katzer
SA thinks mail from TheBat 2.0 is spam: Date: Sat, 06 Sep 2003 13:17:08 -0400 From: Stefan Tanurkov [EMAIL PROTECTED] Subject: Re: 2.00: bug in Scheduler To: Peter Fjelsten [EMAIL PROTECTED] [...] Message-id: [EMAIL PROTECTED] Organization: RITLABS Inc. MIME-version: 1.0 X-Mailer: The

Re: [SAtalk] I want SA to handle mailinglists. Need some creativity here.

2003-09-07 Thread Steven W. Orr
On Sunday, Sep 7th 2003 at 12:39 +0200, quoth Carlo Wood: =On Sat, Sep 06, 2003 at 11:54:22PM -0500, Bill Polhemus wrote: = What's the problem with just putting = = *: catch-all mailbox = = In your ALIASES file? = =That would catch all mail with an unknown To: field, =not with unsubscribed

Re: [SAtalk] sa-learn

2003-09-07 Thread Theo Van Dinter
On Sun, Sep 07, 2003 at 08:40:38AM -0400, landy wrote: i ran the follwoing command several times withing seconds, why did it learn from different messages every time? impossible to say without seeing the messages. I'd bet it's related to messages w/out message-id headers. -- Randomly

Re: [SAtalk] Bug: SA classifies TheBat 2.0 mail as spam

2003-09-07 Thread Theo Van Dinter
On Sun, Sep 07, 2003 at 02:11:58PM +0200, Roman Katzer wrote: Since Stefan Tanurkov is the head programmer of TheBat, I don't think he has to fake his mail client ;-) please post bugs on bugzilla.spamassassin.org, otherwise we may not see them and/or remember about them. -- Randomly Generated

Re: [SAtalk] Bug: SA classifies TheBat 2.0 mail as spam

2003-09-07 Thread Roman Katzer
On Sunday, September 7, 2003, 18:11:29, Theo Van Dinter wrote: please post bugs on bugzilla.spamassassin.org, otherwise we may not see them and/or remember about them. Thanks for the pointer, I just did. Roman -- Roman Katzer, Aachen, Germany You know those people who won't take no for an

Re: [SAtalk] I want SA to handle mailinglists. Need some creativity here.

2003-09-07 Thread Bart Schaefer
On Sun, 7 Sep 2003, Steven W. Orr wrote: If someone sends *me* a copy of Sobig:F I catch it as spam just fine. But if it's sent to listaddr, then it comes in with no processing from SA. What I want is to say that *all* incoming mail needs to go through SA. This probably means that your

[SAtalk] Header problem, intermittent, SpamAssassin 2.60 rc3

2003-09-07 Thread Jerry Glomph Black
I've gotten a few spams which were scored appropriately, such as: the example below, which got 12.7 spampoints. BUT: the X-Spam-Status header indicates a score of 0.7 points! Why is this very partial sum being used for this header? This problem happens for about 1% of the spams I get. Have

Re: [SAtalk] Header problem, intermittent, SpamAssassin 2.60 rc3

2003-09-07 Thread Theo Van Dinter
On Sun, Sep 07, 2003 at 10:06:51AM -0700, Jerry Glomph Black wrote: I've gotten a few spams which were scored appropriately, such as: the example below, which got 12.7 spampoints. BUT: the X-Spam-Status header indicates a score of 0.7 points! Why is this very partial sum being used for

Re: [SAtalk] redhat 8 rpm packages missing some files?

2003-09-07 Thread Karl Larsen
sa-learn is in every SA version 2.50 and up. You can get the latest stable from the SA web page. On Sat, 6 Sep 2003, Scott wrote: Theo Van Dinter wrote: On Sat, Sep 06, 2003 at 08:04:22PM -0700, Scott wrote: My redhat 8 box with courier and spamassassin doesn't have sa-learn

[SAtalk] Fitz, an add-on to Spamassassin

2003-09-07 Thread Thorsten Sick
Hello, everybody As a part of my diploma thesis I developed an AI to sort Mails into Ham/Spam. The AI alone was able to get about the same results as Spamassassin. Which means it is very good. I released it as Open Source and it is downloadable at http://spam-fitz.de You just overwrite a few

Re: [SAtalk] Header problem, intermittent, SpamAssassin 2.60 rc3

2003-09-07 Thread Jerry Glomph Black
That's fine, but why is the score number not the same? I've been using procmail to read the X-Spam-Status numerical value, and thus decide where to deliver a given message. Is this header now irrelevant? On Sun, 7 Sep 2003, Theo Van Dinter wrote: On Sun, Sep 07, 2003 at 10:06:51AM -0700,

Re: [SAtalk] I want SA to handle mailinglists. Need some creativity here.

2003-09-07 Thread Steven W. Orr
On Sunday, Sep 7th 2003 at 10:02 -0700, quoth Bart Schaefer: =On Sun, 7 Sep 2003, Steven W. Orr wrote: = = If someone sends *me* a copy of Sobig:F I catch it as spam just fine. = But if it's sent to listaddr, then it comes in with no processing from = SA. What I want is to say that *all* incoming

[SAtalk] bayes db management philosophy

2003-09-07 Thread Diffenderfer, Randy
Folks, Looking for conventional wisdom on handling the bayes db in a centralized production environment. I don't intend to do autolearning, but instead to feed a carefully selected set of messages into a central SA instance, creating the db via sa-learn. I then want to take that database and

[SAtalk] Cannot open bayes_path $path R/O:

2003-09-07 Thread Edward Muller
I moved my site from one of my servers to another one of my servers over the weekend. Everything was copied and user permissions were reset just in case. I am now getting the following errors in my log files... Cannot open bayes_path /home/edwardam/.spamassassin/bayes R/O: Cannot open bayes_path

[SAtalk] Spam with score of 0.0

2003-09-07 Thread Jason
This message made is past SpamAssassin without setting off ANYTHING? (Score of 0.0.) Return-Path: [EMAIL PROTECTED] Received: by bluehome.net (CommuniGate Pro PIPE 4.1.3) with PIPE id 360205; Sun, 07 Sep 2003 12:05:04 -0700 Received: from [68.198.199.174] (HELO atlantech.fr) by bluehome.net

Re: [SAtalk] adjusting a few things in local.cf

2003-09-07 Thread Lucas Albers
#Higher scores for bayes score BAYES_80 2.50 score BAYES_90 4.00 I think that I messed up with the BAYES ones.. BAYES_80 isn't adding the 2.5 points like I wanted. =/ 2.60 signifigantly increases default bayes scores. --- This sf.net

Re: [SAtalk] Spam with score of 0.0

2003-09-07 Thread Jason
No. I use CGPSA to get the messages to CommuniGate Pro. I think it's interesting that it Identified non-spam (0.0/5.0) for default in 0.0 seconds Seems too quick. Anyway, here's the message. 12:12:30.24 4 EXTFILTER(CGPSA) out(25): 54 FILE Queue/360223.msg\n 12:12:30.28 2 PWD-00035([127.0.0.1])

Re: [SAtalk] writing rules

2003-09-07 Thread Scott Kopel
yes thanks, but how does SA know which rbl checks to do... ie how do I configure what services I want SA to check? thanks again 3 I don't see any evidence that RBL checks are being made in my spam headers. How can I make sure the RBL checks are being done? enable them in local.cf:

Re: [SAtalk] FW: Feedback on how identified spam is being handled

2003-09-07 Thread Nigel Wilkinson
On Sun, 07 Sep 2003 15:55:40 -0400 Burt Juda [EMAIL PROTECTED] wrote: You can pipe it thru 'formail' to nuke the headers that SA added, like so: cat FILE | formail -I'X-Spam-Checker-Version:' -I'X-Spam-Level: \ -I'X-Spam-Status:' -ds sa-learn --[s]ham The '-ds'

Re: [SAtalk] Header problem, intermittent, SpamAssassin 2.60 rc3

2003-09-07 Thread Jerry Glomph Black
Thanks to Theo Van Dinter for his polite patience following my 'bug' report. It was a problem, but related to a known bug in 2.60rc3, that the PREFIX of installation of the local config files is not being set to the usual /etc/mail/spamassassin, but rather /mail/spamassassin.Thus my local.cf

Re: [SAtalk] writing rules

2003-09-07 Thread Matt Kettler
At 09:05 AM 9/7/03 -0400, [EMAIL PROTECTED] wrote: 1 How do I write my own rules? is there a tutorial someplace? Yes there is a tutorial someplace, or a least a fairly decent howto on the subect. http://mywebpages.comcast.net/mkettler/sa/SA-rules-howto.txt 2 Spam changes with time, which means

Re: [SAtalk] writing rules

2003-09-07 Thread SpamAssassin Talk
for example use these: tflagsBL_BLKHOLE_INTERSILnet headerBL_BLKHOLE_INTERSIL rbleval:check_rbl('intersil', 'blackholes.intersil.net.') describeBL_BLKHOLE_INTERSILSpammer list in: blackholes.intersil.net (Basis entry) score

Re: [SAtalk] Spam with score of 0.0

2003-09-07 Thread Simon Byrnand
At 12:08 7/09/2003 -0700, Jason wrote: This message made is past SpamAssassin without setting off ANYTHING? (Score of 0.0.) [snip] Very odd. You must have something wrong with your setup there, as I see heaps of spams identical to that one that score quite highly... (somewhere between 10 and 20

Re: [SAtalk] writing rules

2003-09-07 Thread AltGrendel
[EMAIL PROTECTED] wrote: 1 How do I write my own rules? is there a tutorial someplace? 2 Spam changes with time, which means the tests should be updated. How to I get the current tests performed? ie can I simply copy the new .cf files from a newer version of spamassassin? or do I have to

Re: [SAtalk] OSIRUSOFT

2003-09-07 Thread Justin Mason
Bob Proulx writes: I prefer the benevolent dictator approach, where I get to choose and to ignore the dictators of my choice. Yeah, me too -- a web of trust, basically. If I know that J. Random Listing Guy is publishing a bl, which a public reviews site measures as listing 80% of spam with

Re: Re[2]: [SAtalk] [RD] MSGID_GOOD_EXCHANGE

2003-09-07 Thread Justin Mason
Robert Menschel writes: * What specific business are your people in? Much of our mail deals with lumber and building materials. ... Ssh! This is a public forum! Don't mention the lumber cartel!! http://lumbercartel.freeyellow.com/ ;) --j.

Re: [SAtalk] Performance optimization for bigger setups

2003-09-07 Thread Justin Mason
trey valenta writes: On Fri, Sep 05, 2003 at 04:56:45PM +0200, Jochen Tuchbreiter wrote: Did any of you analyze (profile) where most of the CPU in spamassassin is spent? I ran spamassassin --lint through Perl's profiler back in July. I know this isn't the same as checking mail for

Re: [SAdev] Re: [SAtalk] [RD] new rules for listwashing tokens,ROT-13 etc.

2003-09-07 Thread Justin Mason
Yorkshire Dave writes: I don't know if I should be adding these incomplete ciphers to detoken.pl, they're useful to a lot of people before they're complete. Any opinions? I'd say so -- you could leave the unknowns as ., and count the dots once the RE is generated; if there's too many (ie. 3 or

Re: [SAtalk] Spam with score of 0.0

2003-09-07 Thread Justin Mason
Jason writes: No. I use CGPSA to get the messages to CommuniGate Pro. I think it's interesting that it Identified non-spam (0.0/5.0) for default in 0.0 seconds Seems too quick. Anyway, here's the message. I would guess that's not finding the config files -- if there's no rules, it'll run

Re: [SAtalk] sa-learn says it learnt from 134 messages but check_bayes__db only reports 19 as nspam?

2003-09-07 Thread Justin Mason
Peter Kiem writes: I just fed sa-learn with a maildir of spam emails and it said it learnt from them but bayes is reporting a significantly less number of learnt spams. OK, since noone could offer any ideas, just for kicks I renamed the bayes* files from the .spamassassin directory and

Re: [SAtalk] bayes db management philosophy

2003-09-07 Thread Justin Mason
Diffenderfer, Randy writes: Looking for conventional wisdom on handling the bayes db in a centralized production environment. I don't intend to do autolearning, but instead to feed a carefully selected set of messages into a central SA instance, creating the db via sa-learn. I then want to

Re: [SAtalk] Cannot open bayes_path $path R/O:

2003-09-07 Thread Edward Muller
Okay I did check the output of `file` for the bayes* files and it was GDBM, I removed them and re-created them. They are now DB files and everything is working. Thanks to #spamassassin. On Sun, 2003-09-07 at 13:14, Edward Muller wrote: I moved my site from one of my servers to another one of my

Re: [SAtalk] Performance optimization for bigger setups

2003-09-07 Thread Justin Mason
trey valenta writes: On Sun, Sep 07, 2003 at 08:37:59PM -0700, Justin Mason wrote: BTW I would suggest instead profiling spamd -- --lint is not the case you want to optimize for ;) Thanks, I'll try that. I realized --lint isn't the ideal test case, but made the assumption (which might be

Re: [SAtalk] Performance optimization for bigger setups

2003-09-07 Thread trey valenta
On Sun, Sep 07, 2003 at 08:37:59PM -0700, Justin Mason wrote: BTW I would suggest instead profiling spamd -- --lint is not the case you want to optimize for ;) Thanks, I'll try that. I realized --lint isn't the ideal test case, but made the assumption (which might be wrong) that Conf::_parse

Re: [SAtalk] Performance optimization for bigger setups

2003-09-07 Thread Simon Byrnand
At 20:37 7/09/2003 -0700, Justin Mason wrote: trey valenta writes: On Fri, Sep 05, 2003 at 04:56:45PM +0200, Jochen Tuchbreiter wrote: Did any of you analyze (profile) where most of the CPU in spamassassin is spent? I ran spamassassin --lint through Perl's profiler back in July. I know