Hi!
We've just successfully setup SA 3.0.2 on our Apple X Serve 10.3.8, using postfix, procmail, and Squirrelmail.
I have trained the Bayes with some spam archives and I would like our users to send mail to a specific mailbox for training.
For certain specific users, I have had them add IsSpam and NotSpam folders in their Squirrelmail interfaces and I have been able to use the command:
sa-learn --showdots --spam /var/spool/imap/user/<username>/IsSpam/*.
This works nicely because the cyrus imap structure has all the individual messages named <some_number><period>
I also would like every user to be able to "bounce" mail to a special spamreport account and to that end, I have added a Squirrelmail "bounce" plug-in that come the closest to allowing our users to "bounce" mail unscathed. Unfortunately, the Return-Path header is changed to the last person (our own user) when the message is bounced.
Here are the two sets of headers (the original and after the bounce) with a little bit of personal identification removed:
The Original: ==========
Return-Path: <[EMAIL PROTECTED]>
Received: from mail.my_domain.org ([unix socket]) (authenticated user=this_is_me bits=0)
by mail.my_domain.org (Cyrus v2.1.13) with LMTP; Tue, 15 Feb 2005 23:11:02 -0500
X-Sieve: CMU Sieve 2.2
Received: from smtp812.mail.sc5.yahoo.com (smtp812.mail.sc5.yahoo.com [66.163.170.82])
by mail.my_domain.org (Postfix) with SMTP id 8053D22D779
for <[EMAIL PROTECTED]>; Tue, 15 Feb 2005 23:10:58 -0500 (EST)
Received: from unknown (HELO ?192.168.2.5?) ([EMAIL PROTECTED]@66.159.222.222 with plain)
by smtp812.mail.sc5.yahoo.com with SMTP; 16 Feb 2005 04:10:57 -0000
Mime-Version: 1.0 (Apple Message framework v619.2)
Content-Transfer-Encoding: 7bit
Message-Id: <[EMAIL PROTECTED]>
Content-Type: text/plain; charset=US-ASCII; format=flowed
To: Charlie Itsme <[EMAIL PROTECTED]>
From: Someone Else <[EMAIL PROTECTED]>
Subject: Test1
Date: Tue, 15 Feb 2005 23:10:56 -0500
X-Mailer: Apple Mail (2.619.2)
X-Spam-Level:
X-Spam-Status: No, score=-1.8 required=4.0 tests=AWL,BAYES_00 autolearn=ham
version=3.0.2
X-Spam-Checker-Version: SpamAssassin 3.0.2 (2004-11-16) on mail.my_domain.org
After the Bounce: =============
Return-Path: <[EMAIL PROTECTED]>
Received: from mail.my_domain.org ([unix socket]) (authenticated user=bounce_recipient bits=0)
by mail.my_domain.org (Cyrus v2.1.13) with LMTP; Tue, 15 Feb 2005 23:12:28 -0500
X-Sieve: CMU Sieve 2.2
Received: from mail.my_domain.org (localhost [127.0.0.1])
by mail.my_domain.org (Postfix) with ESMTP id CBF1F22D7D6
for <[EMAIL PROTECTED]>; Tue, 15 Feb 2005 23:12:26 -0500 (EST)
Received: from 66.159.222.222
(SquirrelMail authenticated user this_is_me);
by mail.my_domain.org with HTTP;
Tue, 15 Feb 2005 23:12:26 -0500 (EST)
X-Received: from mail.my_domain.org ([unix socket]) (authenticated
user=this_is_me bits=0) by mail.my_domain.org (Cyrus v2.1.13)
with LMTP; Tue, 15 Feb 2005 23:11:02 -0500
X-Sieve: CMU Sieve 2.2
X-Received: from smtp812.mail.sc5.yahoo.com (smtp812.mail.sc5.yahoo.com
[66.163.170.82]) by mail.my_domain.org (Postfix) with SMTP id
8053D22D779 for <[EMAIL PROTECTED]>; Tue,
15 Feb 2005 23:10:58 -0500 (EST)
X-Received: from unknown (HELO ?192.168.2.5?)
([EMAIL PROTECTED]@66.159.222.222 with plain) by
smtp812.mail.sc5.yahoo.com with SMTP; 16 Feb 2005 04:10:57 -0000
Mime-Version: 1.0 (Apple Message framework v619.2)
Content-Transfer-Encoding: 7bit
Message-Id: <[EMAIL PROTECTED]>
Content-Type: text/plain; charset=US-ASCII; format=flowed
To: Charlie Itsme <[EMAIL PROTECTED]>
From: Someone Else <[EMAIL PROTECTED]>
Subject: Test1
Date: Tue, 15 Feb 2005 23:10:56 -0500
X-Mailer: Apple Mail (2.619.2)
ReSent-Date: Tue, 15 Feb 2005 23:12:26 -0500 (EST)
Resent-From: "Charlie Itsme" <[EMAIL PROTECTED]>
Resent-To: [EMAIL PROTECTED]
ReSent-Message-ID: <[EMAIL PROTECTED]>
X-Spam-Level:
X-Spam-Status: No, score=-3.0 required=4.0 tests=ALL_TRUSTED,AWL,BAYES_00,
BLANK_LINES_70_80 autolearn=ham version=3.0.2
X-Spam-Checker-Version: SpamAssassin 3.0.2 (2004-11-16) on mail.my_domain.org
========
I think I should add (in local.cf):
bayes_ignore_header ReSent-Date bayes_ignore_header ReSent-From bayes_ignore_header ReSent-To bayes_ignore_header ReSent-Message-ID
I don't believe I have to add any of the X-Spam headers since SA knows about them.
But I am very reluctant to add:
bayes_ignore_header Return-Path
since that header seems important for training for all other sources other than these "bounced" messages.
So my questions:
Is SA smart enough to ignore the Return-Path if its bounced mail?
Will adding "bayes_ignore_header Return-Path" undermine the training?
Should I somehow copy these bounced messages and strip out the Return-Path in these specific messages before training?
Any other suggestions (forward and forward as attachment didn't seem to provide me with any better alternatives)?
[I was going to add shared IMAP folders but I can't seem to get past a "Can't locate Cyrus/IMAP/Shell.pm in @INC ..." error trying to start cyradm.]
Thanks for any help, Charlie