RE: Manually training SpamAssassin by forwarding mail

Sander Holthaus - Orange XL 4 Feb 2005 18:07:51 -0000

 

> -----Original Message-----
> From: Stuart Johnston [mailto:[EMAIL PROTECTED] 
> Sent: Friday, February 04, 2005 5:20 PM
> To: users@spamassassin.apache.org
> Cc: Peter Marshall
> Subject: Re: Manually training SpamAssassin by forwarding mail
> 
> Peter Marshall wrote:
> > Kevin Sullivan wrote:
> > 
> >> --On 02/03/05 01:59:21 +0100 Sander Holthaus - Orange XL wrote:
> >>
> >>> I've been interested in offering customers to train 
> manually train 
> >>> the SpamAssassin Bayes filter for ham and spam (to reduce false 
> >>> positives and negatives). However, I can only find 
> documentation to 
> >>> this for local mailboxes and IMAP. Most users however, retrieve 
> >>> their mail through POP and use Outlook (Express) as mail 
> client. Is 
> >>> there a way to train SpamAssassin with such a setup (e.g. 
> forwarding 
> >>> mail with Outlook
> >>> (Express) using SMTP)?
> >>
> >>
> >>
> >> If you want to do a lot of programming, you could save all 
> incoming 
> >> messages for a few days in a database somewhere.  When a user 
> >> forwards a message to a special "ham" or "spam" mailbox, 
> you pull the 
> >> message-id from the message and use it to recover the original 
> >> message from your database.
> >>
> >>     -Kevin
> > 
> > 
> > My question is the same as Henrik, I have a bunch of email that is 
> > spam (either tagged by spam assassin or not tagged at all.  
> I forwared 
> > it as an attachment to a "spam" mail box.  What do I have to do now 
> > before I can get bayes to learn the message ... I read you have to 
> > remove the headers .... Could anyone give me a little more detail ?
> 
> I use a modified version of the DMZS-sa-learn.pl from: 
> http://www.dmzs.com/tools/files/spam.phtml
> 
> When someone forwards a spam to me, I move the message to a 
> special imap folder that gets processed by the script.  My 
> additions look something like:
> 
> use Email::MIME;
> ...
> my $msg = Email::MIME->new($raw_message_body);
> 
> my @parts = $msg->parts;
> 
> foreach (@parts) {
>    if ($_->content_type =~ m|message/rfc822|) {
>      sa_learn($_->body_raw);
>    }
> }
> 
> 
> I've tested this with messages forwarded as attachment from 
> Outlook and Thunderbird.  I'm not sure how effective it is 
> though.  I'm sure that it still looses something in the 
> translation.  All imap is really the way to go if you can.
> 
> 
> Stuart Johnston


Would it be an idea to stip the delivered to-header from the message, as
this will have no meaning to distinct between ham/spam? 

Also, I was wondering if anybody who is using spam-learn and ham-learn has
any protection build in to stop non-system users from mailing to those
addresses? 

Kind Regards,
Sander Holthaus

RE: Manually training SpamAssassin by forwarding mail

Reply via email to