Re: Feeding SA-learn

Anthony Peacock Thu, 24 Jan 2008 04:20:17 -0800

John Thompson wrote:

On 2008-01-23, Anthony Peacock <[EMAIL PROTECTED]> wrote:
My intention was to manually feed the few spam messages that slip thruundetected. By the time I get a hold of those, they are in therecipient's mail client inbox, not in the server.I was thinking, if I save the mail as EML files, would that preserve theheaders in a way that sa-learn can parse correctly?
Depends on the client.
For instance, Thunderbird stores it's folders in mbox format, sosa-learn can work against those files as-is. Other email clients cansave emails in text format complete with headers.
The biggest problem with this is training the users to do that consistantly.
Isn't that what "cron" is for? :-)
I have a cron job on my imap server to regularly feed ham and spamthrough sa-learn.

I have a cron job that runs the learning process nightly. I wasrefering to the process of gathering the false-negatives andfalse-positives. That has to be done by hand, as a decision needs to bemade about whether they are spam or not. And, by definition, theautomatic process has got it wrong.



--
Anthony Peacock
CHIME, Royal Free & University College Medical School
WWW:    http://www.chime.ucl.ac.uk/~rmhiajp/
"A CAT scan should take less time than a PET scan.  For a CAT scan,
 they're only looking for one thing, whereas a PET scan could result in
 a lot of things."    - Carl Princi, 2002/07/19

Re: Feeding SA-learn

Reply via email to