Re: sa-learn: have i seen this before?

Faisal N Jawdat Mon, 16 Apr 2007 22:55:03 -0700

On Apr 16, 2007, at 9:34 PM, Matt Kettler wrote:

Try to learn it, if it comes back with something to the affect of:
"learned from 0 messages, processed 1.." then it's already beenlearned.


this seems to be the common suggestion.

it has a couple drawbacks, as i see it:

1. it's relatively cpu-intensive if i want to do it all the time(e.g. scan my spam folder to learn only the messages which haven'talready been learned)


2.  which way do i learn it.

to step back a bit, my final goal is to be able to figure out whichmessages in a folder haven't been learned, and learn only those. inthe ideal situation i can also figure out (ahead of time), whether alearned message was learned as ham or spam.


this may be semi-impossible.

on the other hand, what can i learn from the headers?

e.g. it looks like autolearn=[something] will tell me about theautolearner, but is there anything for manual learns?


where i'm going with all this:

i can run a cron job to learn the contents of different mailboxes ona regular basis. what i do now is have a TrainSpam and TrainHammailbox, and when something gets misfiled (in Spam or any ham folder)i just move it in there. every 5 minutes a cron job goes through andscans things appropriately. <http://www.faisal.com/software/sa-harvest/quicktrain.html>

first, i'd like to be able to do that within the mailboxes ratherthan using special mailboxes.

second, i'd like to be able to key off junk mail flags set by theclient (thunderbird, apple mail). i'm using dovecot, so it's afairly simple matter of parsing Maildir filenames, but to do it righti need to combine the knowledge with what spamassassin thinks.

i might just go write a dovecot plugin to do this in real-time, buti'm not feeling the motivation to break the mail server with amisplaced pointer.


-faisal

Re: sa-learn: have i seen this before?

Reply via email to