> -----Original Message-----
> From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]
> Sent: Wednesday, June 18, 2008 12:10 PM
> To: John GALLET
> Cc: users@spamassassin.apache.org
> Subject: Re: [Rule Set proposal] French Rules
>
> ...omissis...
>
> by the way, if you're reasonably perl-capable, it might be worthwhile
> using the algorithm I use to generate the JM_SOUGHT ruleset for english
> spam: http://taint.org/tag/rule-discovery
>
> you just give it a corpus of spam samples and it generates the rules for
> you. The code is in SpamAssassin SVN.
>
> --j.

Nah, that's great! I regret that I can only occasionally read interesting messages because of my own time constraints; otherwise I could have read about this set of scripts weeks ago.

How is this code supposed to be used? I see these scripts in rule-dev: maildir-scan-headers, seek-phrases-in-corpus, seek-phrases-in-log and strip-high-scorers-from-log. Could you give us a brief description of what each one does and how to use it?

Nice idea, Justin!

Giampaolo