John Hardin wrote:
Would there be any benefit to having an offline version - i.e. something that evaluates the log or a corpus to generate new meta rules, that could be added onto the default ruleset? For instance:

cron @ 0200:
sa_meta_eval > /etc/mail/spamassassin/metarules.cf
/etc/init.d/spamassassin restart



This is definetly a good idea. You can create the SVM model offline from a logfile only, if it includes the rules that scored and the ham/spam status. However, you cannot generate metarules with SVMs, for that purpose you need a different learning algorithm (for example bayes, or decision trees).

However, SVM classification is very cheap, so once you created the model offline, you can use it online really quickly with a plugin.



Cheers,



Chris

Attachment: smime.p7s
Description: S/MIME Cryptographic Signature

Reply via email to