On Sun, Apr 25, 2010 at 00:57, Sidney Markowitz <sid...@sidney.com> wrote: > Adam Katz wrote, On 25/04/10 8:22 AM: >> Today, I saw this in svn at masses/README.perceptron: > > See this that Justin posted to sa-dev that explains the history of our > using GA, then perceptron, then back to GA. > > It also links to Duncan Findlay's thesis work on using logistic > regression as a faster algorithm that gets better results, but I don't > know what ended up happening with that. > > http://mail-archives.apache.org/mod_mbox/spamassassin-dev/200707.mbox/%3c20070701224117.f1a7732...@radish.jmason.org%3e > > or if that link gets garbled, also archived at > > http://www.mail-archive.com/dev@spamassassin.apache.org/msg21162.html
Yep. Basically, the perceptron implementation seems to require a lot of hand-tuning to produce decent results. The GA is a lot more "fire and forget", if slower. --j.