On 04/01, John Hardin wrote:
> These appear to be doing pretty good, I've exposed them for scoring
> and renamed them to __DX_TEXT_*

Cool, thanks. Two related thoughts:

1) Has there been any consideration in recent years of adjusting
whatever thresholds limit the number of rules, to include more rules?
That's some kind of automated decision, right?

2) More automation of rule generation and testing... Maybe modify the
mass check script to run seek-phrases-in-corpus, only on spams below
the default threshold? Upload results automatically, score them
automatically?

-- 
"Hello, babies. Welcome to Earth. It's hot in the summer and cold in
the winter. It's round and wet and crowded. At the outside, babies,
you've got about a hundred years here. There's only one rule that I
know of, babies—God damn it, you've got to be kind."
- Kurt Vonnegut
http://www.ChaosReigns.com
