On 18-May-2009, at 19:02, Michael Monnerie wrote:
> I didn't mean that the final result be a FP, just this one ruleset.
> Shouldn't the goal be to have no FPs and lots of corrects?
In a word? No.
Tests are designed to be cumulative. Something that is seen 75% of the
time in spam and 25% of the time in ham is still useful to give a
positive (albeit small) score to. Something that is seen 99.7% of the
time in spam and 0.3% of the time in ham is a lot more useful, so it
gets a higher score. This doesn't mean we discard the first rule.
The point of SA is not to have rules that ONLY hit spam or ONLY hit
ham, because the world does not work that way. If something is a
potential spam indicator, it can get a low score (0.1), or it can be
used in combination with other rules to generate a rather high score.
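
To make the cumulative idea concrete, here is a rough sketch in Python. It is not how SA is implemented. The rule names, the individual scores, and the combination rule are all made up for illustration; the only real value is the 5.0 threshold, which is SA's default required score.

```python
# Illustrative sketch only: rule names and scores are hypothetical.
RULE_SCORES = {
    "WEAK_A": 0.1,    # a weak indicator, also seen in some ham
    "WEAK_B": 0.1,    # another weak indicator
    "STRONG_C": 3.0,  # seen almost exclusively in spam
}

# Hypothetical combination rule: fires only when all listed rules hit.
META_RULES = {
    "WEAK_A_AND_B": (("WEAK_A", "WEAK_B"), 4.9),
}

REQUIRED_SCORE = 5.0  # SA's default threshold


def total_score(hits):
    """Sum the scores of every rule that hit, plus any combination rules."""
    hits = set(hits)
    total = sum(RULE_SCORES[name] for name in hits)
    for _meta_name, (needed, meta_score) in META_RULES.items():
        if set(needed) <= hits:
            total += meta_score
    return total


def is_spam(hits, required=REQUIRED_SCORE):
    """A message is flagged only when the cumulative score reaches the threshold."""
    return total_score(hits) >= required
```

With these made-up numbers, a message hitting only WEAK_A scores 0.1 and is nowhere near being flagged, so an occasional ham hit on that rule costs nothing. A message hitting both WEAK_A and WEAK_B also triggers the combination rule and crosses the threshold.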
--
A marriage is always made up of two people who are prepared to
swear that only the other one snores.