Nigel Wilkinson wrote: > Why does a 99-100% probability score less than an 80-95% probability???
Because the Bayes engine is not the only factor in classifying a message as spam. Along with that all of the other rules are factored into it too. A message which is 99-100% probability is going to trigger many of the other SA rules. The total is enough to push the message over the 5 point threshold. The scoring program therefore did not need to make the BAYES_99 score any higher than it did. And I also believe there is a value in the SA development team that no single rule should be too large. It can lead to false positives. It is better to be conservative and avoid false positives for the masses. However, *I* don't like seeing the same spam again and again. With the default values I would see a spam, train for it, and still see the same spam again and again because it would only score BAYES_99 and be below the threshold. Often this is before it is reported and before network tests such as RBLs and SURBL can tag the sender. So I increase the BAYES_95 and BAYES_99 points to 4.0 and 5.0 for my own personal use. That way if the same spam comes through again, as I know it will, it will get tagged. But I can't say with any authority that this won't generate false positives. I can only say that I have only myself to blame in that case and also that since I know what it is doing I won't be surprised by it. Bob
signature.asc
Description: Digital signature