So this is really top 8 rules if you rule out HTML_MESSAGE and MIME_HTML_ONLY. Everything else was more than likely spam. If there are hams hitting WS_URI_RBL then we're all in trouble...
Gary -----Original Message----- From: jdow [mailto:[EMAIL PROTECTED] Sent: Thursday, August 26, 2004 9:40 PM To: [EMAIL PROTECTED] Subject: Re: Top ten rules... That looks like top ten hits. I'd like to know the top ten hits to misses ratios. For example I presume HTML_MESSAGE hits a lot of ham, too. So what is the ratio of its spam to ham hits? The higher the ratio, such as for BAYES_99 the better the rule. I can craft a rule that hits EVERY spam I receive. Um, it would also hit every ham, too. So it's not a useful rule, is it? (A good part of engineering is learning to ask the RIGHT questions.) {^_-} ----- Original Message ----- From: "Mike French" <[EMAIL PROTECTED]> > Just for grins the top 10 list... > > > Ranking of Tests in Spammails: > ------------------------------ > 90.19 % 2870 : BAYES_99 > 81.14 % 2582 : HTML_MESSAGE > 78.79 % 2507 : RAZOR2_CHECK > 78.69 % 2504 : RAZOR2_CF_RANGE_51_100 > 66.72 % 2123 : WS_URI_RBL > 60.91 % 1938 : OB_URI_RBL > 44.85 % 1427 : SPAMCOP_URI_RBL > 34.41 % 1095 : AB_URI_RBL > 32.50 % 1034 : MIME_HTML_ONLY > 30.89 % 983 : NO_RDNS > > > > Mike French > MIS OnlineServices
