Hi Bob, Sorry for the long delay in my response. I have taken a little break. Thanks for running the rules through masscheck against your corpus. I have no where near the corpus that you do and find the testing methodology in your first and second run, and results very interesting. Thanks again!
--Larry > -----Original Message----- > From: Robert Menschel > > Tuesday, November 25, 2003, 10:12:27 PM, I wrote: > > LG>> Attached is a custom rule file. It has been working rather well > LG>> and I will be increasing the score from 0.5 to 1.0. The cf file > LG>> also has some rules looking for words obfuscated by pipes. They > LG>> have been working well also. > > RM> FYI, My masscheck results with your rules (run against my corpus > RM> of 58,857 emails). Final number on each line is what I would > RM> initially score them based on these hits (per my algorithm posted > RM> at http://www.exit0.us/index.php/RM_RuleScoring -- most sites > RM> should probably score these lower, and I would probably want to > RM> do a 2-pass or 3-pass GA on these to refine the scores myself). > > Ran a second mass-check pass, assigning those scores > (repeated below), and with those scores, > * I had no FPs based on these rules -- highest score in any ham was > less than 6.0 out of 9.0 > * These rules by themselves, with these scores, correctly flagged 263 > as spam with scores of 9.0 or higher. > > I'm guessing that others could take my scores, adjust for > threshold and other rules, and use these rules productively. > If you score spam at 5.0, then you'd want to multiply each of > my scores by 5/9 (or maybe a little less, on the theory that > other rules should hit the spam also). > > My own hesitation rests with that "and other rules" part ... > If this ruleset by itself can bring a ham to (say) 4.0 of > 9.0, and that ham matches 5.0 worth of rules without this > ruleset, it'll then be wrongly flagged as spam. > > I am just about at the point where I can do a mass-check > including all of my custom rules, but I haven't yet figured > out how to incorporate all of the distributed rules in that > mass-check. > > I'm supplying my ruleset in testdir/spamassassin/user_prefs, > and pointing to that directory using the -c parameter to > mass-check. If I were to copy all of the 2.60 *.cf rules > files into that testdir/spamassassin directory, would that > activate the default rules for the mass-check test? > > Bob Menschel > > RM> MY_RBDY_PDS_1P3 -- 375s / 22h -- 1.163 > RM> MY_RBDY_PDS_1P4 -- 365s / 5h -- 1.608 > RM> MY_RBDY_PDS_1P5 -- 210s / 3h -- 1.700 > RM> MY_RBDY_PDS_1P6 -- 165s / 2h -- 0.550 > RM> MY_RBDY_PDS_1P7 -- 88s / 0h -- 1.880 > RM> MY_RBDY_PDS_1P8 -- 121s / 4h -- 1.302 > RM> MY_RBDY_PDS_2P2 -- 168s / 14h -- 1.112 > RM> MY_RBDY_PDS_2P3 -- 105s / 45h -- 0.228 > RM> MY_RBDY_PDS_2P4 -- 311s / 14h -- 1.207 > RM> MY_RBDY_PDS_2P5 -- 56s / 7h -- 0.700 > RM> MY_RBDY_PDS_2P6 -- 161s / 8h -- 1.179 > RM> MY_RBDY_PDS_2P7 -- 89s / 5h -- 1.148 > RM> MY_RBDY_PDS_2P8 -- 4s / 5h -- 0.067 > RM> MY_RBDY_PDS_3P1 -- 200s / 15h -- 1.125 > RM> MY_RBDY_PDS_3P2 -- 173s / 25h -- 6.654 > RM> MY_RBDY_PDS_3P3 -- 179s / 58h -- 0.303 > RM> MY_RBDY_PDS_3P4 -- 74s / 15h -- 0.463 > RM> MY_RBDY_PDS_3P5 -- 195s / 12h -- 1.150 > RM> MY_RBDY_PDS_3P6 -- 43s / 5h -- 0.717 > RM> MY_RBDY_PDS_3P7 -- 3s / 5h -- 0.050 > RM> MY_RBDY_PDS_3P8 -- 42s / 49h -- 0.084 > RM> MY_RBDY_PDS_4P1 -- 285s / 32h -- 0.864 > RM> MY_RBDY_PDS_4P2 -- 417s / 21h -- 1.190 > RM> MY_RBDY_PDS_4P3 -- 259s / 82h -- 0.312 > RM> MY_RBDY_PDS_4P4 -- 160s / 26h -- 0.593 > RM> MY_RBDY_PDS_4P5 -- 56s / 17h -- 0.311 > RM> MY_RBDY_PDS_4P6 -- 7s / 0h -- 0.700 > RM> MY_RBDY_PDS_4P7 -- 3s / 12h -- 0.023 > RM> MY_RBDY_PDS_4P8 -- 2s / 0h -- 0.200 > RM> MY_RBDY_PDS_5P1 -- 84s / 21h -- 0.382 > RM> MY_RBDY_PDS_5P3 -- 99s / 464h -- 0.021 > RM> MY_RBDY_PDS_5P5 -- 81s / 12h -- 0.623 > RM> MY_RBDY_PDS_6P6 -- 99s / 464h -- 0.021 > RM> MY_HDR_PDS_1P5 -- 140s / 0h -- 2.400 > RM> MY_HDR_PDS_2P1 -- 244s / 3h -- 1.610 > RM> MY_HDR_PDS_2P4 -- 176s / 13h -- 1.126 > RM> MY_HDR_PDS_3P2 -- 308s / 9h -- 1.308 > RM> MY_HDR_PDS_3P3 -- 607s / 528h -- 0.115 > RM> MY_HDR_PDS_3P5 -- 108s / 0h -- 2.080 > RM> MY_HDR_PDS_3P8 -- 73s / 0h -- 1.730 > RM> MY_HDR_PDS_4P3 -- 481s / 519h -- 0.093 > RM> MY_HDR_PDS_4P4 -- 114s / 13h -- 0.814 > RM> MY_HDR_PDS_4P5 -- 82s / 0h -- 1.820 > RM> MY_HDR_PDS_5P1 -- 171s / 0h -- 2.710 > RM> MY_HDR_PDS_6P1 -- 159s / 0h -- 2.590 > RM> MY_HDR_PDS_6P2 -- 122s / 9h -- 1.122 > RM> MY_BDY_PIPE_S233S -- 17s / 0h -- 1.170 > RM> MY_BDY_PIPE_S23S -- 35s / 0h -- 1.350 > RM> MY_BDY_PIPE_S23C -- 17s / 0h -- 1.170 > RM> MY_BDY_PIPE_S24S -- 42s / 0h -- 1.420 > RM> MY_BDY_PIPE_S34P -- 0s / 0h -- 0.100 > RM> MY_HDR_PIPE_S233S -- 0s / 0h -- 0.100 > RM> MY_HDR_PIPE_S23S -- 0s / 0h -- 0.100 > RM> MY_HDR_PIPE_S23C -- 0s / 0h -- 0.100 > RM> MY_HDR_PIPE_S24S -- 0s / 0h -- 0.100 > RM> MY_HDR_PIPE_S34P -- 0s / 0h -- 0.100 ------------------------------------------------------- This SF.net email is sponsored by: SF.net Giveback Program. Does SourceForge.net help you be more productive? Does it help you create better code? SHARE THE LOVE, and help us help YOU! Click Here: http://sourceforge.net/donate/ _______________________________________________ Spamassassin-talk mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/spamassassin-talk