http://issues.apache.org/SpamAssassin/show_bug.cgi?id=5686





------- Additional Comments From [EMAIL PROTECTED]  2007-10-18 13:15 -------
K3=2
SCORE  NUMHIT   DETAIL     OVERALL HISTOGRAM  (. = ham, # = spam)
0.240 (39.524%) 
..........|.......................................................
0.280 (36.032%) ..........|..................................................
0.280 ( 0.110%) ##        |
0.360 ( 2.227%) ..........|...
0.400 (21.306%) ..........|..............................
0.400 ( 0.055%) #         |
0.440 ( 0.911%) ..........|.
0.440 ( 0.827%) ##########|#
0.480 ( 2.205%) ##########|##
0.520 (56.505%) 
##########|#######################################################
0.560 (40.132%) ##########|#######################################
0.680 ( 0.055%) #         |
0.720 ( 0.110%) ##        |


I've also implemented the Bayes chain rule algorithm described
in the EDDC paper, in r585450.  Here's a histogram using K3=1 and
that combiner:

SCORE  NUMHIT   DETAIL     OVERALL HISTOGRAM  (. = ham, # = spam)
0.000 (100.000%) 
..........|.......................................................
0.000 ( 1.764%) ##########|#
0.960 (98.236%) 
##########|#######################################################

Threshold optimization for hamcutoff=0.30, spamcutoff=0.70: cost=$32.00
Total ham:spam:   1976:1814
FP:     0 0.000%    FN:    32 1.764%
Unsure:     0 0.000%     (ham:     0 0.000%    spam:     0 0.000%)
TCRs:              l=1 56.688    l=5 56.687    l=9 56.688
SUMMARY: 0.30/0.70  fp     0 fn    32 uh     0 us     0    c 32.00


that's pretty cool.  0% FP rate!  but the 1.7% FN rate is not great.
the tweaks continue...



------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

Reply via email to