jdow wrote: > From: <[EMAIL PROTECTED]> > > Kristopher Austin wrote: >> RANK RULE NAME COUNT %OFRULES %OFMAIL %OFSPAM >> %OFHAM >> ------------------------------------------------------------ >> 1 HTML_MESSAGE 45870 5.13 27.72 70.37 >> 55.36 > > Wait... so 27% of all mail is HTML, 70% of spam is HTML, and 55% of ham > is HTML? > > <<jdow>> > So what's the problem? (He's not running Bayes or it's badly broken, > though.)
If 55% of HAM is HTML, and 70% of spam is HTML, then at LEAST 55% of mail must be html. Unless of course all ham + all spam is less than 100% of mail. In which case where's the magic third category that isn't ham or spam that is less than 27% HTML to drive the total percentage down to 27%?