On Fri, Aug 04, 2006 at 09:20:41AM -0500, Dave Augustus wrote:
> >   7.162   8.3673   0.0000    1.000   0.95    3.00  T_DC_GIF_UNO_LARGO
> >   4.016   4.6920   0.0000    1.000   0.84    3.00  T_DC_IMAGE_SPAM
> >   0.666   0.7786   0.0000    1.000   0.36    4.00  T_DC_GIF_MULTI_LARGO
> >   0.576   0.6732   0.0000    1.000   0.31    3.00  T_DC_PNG_UNO_LARGO
> >   0.000   0.0000   0.0000    0.500   0.25    4.00  T_DC_PNG_MULTI_LARGO
>
> Pardon the question but how are you generating these stats?

That's the output from hit-frequencies, which reads the logs produced by mass-check. mass-check is the tool used during development (though it has other uses too) to gather information about rule hits on message corpora. Both of those tools, and many others, are in the masses directory of the tarball, along with some documentation. There's also a wiki page with quick coverage of several of the tools in there:

http://wiki.apache.org/spamassassin/MassesOverview

-- 
Randomly Generated Tagline:
The only way to learn a new programming language is by writing programs in it. - Brian Kernighan
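
For anyone who wants to work with rows like the quoted ones in a script, here is a minimal Python sketch that splits each row into named fields. The column names (overall %, spam %, ham %, S/O, rank, score, rule name) are an assumption read off the quoted numbers, not taken from the tool's documentation, and the `RuleStat`, `parse_row`, and `spam_ratio` names are made up for illustration. The `spam_ratio` helper is likewise only a guess at how the S/O column relates to the spam and ham percentages; it happens to agree with the five rows shown.

```python
from typing import NamedTuple


class RuleStat(NamedTuple):
    # Column meanings are assumed from the quoted rows, not from the tool's docs.
    overall_pct: float
    spam_pct: float
    ham_pct: float
    s_o: float
    rank: float
    score: float
    name: str


def parse_row(line: str) -> RuleStat:
    """Parse one data row, tolerating leading '> ' mail quoting."""
    fields = line.lstrip("> ").split()
    if len(fields) != 7:
        raise ValueError(f"expected 7 fields, got {len(fields)}: {line!r}")
    *numbers, name = fields
    return RuleStat(*(float(n) for n in numbers), name)


def spam_ratio(spam_pct: float, ham_pct: float) -> float:
    """Assumed S/O relation: spam share of all hits, 0.5 when there are no hits."""
    total = spam_pct + ham_pct
    return 0.5 if total == 0 else spam_pct / total


# The rows quoted earlier in this message.
SAMPLE = """\
> >   7.162   8.3673   0.0000    1.000   0.95    3.00  T_DC_GIF_UNO_LARGO
> >   4.016   4.6920   0.0000    1.000   0.84    3.00  T_DC_IMAGE_SPAM
> >   0.666   0.7786   0.0000    1.000   0.36    4.00  T_DC_GIF_MULTI_LARGO
> >   0.576   0.6732   0.0000    1.000   0.31    3.00  T_DC_PNG_UNO_LARGO
> >   0.000   0.0000   0.0000    0.500   0.25    4.00  T_DC_PNG_MULTI_LARGO
"""

stats = [parse_row(row) for row in SAMPLE.splitlines() if row.strip()]

if __name__ == "__main__":
    # List rules by spam hit rate, highest first.
    for stat in sorted(stats, key=lambda s: s.spam_pct, reverse=True):
        print(f"{stat.name:<24} spam%={stat.spam_pct:.4f} score={stat.score:.2f}")
```

Real output from the tool may carry a header line or extra columns depending on how it is run, so check the field count against what you actually get before relying on this.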