On Tue, Aug 17, 2004 at 01:15:33AM -0600, Ryan Thompson wrote:
> 
> Has anybody else done any benchmarks using BayesStore::SQL vs DB_file?
> I looked and didn't find any, so I did my own.
> 

Hi Ryan,

I've done a little benchmarking, probably several hundred different
benchmarks.  I'm currently in the middle of nowhere with really bad
dialup, so I can't elaborate much.  For a quick look at some very
recent benchmarks, see Bug 3331 in Bugzilla.  The base case is very
close to the code in 3.0.0-rc1.  Bug 3331 also briefly discusses the
benchmark methodology.

The short answer, keeping in mind SQL is optimized for scan
operations, is:

SQL is twice as slow for learn operations.
SQL is a few percent faster for scanning via spamd.
SQL is ~7 times faster for expire.
SQL is pretty slow for forgetting.
SQL is a few percent faster for scanning via spamassassin.

I'll happily answer questions about Bayes in SQL.  It's not for
everyone, but a lot of work was put in to make it as fast and useful
as possible.

More when I get back to some decent connectivity.

Michael
