On Tue, Aug 17, 2004 at 01:15:33AM -0600, Ryan Thompson wrote: > > Has anybody else done any benchmarks using BayesStore::SQL vs DB_file? > I looked and didn't find any, so I did my own. >
Hi Ryan, I've done a little benchmarking, probably several hundred different benchmarks. I'm currently in the middle of nowhere with really bad dialup so can't elaborate much. For a quick look at some very recent benchmarks look at Bug 3331 in Bugzilla. The base case is very close to the code in 3.0.0-rc1. Bug 3331 also briefly discusses the benchmark methodology. The short answer, keeping in mind SQL is optimized for scan operations, is: SQL is twice as slow for learn operations. SQL is a few % faster for scanning via spamd SQL is ~7 times faster for expire SQL is pretty slow for forgetting SQL is a few % faster for scanning via spamassassin I'll happily answer questions about bayes in SQL, it's not for everyone but a lot of work was put in to make it as fast and useful as possible. More when I get back to some decent connectivity. Michael
