https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6046
--- Comment #5 from Mark Martinec <[email protected]> 2009-01-22 07:31:13 PST --- > Well we don't have to do the dance of move-and-rebuild that DBM does, in part > because we are able to create a secondary index on atime that makes it very > easy to estimate whether we would expire too many tokens in a given run. So I > would expect it to be as efficient as the SQL back-ends in that respect. Good to hear. > Still, it's a good question that would not be answered out of the box by the > benchmark code, so I've changed the local.cf files in the benchmark directory > to remove bayes_auto_expire 0, and I'll re-run and see what the results look > like. Thanks! > I'm curious, which SQL back-end are you using? Mail::SpamAssassin::BayesStore::MySQL > And if it's MySQL, do you have any performance tuning tips? Initially not. Later I added (/etc/my.cnf) : [mysqld] bind=127.0.0.1 key_buffer_size=60M innodb_buffer_pool_size=384M innodb_log_buffer_size=6M innodb_flush_log_at_trx_commit=0 max_connections=60 based on some tips from: http://www.mysqlperformanceblog.com/2006/09/29/what-to-tune-in-mysql-server-after-installation/ This is with MySQL 5.1.24 InnoDB (current size 5.5 GB), on FreeBSD, SA 3.3; currently bayes_token has 1M records, bayes_seen has 26M records (I know, I need to ditch bayes_seen and start it from scratch). Initially I used MyISAM and Mail::SpamAssassin::BayesStore::SQL, which would get me in trouble every now and then, requiring a REPAIR TABLE. Now with InnoDB and the dedicated BayesStore::MySQL it never again got me into trouble in two years. > I've been a little startled to find that PgSQL is > actually outperforming MySQL on my benchmarks (given their reps) > but then I know how to tune PgSQL well; I'm more ignorant about MySQL. I was running Bayes for a while on PostgreSQL 8.2 using Mail::SpamAssassin::BayesStore::PgSQL, but the SELECT ... IN (...) with a large set of tokens in the IN-set was quite slow, much slower than with MySQL. I'm still using PostgreSQL for everything else except Bayes, i.e. for AWL, and for amavisd-new SQL logging / pen pals database, which is quite large and outperforms MySQL, especially on purging. -- Configure bugmail: https://issues.apache.org/SpamAssassin/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are the assignee for the bug.
