Michael Monnerie wrote:
On Montag, 3. April 2006 14:34 Lars Ringh wrote:

Now, since in each case the source data can come from two different
servers scanning the same kind of mails, should I try to merge the
bayes-data from servers home1 and home2 into the the same myqsl-db
and then merge the data from corp1 and corp2 into the other mysql-db,
or should I pick my starting sourcedata from only one server in each
pair? Would spamassassin benefit from having the greater source to
look at, or would I only be adding close-to-identical data which
would then only be expired faster than it was to merge them?


I believe you should *not* mix two different bayes DBs. Use just one, and the rest will fill up with the next SPAM jumping in...


Yes, I've done some more thinking myself and this must be the only reasonable approach.


165MB...335MB


Did you not bayes_auto_expire?


I was under the impression that i did, but since I've done some import of the data into mysql-dbs (where I am able to examine the data easier than when they are in bdb-files) I must say that I don't seem to...

A bit strange though, since the files reach this size from scratch in quite a short time, and then the file sizes stays at this size, that is they don't grow bigger than this. That's why I thought auto expire did it's work... One might suspect that I've given bayes_expiry_max_db_size some really odd value but that's not the case either...

Well, anyway, thanks for your input.

//maccall

--

lars-dot-ringh-at-bahnhof-dot-net

Reply via email to