On 18 Jan 2010, at 22:14, Daniel Kluesing wrote:
Hi,
I'm running the 5.3 release as a standalone broker. In one case, a
producer is running without a consumer, producing small, persistent
messages, with the FileCursor pendingQueuePolicy (per https://issues.apache.org/activemq/browse/AMQ-2512)
and the flow-control memoryLimit set to 100MB for the queue in
question (through a policy entry).
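For reference, a sketch of what the policy entry described above looks like in activemq.xml (the queue name is a placeholder; the FileCursor pendingQueuePolicy is selected with the fileQueueCursor element):

```xml
<!-- Sketch of the relevant per-destination policy; the queue name is a
     placeholder. The fileQueueCursor element selects the file-based
     pending-message cursor for queues. -->
<destinationPolicy>
  <policyMap>
    <policyEntries>
      <policyEntry queue="MY.QUEUE" memoryLimit="100mb" producerFlowControl="true">
        <pendingQueuePolicy>
          <fileQueueCursor/>
        </pendingQueuePolicy>
      </policyEntry>
    </policyEntries>
  </policyMap>
</destinationPolicy>
```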
As the queue grows above 300k messages, KahaDB indexing starts
climbing above 1 second. At around 350k messages, the indexing is
taking over 8 seconds. At this point I start getting Java
out-of-heap-space errors in essentially random parts of the code.
After a while the producers time out with a "channel inactive for too
long" error, and the entire broker basically wedges itself. At this
point, consumers are generally unable to bind to the broker, quitting
with timeout errors. When they can connect, consuming a single message
triggers an index rebuild, which takes 2-8 seconds. With verbose
garbage collection turned on, the JVM is collecting like mad but
reclaiming no space.
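For anyone reproducing this, verbose GC output can be enabled on the broker JVM roughly like this (a sketch: the flags are standard Sun JVM options of that era, and ACTIVEMQ_OPTS is the usual hook in the broker startup script):

```shell
# Sketch: pass verbose-GC flags to the broker JVM via the standard
# ACTIVEMQ_OPTS hook, then start the broker as usual.
export ACTIVEMQ_OPTS="-verbose:gc -XX:+PrintGCDetails -XX:+PrintGCTimeStamps"
bin/activemq
```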
If I restart the broker, it comes back up, I can consume the old
messages, and it can handle another 350k messages before it wedges
again. I can reproduce this under both the default GC and the
incremental GC.
Two questions:
- It seems like someone is holding onto a handle to the messages
after they have been persisted to disk - is this a known issue?
Should I open a JIRA for it? (Or is there another explanation?)
- Is there any documentation about the internals of KahaDB - the
kind of indices etc? I'd like to get a better understanding of the
index performance and in general how KahaDB compares to something
like BerkeleyDB.
Thanks
There is some confusion over the naming of our persistence options
that doesn't help. There is Kaha - which uses multiple log files and a
hash-based index, and is currently used by the FileCursor - whilst
KahaDB is a newer implementation, which is more robust and typically
uses a BTreeIndex. (There is currently a new implementation of the
FileCursor, btw - but that's a different matter.) You can't currently
configure the HashIndex via the FileCursor - but it looks like this is
the problem you are encountering: you need to increase the maximum
number of hash buckets.
So I would recommend one of the following:
1. Use the default pendingQueuePolicy (which only uses a FileCursor
for non-persistent messages, and uses the underlying database for
persistent messages).
2. Try KahaDB - which, with the BTreeIndex, will not hit the
problems you are seeing with the FileCursor.
3. Or increase the maximum number of hash buckets for the FileCursor
index, by setting a Java system property - maximumCapacity - to 65536
(the default is 16384).
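Concretely, option 2 is a one-element change in activemq.xml - a sketch, with the data directory as a placeholder:

```xml
<!-- Sketch, option 2: switch the broker's store to KahaDB, which uses
     a BTreeIndex. The directory value is a placeholder. -->
<persistenceAdapter>
  <kahaDB directory="${activemq.base}/data/kahadb"/>
</persistenceAdapter>
```

Option 3 is just a JVM argument in the broker's startup environment, e.g. adding -DmaximumCapacity=65536 to ACTIVEMQ_OPTS.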
cheers,
Rob
http://twitter.com/rajdavies
I work here: http://fusesource.com
My Blog: http://rajdavies.blogspot.com/
I'm writing this: http://www.manning.com/snyder/