Re: A model for predicting indexing memory costs?

2009-03-14 Thread Florian Weimer
* mark harwood: Thanks, I have a heap dump now from a run with reduced JVM memory (in order to speed up a failure point) and am working through it offline with VisualVm. This test induced a proper OOM as opposed to one of those timed out waiting for GC type OOMs so may be misleading. It

Re: A model for predicting indexing memory costs?

2009-03-11 Thread Florian Weimer
* mark harwood: Could you get a heap dump (eg with YourKit) of what's using up all the memory when you hit OOM? On this particular machine I have a JRE, no admin rights and therefore limited profiling capability :( Maybe this could give you a heap dump which you can analyze on a different

Re: A model for predicting indexing memory costs?

2009-03-11 Thread mark harwood
, 11 March, 2009 10:42:33 Subject: Re: A model for predicting indexing memory costs? * mark harwood: Could you get a heap dump (eg with YourKit) of what's using up all the memory when you hit OOM? On this particular machine I have a JRE, no admin rights and therefore limited profiling capability

Re: A model for predicting indexing memory costs?

2009-03-11 Thread Michael McCandless
mark harwood wrote: Thanks, I have a heap dump now from a run with reduced JVM memory (in order to speed up a failure point) and am working through it offline with VisualVm. This test induced a proper OOM as opposed to one of those timed out waiting for GC type OOMs so may be

Re: A model for predicting indexing memory costs?

2009-03-11 Thread Mark Miller
Michael McCandless wrote: Ie, it's still not clear if you are running out of memory vs hitting some weird it's too hard for GC to deal kind of massive heap fragmentation situation or something. It reminds me of the special (I cannot be played on record player X) record (your application)

Re: A model for predicting indexing memory costs?

2009-03-11 Thread Michael McCandless
Mark Miller wrote: Michael McCandless wrote: Ie, it's still not clear if you are running out of memory vs hitting some weird it's too hard for GC to deal kind of massive heap fragmentation situation or something. It reminds me of the special (I cannot be played on record player X)

Re: A model for predicting indexing memory costs?

2009-03-11 Thread mark harwood
:56 Subject: Re: A model for predicting indexing memory costs? Mark Miller wrote: Michael McCandless wrote: Ie, it's still not clear if you are running out of memory vs hitting some weird it's too hard for GC to deal kind of massive heap fragmentation situation or something. It reminds me

Re: A model for predicting indexing memory costs?

2009-03-10 Thread mark harwood
: Re: A model for predicting indexing memory costs? mark harwood wrote: I've been building a large index (hundreds of millions) with mainly structured data which consists of several fields with mostly unique values. I've been hitting out of memory issues when doing periodic commits/closes

Re: A model for predicting indexing memory costs?

2009-03-10 Thread Ian Lea
- Original Message From: Michael McCandless luc...@mikemccandless.com To: java-user@lucene.apache.org Sent: Tuesday, 10 March, 2009 0:01:30 Subject: Re: A model for predicting indexing memory costs? mark harwood wrote: I've been building a large index (hundreds of millions

Re: A model for predicting indexing memory costs?

2009-03-10 Thread mark harwood
From: Ian Lea ian@gmail.com To: java-user@lucene.apache.org Sent: Tuesday, 10 March, 2009 10:54:05 Subject: Re: A model for predicting indexing memory costs? That's not the usual OOM message is it? java.lang.OutOfMemoryError: GC overhead limit exceeded. Looks like you might be able

RE: A model for predicting indexing memory costs?

2009-03-10 Thread Uwe Schindler
...@thetaphi.de -Original Message- From: mark harwood [mailto:markharw...@yahoo.co.uk] Sent: Tuesday, March 10, 2009 12:07 PM To: java-user@lucene.apache.org Subject: Re: A model for predicting indexing memory costs? Thanks, Ian. I forgot to mention I tried that setting

Re: A model for predicting indexing memory costs?

2009-03-10 Thread mark harwood
To: java-user@lucene.apache.org Sent: Tuesday, 10 March, 2009 11:32:48 Subject: RE: A model for predicting indexing memory costs? It does not indefinitely hang, I think the problem is, that the GC takes up all processor resources and nothing else runs any more. You should also enable the parallel

RE: A model for predicting indexing memory costs?

2009-03-10 Thread Uwe Schindler
- Original Message From: Uwe Schindler u...@thetaphi.de To: java-user@lucene.apache.org Sent: Tuesday, 10 March, 2009 11:32:48 Subject: RE: A model for predicting indexing memory costs? It does not indefinitely hang, I think the problem is, that the GC takes up all processor resources

Re: A model for predicting indexing memory costs?

2009-03-10 Thread mark harwood
: Tuesday, 10 March, 2009 12:53:19 Subject: RE: A model for predicting indexing memory costs? It does not indefinitely hang, I guess I just need to be more patient. Thanks for the GC settings. I don't currently have the luxury of 15 other processors but this will definitely be of use in other

Re: A model for predicting indexing memory costs?

2009-03-10 Thread mark harwood
Subject: Re: A model for predicting indexing memory costs? mark harwood wrote: I've been building a large index (hundreds of millions) with mainly structured data which consists of several fields with mostly unique values. I've been hitting out of memory issues when doing periodic

Re: A model for predicting indexing memory costs?

2009-03-10 Thread Erick Erickson
, Mark - Original Message From: Michael McCandless luc...@mikemccandless.com To: java-user@lucene.apache.org Sent: Tuesday, 10 March, 2009 0:01:30 Subject: Re: A model for predicting indexing memory costs? mark harwood wrote: I've been building a large index

Re: A model for predicting indexing memory costs?

2009-03-10 Thread mark harwood
out of settings to tweak here. Cheers, Mark - Original Message From: Michael McCandless luc...@mikemccandless.com To: java-user@lucene.apache.org Sent: Tuesday, 10 March, 2009 0:01:30 Subject: Re: A model for predicting indexing memory costs? mark harwood wrote

Re: A model for predicting indexing memory costs?

2009-03-10 Thread Michael McCandless
mark harwood wrote: Could you get a heap dump (eg with YourKit) of what's using up all the memory when you hit OOM? On this particular machine I have a JRE, no admin rights and therefore limited profiling capability :( That's why I was trying to come up with some formula for estimating

RE: A model for predicting indexing memory costs?

2009-03-10 Thread Jon Loken
:54 To: java-user@lucene.apache.org Subject: Re: A model for predicting indexing memory costs? That's not the usual OOM message is it? java.lang.OutOfMemoryError: GC overhead limit exceeded. Looks like you might be able to work round it with -XX:-UseGCOverheadLimit http://java-monitor.com/forum

Re: A model for predicting indexing memory costs?

2009-03-10 Thread Grant Ingersoll
On Mar 10, 2009, at 7:55 AM, mark harwood wrote: It does not indefinitely hang, I guess I just need to be more patient. Thanks for the GC settings. I don't currently have the luxury of 15 other processors but this will definitely be of use in other environments. It is also, usually

A model for predicting indexing memory costs?

2009-03-09 Thread mark harwood
I've been building a large index (hundreds of millions) with mainly structured data which consists of several fields with mostly unique values. I've been hitting out of memory issues when doing periodic commits/closes which I suspect is down to the sheer number of terms. I set the

Re: A model for predicting indexing memory costs?

2009-03-09 Thread Michael McCandless
mark harwood wrote: I've been building a large index (hundreds of millions) with mainly structured data which consists of several fields with mostly unique values. I've been hitting out of memory issues when doing periodic commits/ closes which I suspect is down to the sheer number of