On Jun 10, 2005, at 9:33 AM, Chris Collins wrote:

> How many documents did you try to index?

Only about 4000 at the moment.

> I am using a relatively large minMergeDocs that causes me to run out of
> memory when I make such a change. (I am using 1/2 GB of heap, btw.)

I was running out of memory as well until I gave Java a larger heap to work with. I am assuming that a dedicated indexing (as well as search) machine is going to need a mountain of memory; I figure I will be giving Java gigs to play with.
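For reference, this is roughly what I mean (the heap sizes and the class name here are just placeholders, not a recommendation):

    java -Xms512m -Xmx2048m org.example.MyIndexer

-Xmx caps the heap and -Xms sets the starting size, so the VM isn't stuck growing it in the middle of a big merge.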

> I believe changing it in the OutputStream object means that a lot of
> in-memory-only objects use that size too.

This I need to look into. At a guess, there would be an OutputStream object for each open segment, and for each file in that segment. A consolidated index *might* use fewer, but of course we are trying to improve performance here, and consolidating the index incurs its own cost. Assuming 10 segments and 10 files within each segment, that's 100 OutputStream objects; at 8,096 bytes of buffer apiece, that's 809,600 bytes, and it grows quickly with maxMerge tweaks. The larger writes do save a bunch of system calls and make (maybe) better use of your filer's block size. Of course this could be utterly incorrect; I need to look into it a bit more carefully.
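If it helps to see that arithmetic, here is a throwaway sketch (the segment/file counts and the buffer size are the assumptions above, not measured values):

    // Rough estimate of buffer memory held by open OutputStreams.
    public class BufferMemoryEstimate {
        public static void main(String[] args) {
            int segments = 10;        // assumed open segments
            int filesPerSegment = 10; // assumed files per segment
            int bufferSize = 8096;    // assumed per-stream buffer, in bytes
            int streams = segments * filesPerSegment;
            System.out.println(streams + " streams x " + bufferSize
                + " bytes = " + (streams * bufferSize) + " bytes");
        }
    }

Bump segments or bufferSize to see how quickly the footprint moves with the merge settings.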

> I don't know that I would have used truss in this regard; it only points
> out what size hit the kernel, not what went over the wire. I would
> suggest using Ethereal to ensure that's how it's ending up on the wire.

True, hadn't gotten that far yet. :-)
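For when I do: on Solaris, something like

    truss -t write -p <pid>

(with <pid> being the indexing JVM's process id) should show the sizes of the writes hitting the kernel, and Ethereal on the same box would show what actually makes it onto the wire. Untested on my end, so treat it as a sketch.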

Peter


