On Tue, 9 Sep 2008, Martin Bachwerk wrote:
> Hello again,
>
> The index is kinda large indeed, even though I have Field.Store.NO set
> for the actual content (ok, the documents are 2-3k on average, but they
> could still be smaller).
>
> The memory use just keeps growing and growing. It doesn't go into a
> critical area, but it ate up 800 MB out of the 1024 MB I have in some
> 15 minutes; after that it stayed stable. I guess this would be
> acceptable, but I don't quite understand why it is the case.
If it stabilized, it could just mean that this is the memory Java Lucene
needs to work with your index. Have you tried reducing the maximum heap
size, so that the process uses less memory but garbage-collects more often?
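For reference, with JCC-based PyLucene builds the JVM heap limits can be passed when the VM is started. This is a minimal sketch, assuming JCC's `initVM` keyword arguments (`initialheap`, `maxheap`); check the signature shipped with your PyLucene version:

```python
# Sketch: capping the JVM heap from PyLucene (JCC-based builds).
# Keyword names below are assumed from JCC's initVM; adjust as needed.
import lucene

# A smaller max heap forces the JVM to garbage-collect sooner, trading
# some CPU time for a lower resident memory footprint.
lucene.initVM(initialheap='64m', maxheap='512m')
```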
> The arrays are pretty much dependent on the term (i.e. the word). For
> words like "is" they are around the size of the number of documents;
> for rare words they can be 1-2-3 entries long.
>
> I don't have Java code to test all this, sorry.
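The sizes described above are what you would expect from inverted-index posting lists. A toy illustration in plain Python (not Lucene's actual data structures): a common term collects one entry per matching document, while a rare term's array stays tiny.

```python
# Toy inverted index: map each term to the list of documents containing it.
docs = [
    "lucene is a search library",
    "pylucene is a python wrapper",
    "memory profiling is hard",
    "rareword",  # a term that appears in only one document
]

postings = {}
for doc_id, text in enumerate(docs):
    for term in set(text.split()):
        postings.setdefault(term, []).append(doc_id)

print(len(postings["is"]))        # common term -> one entry per matching doc: 3
print(len(postings["rareword"]))  # rare term -> 1
```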
It could be written :) It's pretty much a one-to-one mapping for the API
calls. That is what I would do next to isolate this, if I were to debug
it further right now.
Andi..
_______________________________________________
pylucene-dev mailing list
pylucene-dev@osafoundation.org
http://lists.osafoundation.org/mailman/listinfo/pylucene-dev