Mike, I've not tried it yet, but I think the problem can be reproduced. However, it'll take a few hours to reach that threshold, since my code also needs to extract text from some very large PDF documents to store in the index.
I'll post pseudo-code for my program tomorrow. Maybe that'll help point out mistakes in my logic.

Suman

On 11/27/06, Michael McCandless <[EMAIL PROTECTED]> wrote:
Suman Ghosh wrote:
> On 11/27/06, Yonik Seeley <[EMAIL PROTECTED]> wrote:
>> On 11/27/06, Suman Ghosh <[EMAIL PROTECTED]> wrote:
>> > Here are the values:
>> >
>> > mergeFactor=10
>> > maxMergeDocs=100000
>> > minMergeDocs=100
>> >
>> > And I see your point. At the time of the crash, I have over 5000
>> > segments. I'll try some conservative number and try to rebuild the
>> > index.
>>
>> Although I don't see how those settings can produce 5000 segments,
>> I've developed a non-recursive patch you might want to try:
>> https://issues.apache.org/jira/browse/LUCENE-729

Suman, I'd really like to understand how you're getting so many segments in your index. Is this (getting 5000 segments) easy to reproduce? Are you closing / reopening your writer every so often (eg to delete documents or something)?

Mike

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]