IndexWriter.close() performance issue

2010-11-01 Thread Mark Kristensson
Hello, One of our Lucene indexes has started misbehaving on indexWriter.close and I'm searching for ideas about what may have happened and how to fix it. Here's our scenario: - We have seven Lucene indexes that contain different sets of data from a web application are indexed for searching by

Re: IndexWriter.close() performance issue

2010-11-02 Thread Mark Kristensson
grade. > > The answers above is relevant for 3x, but most of them are also relevant for > 2.9. If you have an older version of Lucene, then some of the solutions > might still apply (such as close(false)). > > Hope this helps, > Shai > > On Tue, Nov 2, 2010 at 12:55 AM, Mark

Re: IndexWriter.close() performance issue

2010-11-03 Thread Mark Kristensson
point and am thinking I may have to rebuild the index, though I would definitely prefer to avoid doing that and would like to know why this is happening. Thanks for your help, Mark On Nov 2, 2010, at 9:26 AM, Mark Kristensson wrote: > > Wonderful information on what happens during indexW

Re: IndexWriter.close() performance issue

2010-11-03 Thread Mark Kristensson
t; > On Wed, Nov 3, 2010 at 2:53 PM, Mark Kristensson > wrote: >> I've successfully reproduced the issue in our lab with a copy from >> production and have broken the close() call into parts, as suggested, with >> one addition. >> >> Previously, the call

Re: IndexWriter.close() performance issue

2010-11-03 Thread Mark Kristensson
unique fields names in an index or segment? Any suggestions for potentially mitigating the problem? Thanks, Mark On Nov 3, 2010, at 2:02 PM, Michael McCandless wrote: > On Wed, Nov 3, 2010 at 4:27 PM, Mark Kristensson > wrote: >> >> I've run checkIndex against the in

Re: IndexWriter.close() performance issue

2010-11-05 Thread Mark Kristensson
this'd prevent big segments from being produced. > Disabling norms should also workaround this, though that will affect > hit scores... > > Mike > > On Wed, Nov 3, 2010 at 7:37 PM, Mark Kristensson > wrote: >> Yes, we do have a large number of unique field names in

Re: IndexWriter.close() performance issue

2010-11-17 Thread Mark Kristensson
Reader? (Eg, run java with > -agentlib:hprof=cpu=samples,depth=16,interval=1). > > Yes, both Index.NOT_ANALYZED_NO_NORMS and Index.NO will disable norms > as long as no document in the index ever had norms on (yes it does > "infect" heh). > > Mike > > On Fri,

Re: IndexWriter.close() performance issue

2010-11-17 Thread Mark Kristensson
uch a huge number of >> unique field names is not an area of Lucene that's very well tested... >> perhaps there's something silly somewhere. Maybe you can try >> profiling just the init of your IndexReader? (Eg, run java with >> -agentlib:hprof=cpu=samples,dept

Re: IndexWriter.close() performance issue

2010-11-18 Thread Mark Kristensson
ike >>>> >>>> On Wed, Nov 3, 2010 at 7:37 PM, Mark Kristensson >>>> wrote: >>>>> Yes, we do have a large number of unique field names in that index, >>>>> because they are driven by user named fields in our application (with

Re: IndexWriter.close() performance issue

2010-11-19 Thread Mark Kristensson
lly somewhere. Maybe you can try >> profiling just the init of your IndexReader? (Eg, run java with >> -agentlib:hprof=cpu=samples,depth=16,interval=1). >> >> Yes, both Index.NOT_ANALYZED_NO_NORMS and Index.NO will disable norms >> as long as no document in the inde

Re: IndexWriter.close() performance issue

2010-11-23 Thread Mark Kristensson
that be instead of monkeying with StringHelper or in addition to it? Thanks, Mark On Nov 20, 2010, at 5:44 AM, Yonik Seeley wrote: > On Fri, Nov 19, 2010 at 5:41 PM, Mark Kristensson > wrote: >> Here's the changes I made to org.apache.lucene.util.StringHelper: >> >> //

Re: Not query help.

2010-11-23 Thread Mark Kristensson
I've had to deal with exactly this same scenario and, from what I know, there is no equivalent in Lucene to a SQL != kind of query. So, you have to have some way to select "everything" (whatever that means in your scenario) and then remove the stuff that you don't want (the NOT portion). For my app

Directory objects for index

2010-11-29 Thread Mark Kristensson
While implementing a solution for keeping warmed indexReaders around for our various indexes (so users don't have to wait while we open an indexReader for our one slow index), I've run into some serious problems trying to track down all of the outstanding file handles to my indexes. For a while

Re: IndexWriter.close() performance issue

2011-02-22 Thread Mark Kristensson
eld names might cause a problem opening an index reader. However, I'm not sure why closing (committing changes to) an index writer would have such a problem. Why is that? Thank you! Mark On Tue, Nov 23, 2010 at 2:22 PM, Mark Kristensson < mark.kristens...@smartsheet.com> wrote: > I&