FuzzyQuery is an extremely expensive query and I could understand how it could be slow with an index with a lot of terms.
--Peter On Tuesday, September 10, 2002, at 05:10 AM, Mader, Volker wrote: > Did you test FuzzyQuery? It is one part, which is terribly slow, when > using WildcardQuery (With wildcard at the end!), my query is m u c h > faster. > Any experience with FuzzyQuery? > >> -----Urspr�ngliche Nachricht----- >> Von: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]] >> Gesendet: Dienstag, 10. September 2002 11:09 >> An: Mader, Volker >> Cc: [EMAIL PROTECTED]; [EMAIL PROTECTED] >> Betreff: Re: AW: Performance with 5 Millions indexed items >> >> >> We used the standard demo supplied with lucene. Check out the >> lucene-demos-1.2.war source for fields and indexing classes. >> MergeFactor=10. >> Dual CPU 1.2MGhz running Linux with jdk1.4 >> >> Performed boolean searches, phrases, wildcards and word proximity. >> >> Can you send a fragment of your indexing and searching source? >> >>> It's a completely local installation. >>> We used the standard mergeFactor. >>> >>> Could you please describe your scenario? What classes/methods do you >>> use for indexing/searching? How big are your indexed fields? What is >>> your hardware? >>> >>> Sorry for this questions, but perhaps the answers help me understand >>> lucene a little more. >>> >>> Thanks >>> >>> >>>> -----Urspr�ngliche Nachricht----- >>>> Von: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]] >>>> Gesendet: Dienstag, 10. September 2002 10:01 >>>> An: [EMAIL PROTECTED] >>>> Cc: Mader, Volker >>>> Betreff: Re: Performance with 5 Millions indexed items >>>> >>>> >>>> We have been successful at loading 10 million documents with >>>> 3 fields and >>>> performing acceptable search response times (1-2secs). >>>> Using a network / mounted drive? >>>> >>>> What was your mergeFactor? >>>> >>>> >>>>> Hi, >>>>> >>>>> I've got a question about performance with "bigger" >> indexes. We used >>>>> IndexWriter with GermanAnalyzer to index data with the following >>>>> fields: >>>>> >>>>> Field1: ID (a long value) >>>>> Field2: Description (a free text) >>>>> Field3: Groups (a list of up to 10 long values encoded >> in a single >>>>> string) Field4: Classes (a list of up to 10 long values >> encoded in a >>>>> single string) >>>>> >>>>> Documents are created with the 4 fields and then added to the >>>>> Indexwriter. After all the index is optimized. >>>>> >>>>> Searching now for a word in field "Description" using >>>>> IndexSearcher(GermanAnalyzer) with FuzzyQuery leads to >>>> search times up >>>>> to 30 seconds on a Pentium 4 1,4GHz. Also the retrieval with >>>>> hits.doc(..) is very slow. >>>>> >>>>> Any ideas? >>>>> >>>>> Volker >>>>> >>>>> -- >>>>> To unsubscribe, e-mail: >>>>> <mailto:[EMAIL PROTECTED]> For >> additional >>>>> commands, e-mail: <mailto:[EMAIL PROTECTED]> >>>> >>>> >>>> >>>> >> >> >> >> > > -- > To unsubscribe, e-mail: > <mailto:[EMAIL PROTECTED]> > For additional commands, e-mail: > <mailto:[EMAIL PROTECTED]> > > -- To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]> For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>
