Re: AW: AW: Performance with 5 Millions indexed items

Peter Carlson Tue, 10 Sep 2002 08:28:25 -0700

FuzzyQuery is an extremely expensive query and I could understand how 
it could be slow with an index with a lot of terms.


--Peter


On Tuesday, September 10, 2002, at 05:10 AM, Mader, Volker wrote:

> Did you test FuzzyQuery? It is one part, which is terribly slow, when 
> using WildcardQuery (With wildcard at the end!), my query is  m u c h  
> faster.
> Any experience with FuzzyQuery?
>
>> -----Urspr�ngliche Nachricht-----
>> Von: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]]
>> Gesendet: Dienstag, 10. September 2002 11:09
>> An: Mader, Volker
>> Cc: [EMAIL PROTECTED]; [EMAIL PROTECTED]
>> Betreff: Re: AW: Performance with 5 Millions indexed items
>>
>>
>> We used the standard demo supplied with lucene.  Check out the
>> lucene-demos-1.2.war source for fields and indexing classes.
>> MergeFactor=10.
>> Dual CPU 1.2MGhz running Linux with jdk1.4
>>
>> Performed boolean searches, phrases, wildcards and word proximity.
>>
>> Can you send a fragment of your indexing and searching source?
>>
>>> It's a completely local installation.
>>> We used the standard mergeFactor.
>>>
>>> Could you please describe your scenario? What classes/methods do you
>>> use for indexing/searching? How big are your indexed fields? What is
>>> your hardware?
>>>
>>> Sorry for this questions, but perhaps the answers help me understand
>>> lucene a little more.
>>>
>>> Thanks
>>>
>>>
>>>> -----Urspr�ngliche Nachricht-----
>>>> Von: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]]
>>>> Gesendet: Dienstag, 10. September 2002 10:01
>>>> An: [EMAIL PROTECTED]
>>>> Cc: Mader, Volker
>>>> Betreff: Re: Performance with 5 Millions indexed items
>>>>
>>>>
>>>> We have been successful at loading 10 million documents with
>>>> 3 fields and
>>>> performing acceptable search response times (1-2secs).
>>>> Using a network / mounted drive?
>>>>
>>>> What was your mergeFactor?
>>>>
>>>>
>>>>> Hi,
>>>>>
>>>>> I've got a question about performance with "bigger"
>> indexes. We used
>>>>> IndexWriter with GermanAnalyzer to index data with the following
>>>>> fields:
>>>>>
>>>>> Field1: ID (a long value)
>>>>> Field2: Description (a free text)
>>>>> Field3: Groups (a list of up to 10 long values encoded
>> in a single
>>>>> string) Field4: Classes (a list of up to 10 long values
>> encoded in a
>>>>> single string)
>>>>>
>>>>> Documents are created with the 4 fields and then added to the
>>>>> Indexwriter. After all the index is optimized.
>>>>>
>>>>> Searching now for a word in field "Description" using
>>>>> IndexSearcher(GermanAnalyzer) with FuzzyQuery leads to
>>>> search times up
>>>>> to 30 seconds on a Pentium 4 1,4GHz. Also the retrieval with
>>>>> hits.doc(..) is very slow.
>>>>>
>>>>> Any ideas?
>>>>>
>>>>> Volker
>>>>>
>>>>> --
>>>>> To unsubscribe, e-mail:
>>>>> <mailto:[EMAIL PROTECTED]> For
>> additional
>>>>> commands, e-mail: <mailto:[EMAIL PROTECTED]>
>>>>
>>>>
>>>>
>>>>
>>
>>
>>
>>
>
> --
> To unsubscribe, e-mail:   
> <mailto:[EMAIL PROTECTED]>
> For additional commands, e-mail: 
> <mailto:[EMAIL PROTECTED]>
>
>


--
To unsubscribe, e-mail:   <mailto:[EMAIL PROTECTED]>
For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>

Re: AW: AW: Performance with 5 Millions indexed items

Reply via email to