Re: Return the sentence number in the indexed files

2008-07-20 Thread starz10de
thanks Grant for the answer, to index each sentence as a separate document , i already did this and it work fine, i indexed more than 93000 sentences (Documents) approx. in 11 minutes. I though the other option might be more efficient. Farag Grant Ingersoll-6 wrote: > > > On Jul 19, 2008,

Re: Boolean expression for no terms OR matching a wildcard

2008-07-20 Thread Ronald Rudy
A query solution is preferable.. but I can programmatically filter my results after the fact, it just seems like something that the Lucene team should consider adding.. I think it would only have value for wildcard queries, but nonetheless it would have some value I think.. -Ron On Jul 18

Re: Doesn it make sense cache IndexReader?

2008-07-20 Thread Mark Miller
With very small indexes and no sort fields (eg. you just use relevance) loading an IndexReader does not take very long. I think it does always make sense to cache it and reuse it though - unless the index has changed, there is no reason to pay the price of opening a new IndexReader. As your ind

Re: How to avoid duplicate records in lucene

2008-07-20 Thread Mark Miller
Sebastin wrote: Hi All, Is there any possibility to avoid duplicate records in lucene 2.3.1? I don't believe that there is a very high performance way to do this. You are basically going to have to query the index for an id before adding a new doc. The best way I can think of off the top