Re: TotalHitCountCollector performance

2013-09-19 Thread Nicola Buso
Hi Uwe, thanks for the fast reply. I removed the CSQ and checked again my test, still there is mainly no difference with searching with TopFieldCollector. Just a clarification; the method getQuery(...) in my code is returning a BooleanQuery, where the user input is expanded on some of the fields

Re: posting list strings

2013-09-19 Thread SachinMB
I wanted to know whether the terms are hashed before putting them in the term dictionary in Lucene. Also, does Earlybird use Hashes to retrieve the documents which contain the searched term. The Earlybird Paper [1] does talk about indexing the terms but can you tell me where exactly is the hashing

Lucene for Log file indexing and search

2013-09-19 Thread gudiseashok
Hi I am learning lucene, I am developing an application do do a search in log files in multi-environment boxes, I have googled for the deeper understanding, but all examples were just referring for just field "File Name" & "Modification (i.e. fieldtypes associated with text search) and they are

Re: Lucene for Log file indexing and search

2013-09-19 Thread Paul Libbrecht
Ashok, I would look at solr which has an amount more field types to support more queries. E.g. there you have a nice query syntax for times-spans and fantastic caching. I think there's very few initiatives for indexing logs and I would be interested to see the results of your entreprise. paul

Re: A question about "seek past EOF: MMapIndexInput"

2013-09-19 Thread hao yan
Hi, Adrien thanks for the quick replay. Let me try it out. hao On Wed, Sep 18, 2013 at 3:18 AM, Adrien Grand wrote: > Hi, > > This means that there is either a bug in Lucene or that your index is > corrupted. Can you reproduce this failure if you reindex data? The > output of CheckIndex would

Custom FieldInfo.IndexOptions

2013-09-19 Thread Marcos Juarez Lopez
Hi, I'm trying to optimize an index we have, and one thing that has come up recently is that we're not really using term frequencies, and we don't need any scoring. We noticed that the term frequencies (.doc files) are a significant chunk of the total index size, and we'd like to reduce those, or

Lucene ToParentBlockJoin problems

2013-09-19 Thread krithika r
Hi, I'm trying to do something very simple with the parent/child blockjoinquery. I have a several child docs and a parent doc added to index in the same order. There are 3 fields + filter field for the parent doc. And 2 fields in child doc. I'm trying to get child document matching a field X :

Re: Lucene for Log file indexing and search

2013-09-19 Thread Ivan Krišto
On 09/19/2013 07:41 PM, gudiseashok wrote: > I am learning lucene, I am developing an application do do a search in log > files in multi-environment boxes, I have googled for the deeper > understanding, but all examples were just referring for just field "File > Name" & "Modification (i.e. fieldtyp