Re: Lucene 5.0.0 - StringField and Sorting

2016-04-21 Thread Gimantha Bandara
Hi Torsten, Did you find a solution for this? I am having the same issue. I am planning to create a custom Field with DocValuesType.SORTED. Is there any other way to do that without creating a custom Field? On Fri, Mar 6, 2015 at 3:34 PM, Torsten Krah wrote: > Hi, > > looking at the JavaDoc of

Re: 500 millions document for loop.

2016-04-21 Thread Valentin Popov
Chris, hello. Thanks for the tip, but could you explain how I can use it? Regards, Valentin. > On 16 Nov 2015, at 0:42, Chris Hostetter > wrote: > > > : public void collect(int docID) throws IOException { > : Document doc = indexSearcher

Question on Lucene Behavior in 4.9 vs 5.4.1

2016-04-21 Thread Jeremy Glesner
Hello, I'm witnessing a change in behavior between Lucene 4.9 and 5.4.1 that I don't quite understand. I'd like to track down what's happening under the hood. I'm working to update the dependencies of an open source geospatial resolution tool ( https://github.com/Berico-Technologies/CLAVIN), which

Log indexing with lucene performance issues

2016-04-21 Thread Hamed Ghavamnia
Hello, We've created a log management system using Lucene 4.3. Each log has about 10 fields and all of them are stored. We store each hour of the logs in a separate folder, so when someone runs a query only the folders specified in the time frame are searched. The indexes are loaded using the mmap

Re: 500 millions document for loop.

2016-04-21 Thread Erick Erickson
Actually, this looks like a fine place to use Streaming Aggregation/Streaming Expressions. Those operate off of docValues fields anyway, so you kind of get all this "for free". I don't see the Solr version, though; much of this is in later 5.x versions. Pull down the Solr Reference Guide for the ver

Re: 500 millions document for loop.

2016-04-21 Thread Sheng
If you don't care about search, why not just use the reader to traverse? Establish a for loop from 0 to reader.maxDoc() - 1, and filter the documents using MultiFields. You can even bucket this procedure, and run your statistics calc in parallel. On Thursday, November 12, 2015, Valentin Popov wrote:
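
The bucketing idea in this reply can be sketched in plain Java, without any Lucene dependency. This is only an illustration under stated assumptions: the class `BucketedDocScan`, its methods, and the `perDoc` callback are hypothetical names, and `perDoc` stands in for whatever per-document value would be read from the index. Skipping deleted documents and the thread safety of the callback are left to the caller.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.IntToLongFunction;

// Hypothetical helper, not part of Lucene: splits a doc ID range into
// buckets and aggregates a per-document value over them in parallel.
final class BucketedDocScan {

    // Splits [0, maxDoc) into contiguous [start, end) ranges holding
    // at most bucketSize doc IDs each.
    static List<int[]> buckets(int maxDoc, int bucketSize) {
        if (bucketSize <= 0) {
            throw new IllegalArgumentException("bucketSize must be positive");
        }
        List<int[]> ranges = new ArrayList<>();
        int start = 0;
        while (start < maxDoc) {
            // Compute the end in long arithmetic to avoid int overflow.
            int end = (int) Math.min((long) start + bucketSize, maxDoc);
            ranges.add(new int[] {start, end});
            start = end;
        }
        return ranges;
    }

    // Sums perDoc(docID) over every docID in [0, maxDoc). Each bucket is
    // scanned sequentially; different buckets may run on different threads.
    // perDoc is an assumed stand-in for the real per-document lookup.
    static long sumOverDocs(int maxDoc, int bucketSize, IntToLongFunction perDoc) {
        return buckets(maxDoc, bucketSize).parallelStream()
                .mapToLong(range -> {
                    long partial = 0;
                    for (int docID = range[0]; docID < range[1]; docID++) {
                        partial += perDoc.applyAsLong(docID);
                    }
                    return partial;
                })
                .sum();
    }
}
```

With a real index, the first argument would presumably be reader.maxDoc(), and the callback would look up the value for each docID. A sum is used here only because it combines trivially across buckets; other statistics would need their own merge step.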