Is AnalyzingSuggester thread safe?

2016-07-28 Thread Lin Guo
We are using version 4.5.0. Many thanks!

Re: get enumeration of all terms starting at a given term after lucene 4

2016-07-28 Thread Greg Bowyer
I am confused by your example; MultiFields.get allows you to ask for a specific field. On Thu, Jul 28, 2016, at 09:00 AM, Mukul Ranjan wrote: > Hi All, > > How can I get an enumeration of all terms starting at a given term? I have > upgraded Lucene from 3.6 to 5.5.2. After 3.6, >

Re: Indexing and storing Long fields

2016-07-28 Thread Kumaran Ramasubramanian
OK Mike, thanks for the explanation. I have another doubt: I read in some article that we can have one stored field and one doc values field with the same field name. Is that so? -- Kumaran R On Thu, Jul 28, 2016 at 9:29 PM, Michael McCandless < luc...@mikemccandless.com> wrote: > OK, sorry, you

get enumeration of all terms starting at a given term after lucene 4

2016-07-28 Thread Mukul Ranjan
Hi All, How can I get an enumeration of all terms starting at a given term? I have upgraded Lucene from 3.6 to 5.5.2. After 3.6, the IndexReader terms API, which used to give a list of terms, was removed. I have used the code below to get the termEnum, but it has no option to pass the value

Re: Indexing and storing Long fields

2016-07-28 Thread Michael McCandless
OK, sorry, you cannot change how the field is indexed for the same field name across different field indices. Lucene will "downgrade" that field to the lowest settings, e.g. "docs, no positions" in your case. Mike McCandless http://blog.mikemccandless.com On Thu, Jul 28, 2016 at 9:31 AM,

Re: BufferedUpdateStreams breaks high performance indexing

2016-07-28 Thread Michael McCandless
Hmm, your merge policy changes are dangerous: that will cause too many segments in the index, which makes it longer to apply deletes. Can you revert that and re-test? I'm not sure why DIH is using updateDocument instead of addDocument ... maybe ask on the solr-user list? Mike McCandless

setRAMBufferSizeMB and setRAMPerThreadHardLimitMB

2016-07-28 Thread Gimantha Bandara
Hi all, Can someone explain what these methods do? Why do we have two different methods, one per thread and one for all the documents? The default value for RAMBufferSize is 16 MB and the PerThread value is 1945 MB. What will happen if I set BufferSize to 2048? Will the docs be flushed to the directory when

Re: BufferedUpdateStreams breaks high performance indexing

2016-07-28 Thread Bernd Fehling
Currently I use concurrent DIH but will write some SolrJ for testing or even as a replacement for DIH. I don't know what's behind DIH if only documents are added. I have not tried any newer release yet, but after reading LUCENE-6161 I really should. At least a version > 5.1. Maybe before writing some SolrJ.

Re: BufferedUpdateStreams breaks high performance indexing

2016-07-28 Thread Michael McCandless
Hmm not good. If you are really only adding documents, you should be using IndexWriter.addDocument, which won't buffer any deleted terms and that method call should be a no-op. It also makes flushes more efficient since all of your indexing buffer goes to the added documents, not buffered delete

Re: Indexing and storing Long fields

2016-07-28 Thread Kumaran Ramasubramanian
Hi Mike, For your information, I am using Lucene 4.10.4. Am I missing anything? -- Kumaran R On Wed, Jul 27, 2016 at 1:52 AM, Kumaran Ramasubramanian wrote: > > Hi Mike, > > 1. If we index one field as analyzed and not analyzed using the same name, > phrase queries are

Re: Order of StoredFieldVisitor.*Field calls?

2016-07-28 Thread Michael McCandless
I suspect the visitor will be called in order that the stored fields were originally added to the Document at indexing time, but I'm not sure whether this is guaranteed behavior across Lucene releases. Mike McCandless http://blog.mikemccandless.com On Thu, Jul 28, 2016 at 6:35 AM, Andreas Sewe

Order of StoredFieldVisitor.*Field calls?

2016-07-28 Thread Andreas Sewe
Hi, I am currently looking at the StoredFieldVisitor visitor to make the conversion from Documents into my own search results more efficient. My current code looks like this: Document doc = searcher.doc(id); ISearchResult result = convert(doc); My convert(Document) method creates, depending

BufferedUpdateStreams breaks high performance indexing

2016-07-28 Thread Bernd Fehling
While trying to get higher indexing performance, it turned out that BufferedUpdateStreams is breaking indexing performance: public synchronized ApplyDeletesResult applyDeletesAndUpdates(...) In IndexWriterConfig I have setRAMBufferSizeMB=1024 and the Lucene 4.10.4 API states: "Determines the