Re: Text dependent analyzer

2015-04-14 Thread Ahmet Arslan
Hi Hummel, You can perform sentence detection outside of the solr, using opennlp for instance, and then feed them to solr. https://opennlp.apache.org/documentation/1.5.2-incubating/manual/opennlp.html#tools.sentdetect Ahmet On Tuesday, April 14, 2015 8:12 PM, Shay Hummel wrote: Hi I would l

[ANNOUNCE] Apache Lucene 5.1.0 released

2015-04-14 Thread Timothy Potter
14 April 2015 - The Lucene PMC is pleased to announce the release of Apache Lucene 5.1.0 The release is available for immediate download at: http://www.apache.org/dyn/closer.cgi/lucene/java/5.1.0 Lucene 5.1.0 includes 9 new features, 10 bug fixes, and 24 optimizations / other changes from 18 uniq

Span near query with payloads

2015-04-14 Thread Shay Hummel
Hi Why did you add this note in SpanPayloadCheckQuery "Do not use this with an SpanQuery that contains a SpanNearQuery . Instead, use SpanNearPayloadCheckQuery

Text dependent analyzer

2015-04-14 Thread Shay Hummel
Hi I would like to create a text dependent analyzer. That is, *given a string*, the analyzer will: 1. Read the entire text and break it into sentences. 2. Each sentence will then be tokenized, possesive removal, lowercased, mark terms and stemmed. The second part is essentially what happens in eng

SortingAtomicReader alternate to Tim-Sort...

2015-04-14 Thread Ravikumar Govindarajan
We were experimenting with SortingMergePolicy and came across an alternate solution to TimSort of postings-list using FBS & GrowableWriter. I have attached relevant code-snippet. It would be nice if someone can clarify whether it is a good idea to implement... public class SortingAtomicReader { …