Hi; I could find a way to achieve it when I debugged the source code. I've shared the same information at Solr mail list too. Defining a delimiter and indexing it as an individual token is the first step. Writing a regex that "matches" for given delimiter is the next step. Last step is defining the slop size. When you have a big slop size you get the whole sentence.
Thanks; Furkan KAMACI 2014-04-08 16:41 GMT+03:00 Furkan KAMACI <furkankam...@gmail.com>: > Hi; > > I could not get an answer for my question at Solr list and I wanted to ask > it here because I think that it is more Lucene specific question. > > I have indexed my documents and there is a special character sequence that > shows the end of a string. It is: > > *|* > > For example: > > The quick brown fox jumps over the lazy dog.*|*Heidi has come to the > school*|* I've never seen it before.*|* Hej *|* > > When I highlight documents I do not want to split my sentences, I want to > use that sentence information. How I do that? > > PS: Special character sequence may adjacent to a word as like school*|* or > *|*Heidi. Also it may not be adjacent too: Hej *|* > > Thanks; > Furkan KAMACI >