Martin Schoenmakers created LUCENE-5697:
-------------------------------------------

             Summary: Preview issue
                 Key: LUCENE-5697
                 URL: https://issues.apache.org/jira/browse/LUCENE-5697
             Project: Lucene - Core
          Issue Type: Bug
          Components: modules/highlighter
         Environment: DocFetcher 1.1.11 on Win 7(64) pro
            Reporter: Martin Schoenmakers


In DocFetcher, which uses Lucene, we stumbled on a bug. The lead of DocFetcher 
has investigated and foud the problem seems to be in Lucene.

Issue: we use "proximity search": search on multiple words in a directory with 
about 300 PDF files.   
E.g. search for "wordA wordB wordC"~50, so three words within 50 words distance 
of each other. The resulting documents are correct. But the highligted text in 
the document is often missing. 

If the words are in the SAME order as in the search AND on the SAME page, then 
the higlight works correct. But if the order of the words is different from the 
search (like "wordA wordC wordB" OR the words are not on the same page, then 
that text is not highlighted. 

As we use the proximity search on multiple words often, it severely
degrades the usability.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to