Itamar Syn-Hershko created LUCENE-5011:
------------------------------------------

             Summary: MemoryIndex and FVH don't play along with multi-value 
fields
                 Key: LUCENE-5011
                 URL: https://issues.apache.org/jira/browse/LUCENE-5011
             Project: Lucene - Core
          Issue Type: Bug
    Affects Versions: 4.3
            Reporter: Itamar Syn-Hershko


When multi-value fields are indexed to a MemoryIndex, positions are computed 
correctly on search but the start and end offsets and the values array index 
aren't correct.

Comparing the same execution path for IndexReader on a Directory impl  and 
MemoryIndex (same document, same query, same analyzer, different Index impl), 
the difference first shows in FieldTermStack.java line 125:

termList.add( new TermInfo( term, dpEnum.startOffset(), dpEnum.endOffset(), 
pos, weight ) );

dpEnum.startOffset() and dpEnum.endOffset don't match between implementations.

This looks like a bug in MemoryIndex, which doesn't seem to handle tokenized 
multi-value fields all too well when positions and offsets are required.

I should also mention we are using an Analyzer which outputs several tokens at 
a position (a la SynonymFilter), but I don't believe this is related.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to