Alan Woodward created SOLR-13233:
------------------------------------

             Summary: SpellCheckCollator ignores stacked tokens
                 Key: SOLR-13233
                 URL: https://issues.apache.org/jira/browse/SOLR-13233
             Project: Solr
          Issue Type: Bug
      Security Level: Public (Default Security Level. Issues are Public)
            Reporter: Alan Woodward


When building collations, SpellCheckCollator ignores any tokens with a position 
increment of 0, assuming that they've been injected and may therefore have 
incorrect offsets (injected terms generally keep the offsets of the terms 
they're replacing, as they don't themselves appear anywhere in the original 
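source). However, this assumption is not necessarily correct. For example, 
WordDelimiterGraphFilter emits stacked tokens *before* the original token, 
because it needs to iterate through all stacked tokens to correctly set the 
original token's position length. In that case the original token is the one 
that arrives with a position increment of 0, so the collator skips it even 
though its offsets are correct.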
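
To make the failure mode concrete, here is a minimal sketch in plain Java. It 
does not use the Lucene or Solr APIs: the Token class, the keptTerms method and 
the two-token stream for "wi-fi" are all hypothetical, and the stream is a 
simplified illustration of "stacked token first, original second", not actual 
WordDelimiterGraphFilter output.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class StackedTokenSketch {

    // Hypothetical stand-in for a token; NOT a Lucene or Solr class.
    static final class Token {
        final String term;
        final int positionIncrement;
        final int startOffset;
        final int endOffset;

        Token(String term, int positionIncrement, int startOffset, int endOffset) {
            this.term = term;
            this.positionIncrement = positionIncrement;
            this.startOffset = startOffset;
            this.endOffset = endOffset;
        }
    }

    // Mimics the filtering described in this issue: every token with a
    // position increment of 0 is treated as injected and dropped.
    static List<String> keptTerms(List<Token> tokens) {
        List<String> kept = new ArrayList<>();
        for (Token t : tokens) {
            if (t.positionIncrement == 0) {
                continue; // assumed to be an injected token
            }
            kept.add(t.term);
        }
        return kept;
    }

    // Assumed, simplified stream for the input "wi-fi": the generated token
    // comes first, so the original token ends up with increment 0.
    static List<Token> stackedFirst() {
        return Arrays.asList(
            new Token("wifi", 1, 0, 5),   // generated (stacked) token
            new Token("wi-fi", 0, 0, 5)); // original token, same position
    }

    public static void main(String[] args) {
        System.out.println(keptTerms(stackedFirst()));
    }
}
```

With this ordering, the filter keeps only the generated term and drops the 
original one, which is the opposite of what the position-increment check is 
meant to achieve.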


