[ https://issues.apache.org/jira/browse/LUCENE-8365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Simon Willnauer reassigned LUCENE-8365: --------------------------------------- Assignee: Simon Willnauer > ArrayIndexOutOfBoundsException in UnifiedHighlighter > ---------------------------------------------------- > > Key: LUCENE-8365 > URL: https://issues.apache.org/jira/browse/LUCENE-8365 > Project: Lucene - Core > Issue Type: Bug > Components: modules/highlighter > Affects Versions: 7.3.1 > Reporter: Marc Morissette > Assignee: Simon Willnauer > Priority: Major > Time Spent: 10m > Remaining Estimate: 0h > > We see ArrayIndexOutOfBoundsExceptions coming out of the UnifiedHighlighter > in our production logs from time to time: > {code} > java.lang.ArrayIndexOutOfBoundsException > at java.base/java.lang.System.arraycopy(Native Method) > at > org.apache.lucene.search.uhighlight.PhraseHelper$SpanCollectedOffsetsEnum.add(PhraseHelper.java:386) > at > org.apache.lucene.search.uhighlight.PhraseHelper$OffsetSpanCollector.collectLeaf(PhraseHelper.java:341) > at org.apache.lucene.search.spans.TermSpans.collect(TermSpans.java:121) > at > org.apache.lucene.search.spans.NearSpansOrdered.collect(NearSpansOrdered.java:149) > at > org.apache.lucene.search.spans.NearSpansUnordered.collect(NearSpansUnordered.java:171) > at > org.apache.lucene.search.spans.FilterSpans.collect(FilterSpans.java:120) > at > org.apache.lucene.search.uhighlight.PhraseHelper.createOffsetsEnumsForSpans(PhraseHelper.java:261) > ... > {code} > It turns out that there is an "off by one" error in the UnifiedHighlighter's > code that, as far as I can tell, is only triggered when two nested > SpanNearQueries contain the same term. > The resulting behaviour depends on the content of the highlighted document. > Either, some highlighted terms go missing or an > ArrayIndexOutOfBoundsException is thrown. -- This message was sent by Atlassian JIRA (v7.6.3#76005) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org