Michael Braun created LUCENE-7682: ------------------------------------- Summary: UnifiedHighlighter not highlighting all terms relevant in SpanNearQuery Key: LUCENE-7682 URL: https://issues.apache.org/jira/browse/LUCENE-7682 Project: Lucene - Core Issue Type: Bug Components: modules/highlighter Reporter: Michael Braun
Original text: "Something for protecting wildlife feed in a feed thing." Query is: SpanNearQuery with Slop 9 - in order - 1. SpanTermQuery(wildlife) 2. SpanTermQuery(feed) This should highlight both instances of "feed" since they are both within slop of 9 of "wildlife". However, only the first instance is highlighted. This occurs with unordered SpanNearQuery as well. Test below replicates. Affects both the current 6.x line and master. Test that fits within TestUnifiedHighlighterMTQ: {code} public void testOrderedSpanNearQueryWithDupeTerms() throws Exception { RandomIndexWriter iw = new RandomIndexWriter(random(), dir, indexAnalyzer); Document doc = new Document(); doc.add(new Field("body", "Something for protecting wildlife feed in a feed thing.", fieldType)); doc.add(newTextField("id", "id", Field.Store.YES)); iw.addDocument(doc); IndexReader ir = iw.getReader(); iw.close(); IndexSearcher searcher = newSearcher(ir); UnifiedHighlighter highlighter = new UnifiedHighlighter(searcher, indexAnalyzer); int docID = searcher.search(new TermQuery(new Term("id", "id")), 1).scoreDocs[0].doc; SpanTermQuery termOne = new SpanTermQuery(new Term("body", "wildlife")); SpanTermQuery termTwo = new SpanTermQuery(new Term("body", "feed")); SpanNearQuery topQuery = new SpanNearQuery.Builder("body", true) .setSlop(9) .addClause(termOne) .addClause(termTwo) .build(); int[] docIds = new int[] {docID}; String snippets[] = highlighter.highlightFields(new String[] {"body"}, topQuery, docIds, new int[] {2}).get("body"); assertEquals(1, snippets.length); assertEquals("Something for protecting <b>wildlife</b> <b>feed</b> in a <b>feed</b> thing.", snippets[0]); ir.close(); } {code} -- This message was sent by Atlassian JIRA (v6.3.15#6346) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org