[
https://issues.apache.org/jira/browse/LUCY-182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Marvin Humphrey reopened LUCY-182:
----------------------------------
The fix in this issue appears to have revealed a latent bug. See the test
failure for PGXN::API::Searcher at
[http://www.cpantesters.org/cpan/report/35c78d0a-09cf-11e1-90dc-a20eabe3e1f7].
{noformat}
# Failed test 'Should have results for simple search'
# at t/base.t line 274.
# Structures begin differing at:
# $got->{hits}[0]{excerpt} = 'This is the
<strong>pair</strong><strong> README file. Here you will find all thingds
related to </strong><strong>pair</strong>, including installation information'
# $expected->{hits}[0]{excerpt} = 'This is the <strong>pair</strong> README
file. Here you will find all thingds related to <strong>pair</strong>,
including installation information'
{noformat}
The problem looks like it either lies in how the "flattened" array of
spans is created or how it is used.
> highlighter bug when searching for duplicate terms [wordX wordX]
> ----------------------------------------------------------------
>
> Key: LUCY-182
> URL: https://issues.apache.org/jira/browse/LUCY-182
> Project: Lucy
> Issue Type: Bug
> Components: Core
> Affects Versions: 0.1.0 (incubating), 0.2.0 (incubating), 0.2.1
> (incubating)
> Reporter: gk
> Assignee: Marvin Humphrey
> Fix For: 0.2.2 (incubating), 0.3.0 (incubating)
>
> Attachments: LUCY-182.patch
>
>
> I stumbled onto this one when searching for [business to business].
> Source <TITLE>: ...Companies, Products, Trade Leads, Business Marketplace
> 'to' is a stopword which is ignored - no problem.
> So the query then becomes [business business]. The highlighter then produces:
> ...Companies, Products, Trade Leads, <strong>Business</strong>
> <strong>Marketp</strong>lace
> I then spent some time chasing my tail trying to reduce things down to
> a small reproducible unit, and finally decided to try searching for
> any duplicate [wordX wordX], and sure enough it's reproducible with
> all my indexes.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira