[ 
https://issues.apache.org/jira/browse/LUCY-182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Marvin Humphrey reopened LUCY-182:
----------------------------------


The fix in this issue appears to have revealed a latent bug. See the test
failure for PGXN::API::Searcher at
[http://www.cpantesters.org/cpan/report/35c78d0a-09cf-11e1-90dc-a20eabe3e1f7].

{noformat}
#   Failed test 'Should have results for simple search'
#   at t/base.t line 274.
#     Structures begin differing at:
#          $got->{hits}[0]{excerpt} = 'This is the <strong>pair</strong><strong> README file. Here you will find all thingds related to </strong><strong>pair</strong>, including installation information'
#     $expected->{hits}[0]{excerpt} = 'This is the <strong>pair</strong> README file. Here you will find all thingds related to <strong>pair</strong>, including installation information'
{noformat}

The problem appears to lie either in how the "flattened" array of
spans is created or in how it is used: in the excerpt above, the text
between the two matches is wrapped in an extra <strong> pair.
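
For illustration only, here is a minimal Python sketch of what a span
flattening step is generally expected to do. It is not Lucy's actual
implementation: the (start, length) span representation and the function
names flatten_spans and highlight are hypothetical. It assumes that a
query such as [pair pair] can yield every match twice, and shows that
merging duplicate or overlapping spans before inserting tags gives the
excerpt the test expects.

```python
from typing import List, Tuple

# Hypothetical span representation: (start offset, length) in characters.
Span = Tuple[int, int]


def flatten_spans(spans: List[Span]) -> List[Span]:
    """Merge duplicate or overlapping spans into disjoint spans sorted by offset."""
    flat: List[Span] = []
    for start, length in sorted(spans):
        if length <= 0:
            continue  # ignore empty spans
        end = start + length
        if flat and start < flat[-1][0] + flat[-1][1]:
            # Overlaps (or duplicates) the previous span: extend it if needed.
            prev_start, prev_len = flat[-1]
            flat[-1] = (prev_start, max(prev_start + prev_len, end) - prev_start)
        else:
            flat.append((start, length))
    return flat


def highlight(text: str, spans: List[Span],
              pre: str = "<strong>", post: str = "</strong>") -> str:
    """Wrap each flattened span in pre/post tags, leaving other text untouched."""
    out = []
    pos = 0
    for start, length in flatten_spans(spans):
        out.append(text[pos:start])                        # text before the match
        out.append(pre + text[start:start + length] + post)
        pos = start + length
    out.append(text[pos:])                                 # trailing text
    return "".join(out)


text = ("This is the pair README file. Here you will find all thingds "
        "related to pair, including installation information")
first = text.index("pair")
second = text.index("pair", first + 1)

# Assumption: a duplicate-term query reports each match twice.
spans = [(first, 4), (second, 4), (first, 4), (second, 4)]
print(highlight(text, spans))
```

With the duplicates merged, only the two occurrences of "pair" are
tagged. If the flattened array instead ends up containing a span that
covers the gap between the matches, the result would resemble the $got
value above.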
                
> highlighter bug when searching for duplicate terms [wordX wordX]
> ----------------------------------------------------------------
>
>                 Key: LUCY-182
>                 URL: https://issues.apache.org/jira/browse/LUCY-182
>             Project: Lucy
>          Issue Type: Bug
>          Components: Core
>    Affects Versions: 0.1.0 (incubating), 0.2.0 (incubating), 0.2.1 (incubating)
>            Reporter: gk
>            Assignee: Marvin Humphrey
>             Fix For: 0.2.2 (incubating), 0.3.0 (incubating)
>
>         Attachments: LUCY-182.patch
>
>
> I stumbled onto this one when searching for [business to business].
> Source <TITLE>: ...Companies, Products, Trade Leads, Business Marketplace
> 'to' is a stopword which is ignored - no problem.
> So the query then becomes [business business].  The highlighter then produces:
> ...Companies, Products, Trade Leads, <strong>Business</strong>
> <strong>Marketp</strong>lace
> I then spent some time chasing my tail trying to reduce things down to
> a small reproducible unit, and finally decided to try searching for
> any duplicate [wordX wordX], and sure enough it's reproducible with
> all my indexes.


        
