[ 
https://issues.apache.org/jira/browse/SOLR-556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12603782#action_12603782
 ] 

Lars Kotthoff commented on SOLR-556:
------------------------------------

In the setup I've been testing it with (one large single-valued "text" field 
and several multi-valued fields) it didn't seem to have any serious performance 
implications -- i.e. the randomness of my test queries was enough to mask any 
loss of performance ;)
The length of the multi-valued fields is in the order of 10-20 characters on 
average though and there're not many multiple different values.

I personally think that returning correct data is more important than 
performance in this case, but that may just be because my particular setup 
doesn't suffer any significant loss of performance. I didn't see any other way 
to correct the behaviour of the current trunk code, but if anybody else has a 
better idea how to do it, please let us know!

> Highlighting of multi-valued fields returns snippets which span multiple 
> different values
> -----------------------------------------------------------------------------------------
>
>                 Key: SOLR-556
>                 URL: https://issues.apache.org/jira/browse/SOLR-556
>             Project: Solr
>          Issue Type: Bug
>          Components: highlighter
>    Affects Versions: 1.3
>         Environment: Tomcat 5.5
>            Reporter: Lars Kotthoff
>            Assignee: Mike Klaas
>            Priority: Minor
>             Fix For: 1.3
>
>         Attachments: SOLR-556-highlight-multivalued.patch, 
> solr-highlight-multivalued-example.xml
>
>
> When highlighting multi-valued fields, the highlighter sometimes returns 
> snippets which span multiple values, e.g. with values "foo" and "bar" and 
> search term "ba" the highlighter will create the snippet "foo<em>ba</em>r". 
> Furthermore it sometimes returns smaller snippets than it should, e.g. with 
> value "foobar" and search term "oo" it will create the snippet "<em>oo</em>" 
> regardless of hl.fragsize.
> I have been unable to determine the real cause for this, or indeed what 
> actually goes on at all. To reproduce the problem, I've used the following 
> steps:
> * create an index with multi-valued fields, one document should have at least 
> 3 values for these fields (in my case strings of length between 5 and 15 
> Japanese characters -- as far as I can tell plain old ASCII should produce 
> the same effect though)
> * search for part of a value in such a field with highlighting enabled, the 
> additional parameters I use are hl.fragsize=70, hl.requireFieldMatch=true, 
> hl.mergeContiguous=true (changing the parameters does not seem to have any 
> effect on the result though)
> * highlighted snippets should show effects described above

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to