[ 
https://issues.apache.org/jira/browse/SOLR-937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12872898#action_12872898
 ] 

Chris Harris commented on SOLR-937:
-----------------------------------

I haven't taken the time to reproduce this particular issue, but I think the 
problem is not limited to stemming. Assuming you aren't specifying particular 
field names in your query, the problem can be summarized like this:

Solr (at least as of 1.4) is in danger of producing weird highlights whenever 
the analyzer for your index's default search field differs from the analyzer 
for your highlight field(s). The HighlightComponent takes the Query object 
parsed by the QueryComponent (which was tokenized according to the default 
field's analyzer) and applies it unchanged to the highlighting field (which 
will be tokenized according to a different analyzer). The same word may be 
tokenized differently by the two tokenizers, with results like missing 
highlights.

I'm not sure what the best solution is here. I've proposed an option that can 
help in some cases at SOLR-1910. Another possibility would be a new 
hl.useHighlightedFieldAsDefaultField highlighter option, which would create a 
new Query object (based on a separate analyzer) not just once at the start of 
highlighting, but separately for each particular field that's getting 
highlighted.

A complication for either approach is whether you should do anything special to 
parts of the query that _do_ specify a particular field (e.g. 
"features:electronics").

> Highlighting problem related to stemming
> ----------------------------------------
>
>                 Key: SOLR-937
>                 URL: https://issues.apache.org/jira/browse/SOLR-937
>             Project: Solr
>          Issue Type: Bug
>          Components: highlighter
>    Affects Versions: 1.3
>            Reporter: David Bowen
>
> Using the example data (as in "ant run-example") from the latest dev version, 
> add the words "electronics" and "connector" to the features field of the 
> first doc in ipod_other.xml.  Now the following query:
> http://localhost:8983/solr/select/?q=electronics&hl=true&hl.fl=features+cat
> will show "electronics" highlighted in the features field but not in the cat 
> field.  If you search instead for "connector", it is highlighted in both.
> This seems like a bug to me.  A possible but not entirely satisfactory 
> work-around would be to have the cat field copied into another field which is 
> stemmed, and use that other field for highlighting (assuming the search is on 
> the default search field, and not on cat).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to