[ 
https://issues.apache.org/jira/browse/LUCENE-1500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677549#action_12677549
 ] 

Hoss Man commented on LUCENE-1500:
----------------------------------

bq. The extent of my tracing suggests it's coming when the token stream is 
generated, which looks to be part of the lucene highlighter: 
org.apache.lucene.search.highlight.TokenSources

with my limited knowledge of solr highlighting, that really doesn't sound right.

If the field has TermVectors, then the TokenStream used comes from there, 
otherwise it comes from analyzing the stored field value -- either way the 
analyzer configured in Solr should be setting the offset values, correct? 

hence my question on the solr thread where this first came up...
http://www.nabble.com/Error-with-highlighter-and-UTF-8-chars--to22156161.html#a22207917

bq. so what does the analysis screen tell you about each token produced with 
that input text given your configuration?  in verbose mode it will show the 
start/end offsets for every token, so it should be fairly easy to identify 
where the bug is. 
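
As a rough illustration of that check (a minimal sketch, not Lucene or Solr code): the class OffsetCheck, the Tok holder, and the sample offsets below are all hypothetical, made up for this example. It assumes the highlighter ends up slicing the field text with something like text.substring(start, end), which in Java throws StringIndexOutOfBoundsException when an offset falls outside the string. Given the start/end offsets reported for each token, it lists the ones that could not be used that way.

```java
import java.util.ArrayList;
import java.util.List;

public class OffsetCheck {

    /** Hypothetical stand-in for an analyzed token: term text plus start/end character offsets. */
    static final class Tok {
        final String term;
        final int start;
        final int end;

        Tok(String term, int start, int end) {
            this.term = term;
            this.start = start;
            this.end = end;
        }
    }

    /** Returns the tokens whose offsets would make text.substring(start, end) throw. */
    static List<Tok> invalidOffsets(String text, List<Tok> tokens) {
        List<Tok> bad = new ArrayList<>();
        for (Tok t : tokens) {
            if (t.start < 0 || t.end < t.start || t.end > text.length()) {
                bad.add(t);
            }
        }
        return bad;
    }

    public static void main(String[] args) {
        // One value of the multivalued "features" field from the report (28 characters).
        String text = "Features also is multivalued";

        List<Tok> tokens = new ArrayList<>();
        tokens.add(new Tok("features", 0, 8));
        tokens.add(new Tok("multivalued", 17, 28));
        // Made-up offsets that point past the end of this value, for illustration only.
        tokens.add(new Tok("things", 60, 66));

        for (Tok t : invalidOffsets(text, tokens)) {
            System.out.println("bad offsets for '" + t.term + "': "
                    + t.start + "-" + t.end + " (text length " + text.length() + ")");
        }
    }
}
```

Any token this flags would be the place to look, whether the offsets came from term vectors or from re-analyzing the stored value.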

> Highlighter throws StringIndexOutOfBoundsException
> --------------------------------------------------
>
>                 Key: LUCENE-1500
>                 URL: https://issues.apache.org/jira/browse/LUCENE-1500
>             Project: Lucene - Java
>          Issue Type: Bug
>          Components: contrib/highlighter
>    Affects Versions: 2.4
>         Environment: Found this running the example code in Solr (latest 
> version).
>            Reporter: David Bowen
>            Assignee: Michael McCandless
>             Fix For: 2.4.1, 2.9
>
>         Attachments: LUCENE-1500.patch, patch.txt
>
>
> Using the canonical Solr example (ant run-example) I added this document 
> (using exampledocs/post.sh):
> <add><doc>
>   <field name="id">Test for Highlighting 
> StringIndexOutOfBoundsExcdption</field>
>   <field name="name">Some Name</field>
>   <field name="manu">Acme, Inc.</field>
>   <field name="features">Description of the features, mentioning various 
> things</field>
>   <field name="features">Features also is multivalued</field>
>   <field name="popularity">6</field>
>   <field name="inStock">true</field>
> </doc></add>
> and then the URL 
> http://localhost:8983/solr/select/?q=features&hl=true&hl.fl=features caused 
> the exception.
> I have a patch.  I don't know if it is completely correct, but it avoids this 
> exception.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

