[ https://issues.apache.org/jira/browse/SOLR-4137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13508764#comment-13508764 ]
Marcel commented on SOLR-4137: ------------------------------ @Robert - sure. Posted an example in the bug description. Here is my analyzer chain: {code} Index Analyzer org.apache.solr.analysis.WhitespaceTokenizerFactory {luceneMatchVersion=LUCENE_36} org.apache.solr.analysis.HyphenatedWordsFilterFactory {luceneMatchVersion=LUCENE_36} org.apache.solr.analysis.WordDelimiterFilterFactory {preserveOriginal=1, splitOnCaseChange=1, generateNumberParts=1, catenateWords=1, types=wdftypes.txt, luceneMatchVersion=LUCENE_36, generateWordParts=1, catenateAll=0, catenateNumbers=1} org.apache.solr.analysis.LowerCaseFilterFactory {luceneMatchVersion=LUCENE_36} org.apache.solr.analysis.StopFilterFactory {words=lang/stopwords_de.txt, ignoreCase=true, enablePositionIncrements=true, luceneMatchVersion=LUCENE_36} org.apache.solr.analysis.DictionaryCompoundWordTokenFilterFactory {maxSubwordSize=15, onlyLongestMatch=true, minSubwordSize=4, minWordSize=5, dictionary=spellings.txt, luceneMatchVersion=LUCENE_36} org.apache.solr.analysis.GermanNormalizationFilterFactory {luceneMatchVersion=LUCENE_36} org.apache.solr.analysis.GermanStemFilterFactory {luceneMatchVersion=LUCENE_36} org.apache.solr.analysis.SnowballPorterFilterFactory {language=German2, luceneMatchVersion=LUCENE_36} org.apache.solr.analysis.RemoveDuplicatesTokenFilterFactory {luceneMatchVersion=LUCENE_36} {code} > FastVectorHighlighter: StringIndexOutOfBoundsException in BaseFragmentsBuilder > ------------------------------------------------------------------------------ > > Key: SOLR-4137 > URL: https://issues.apache.org/jira/browse/SOLR-4137 > Project: Solr > Issue Type: Bug > Components: highlighter > Affects Versions: 3.6.1 > Reporter: Marcel > > under some circumstances the BaseFragmentsBuilder genereates a > StringIndexOutOfBoundsException inside the makeFragment method. > The starting offset is higher than the end offset. > I did a small patch checking the offsets and posted it over there at > Stackoverflow: > http://stackoverflow.com/questions/12456448/solr-highlight-bug-with-usefastvectorhighlighter > The code in 4.0 seems to be the same as in 3.6.1 > Example how to reproduce the behaviour: > There is a word called "www.DAKgesundAktivBonus.de" inside the index. If you > search for "dak bonus" some offset calculations went wrong. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org