Honestly, I have no idea how to do that. PatternReplaceCharFilter doesn't seem to have a parameter like preservePositions="true" and optionally fillCharacter=" ". And I don't think I can express this simply as regex. How would I count in a pure regex the length difference before and after the match?
Well, the specific problem is, that when highlighting the term positions are wrong and the result is not a valid XML structure that I can handle. I expect something like <TextLine aa="bb" cc="dd" content="the content to <em>search</em> in" ee="ff" /> but I can <Tex<em>tLine</em>aa="bb" cc="dd" content="the content to <em>search</em> in" ee="ff" /> Thanks for your help. -- View this message in context: http://lucene.472066.n3.nabble.com/Problem-with-PatternReplaceCharFilter-tp4066869p4066939.html Sent from the Solr - User mailing list archive at Nabble.com.