It preserves the location of the terms in the original HTML document so that you can highlight terms in HTML. This makes it possible (for instance) to display the entire document, with all the search terms highlighted, or (with some careful surgery) to display formatted HTML (bold, italic, etc) in your search results.

-Mike

On 05/12/2011 03:42 PM, Jonathan Rochkind wrote:
On 5/12/2011 2:55 PM, Ahmet Arslan wrote:
I recently upgraded from Solr 1.3 to Solr 3.1 in order to
take advantage of
the HTMLStripCharFilter. But it isn't working as I
expected.

You need to strip html tag before analysis phase. If you are using DIH, you can use stripHTML="true" transformer.



Wait, then what's the HTMLStripCharFilter for?

Reply via email to