It preserves the location of the terms in the original HTML document so
that you can highlight terms in HTML. This makes it possible (for
instance) to display the entire document, with all the search terms
highlighted, or (with some careful surgery) to display formatted HTML
(bold, italic, etc) in your search results.
-Mike
On 05/12/2011 03:42 PM, Jonathan Rochkind wrote:
On 5/12/2011 2:55 PM, Ahmet Arslan wrote:
I recently upgraded from Solr 1.3 to Solr 3.1 in order to
take advantage of
the HTMLStripCharFilter. But it isn't working as I
expected.
You need to strip html tag before analysis phase. If you are using
DIH, you can use stripHTML="true" transformer.
Wait, then what's the HTMLStripCharFilter for?