Hey Guys,
How do I add HTML/XML documents using SolrJ such that it does not by
pass the HTML char filter?
SolrJ escapes the HTML/XML value of a field, and that make it bypass
the HTML char filter. For example centercontent/center if added to
a field with HTMLStripCharFilter on the field using
The HTMLStripCharFilter will strip the html for the *indexed* terms,
it does not effect the *stored* field.
If you don't want html in the stored field, can you just strip it out
before passing to solr?
On Nov 11, 2009, at 8:07 PM, aseem cheema wrote:
Hey Guys,
How do I add HTML/XML
Ohhh... you are a life saver... thank you so much.. it makes sense.
Aseem
On Wed, Nov 11, 2009 at 7:40 PM, Ryan McKinley ryan...@gmail.com wrote:
The HTMLStripCharFilter will strip the html for the *indexed* terms, it does
not effect the *stored* field.
If you don't want html in the stored