All, In working on integrating Tika 1.13 with Solr, I found that we are now suppressing the "body" tag in our HTMLParser via DefaultHtmlMapper not including "body" among the SAFE_ELEMENTS. The XHTMLHandler is responsible for this now. Does this ring a bell?
Is this a bug in Tika, or do we expect people to suppress body in their
implementations of HtmlMapper?
Cheers,
Tim
https://issues.apache.org/jira/browse/SOLR-8981
