Hi Claude,
I'd recommend a custom XMLParser for this, perhaps a subclass of DcXMLParser.
We could also parameterize this in DcXMLParser itself if a committer had a
chance to add that feature or review a PR from you.
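
As a rough starting point, here is a minimal sketch of the underlying SAX
mechanism, using only JDK classes and no Tika code. It registers a
LexicalHandler on the XMLReader so that comment text lands in the same buffer
as character data. The names CommentCapture, CommentAwareHandler and extract
are made up for illustration, and how this would be wired into a custom
XMLParser is not shown here.

```java
import java.io.StringReader;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.InputSource;
import org.xml.sax.XMLReader;
import org.xml.sax.ext.DefaultHandler2;

// Hypothetical illustration class; not part of Tika.
public class CommentCapture {

    /** Collects element text and comment text into a single buffer. */
    static class CommentAwareHandler extends DefaultHandler2 {
        private final StringBuilder text = new StringBuilder();

        @Override
        public void characters(char[] ch, int start, int length) {
            text.append(ch, start, length);
        }

        // LexicalHandler callback; only invoked if the handler is
        // registered through the lexical-handler property below.
        @Override
        public void comment(char[] ch, int start, int length) {
            text.append(ch, start, length);
        }

        @Override
        public String toString() {
            return text.toString();
        }
    }

    /** Parses the XML string and returns text plus comment contents. */
    static String extract(String xml) throws Exception {
        SAXParserFactory factory = SAXParserFactory.newInstance();
        factory.setNamespaceAware(true);
        XMLReader reader = factory.newSAXParser().getXMLReader();

        CommentAwareHandler handler = new CommentAwareHandler();
        reader.setContentHandler(handler);
        // Standard SAX2 property id for attaching a LexicalHandler.
        reader.setProperty(
                "http://xml.org/sax/properties/lexical-handler", handler);

        reader.parse(new InputSource(new StringReader(xml)));
        return handler.toString();
    }

    public static void main(String[] args) throws Exception {
        System.out.println(extract("<doc>visible<!-- hidden note --></doc>"));
    }
}
```

The comment text is appended exactly as the parser reports it, so any
separator or markup around comments would need to be added in comment().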
Best,
Tim
On Thu, Apr 18, 2024 at 7:33 AM Claude Warren <[email protected]> wrote:
> It seems that SAX-based XML parsers by default ignore XML comments, but
> that the SAX2 LexicalHandler does handle them. Is there a way to add a
> LexicalHandler to the XML parser so that the contents of the comments will
> be returned by the Tika handler.toString()?
>
> Many thanks,
> Claude
>