HI Claude,

I'd recommend a custom XMLParser for this, perhaps subclass DcXMLParser? We
could also parameterize this in the DcXMLParser if a committer had a chance
to add that feature or review a PR from yoou.

Best,

        Tim

On Thu, Apr 18, 2024 at 7:33 AM Claude Warren <[email protected]> wrote:

> It seems that SAX based XML parsers by default ignore XML comments, that
> the SAX2 LexicalHandler does handle them.  Is there a way to add a
> LexicalHandler to the XML parser so that the contents of the comments will
> be returned by the Tika handler.toString()?
>
> Many thanks,
> Claude
>

Reply via email to