I don't see any way to get XML2 components into the system. At the lower levels I don't see any support for XML2. Any idea how to move forward?
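
For reference, the SAX2 mechanism in question can be sketched outside Tika with plain JAXP. This is only an illustration under stated assumptions, not Tika's API: the names CommentCapture, CommentAwareHandler, and extract are hypothetical, and wiring the same handler into DcXMLParser (or a subclass of it) is not shown. The only standard pieces are DefaultHandler2, which implements LexicalHandler, and the SAX2 "lexical-handler" property on XMLReader.

```java
import java.io.IOException;
import java.io.StringReader;
import javax.xml.parsers.ParserConfigurationException;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;
import org.xml.sax.XMLReader;
import org.xml.sax.ext.DefaultHandler2;

// Hypothetical helper class, for illustration only; not part of Tika.
class CommentCapture {

    // Standard SAX2 property id used to register a LexicalHandler.
    static final String LEXICAL_HANDLER =
            "http://xml.org/sax/properties/lexical-handler";

    // DefaultHandler2 implements both ContentHandler and LexicalHandler,
    // so one object can collect element text and comment text.
    static class CommentAwareHandler extends DefaultHandler2 {
        private final StringBuilder text = new StringBuilder();

        @Override
        public void characters(char[] ch, int start, int length) {
            text.append(ch, start, length);
        }

        @Override
        public void comment(char[] ch, int start, int length) {
            // Called only when the handler is registered as the lexical handler.
            text.append(ch, start, length);
        }

        @Override
        public String toString() {
            return text.toString();
        }
    }

    // Parses the XML string and returns character data plus comment contents
    // in document order.
    static String extract(String xml) {
        try {
            XMLReader reader =
                    SAXParserFactory.newInstance().newSAXParser().getXMLReader();
            CommentAwareHandler handler = new CommentAwareHandler();
            reader.setContentHandler(handler);
            reader.setProperty(LEXICAL_HANDLER, handler);
            reader.parse(new InputSource(new StringReader(xml)));
            return handler.toString();
        } catch (ParserConfigurationException | SAXException | IOException e) {
            throw new IllegalStateException("XML parsing failed", e);
        }
    }

    public static void main(String[] args) {
        System.out.println(extract("<a><!-- note -->body</a>"));
    }
}
```

Without the setProperty call the parser never invokes comment(), which matches the default behaviour described in the original question. A Tika integration would presumably need the same registration on whatever XMLReader or SAXParser the XML parser creates internally.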
On Thu, Apr 18, 2024 at 2:13 PM Tim Allison <[email protected]> wrote:

> Hi Claude,
>
> I'd recommend a custom XMLParser for this, perhaps subclass DcXMLParser?
> We could also parameterize this in the DcXMLParser if a committer had a
> chance to add that feature or review a PR from you.
>
> Best,
>
> Tim
>
> On Thu, Apr 18, 2024 at 7:33 AM Claude Warren <[email protected]> wrote:
>
>> It seems that SAX-based XML parsers ignore XML comments by default, but
>> that the SAX2 LexicalHandler does handle them. Is there a way to add a
>> LexicalHandler to the XML parser so that the contents of the comments will
>> be returned by the Tika handler.toString()?
>>
>> Many thanks,
>> Claude
>>
>

--
LinkedIn: http://www.linkedin.com/in/claudewarren
