Hi, I've discovered that 2.0.1 is not that latest version of Xerces and that the latest version does not exhibit the problem discribed in my previous post.
Paul Paul Kinnucan writes: > Hi, > > If I try to use the JAXP version of DOM and Xerces-J 2.0.1.01 > to extract the content of an XML document that includes > a doctype declaration that declares external entities, all > the elements of the parsed document appear to be empty. If I remove > the external entity declaration or the entire doctype declaration, > my program is able to extract the content without any problem. > > Consider, for example, the following document: > > > <?xml version="1.0" encoding="utf-8"?> > <!DOCTYPE book PUBLIC "-//The Mathworks//DTD axdocbook variant//" "" []> > <book> > <title>Using Simulink</title> > <para>hello world</para> > </book> > > If I try to extract the content of the title element, using the following > code: > > DOMParser parser = new > DOMParser();Book.parser.parse(bookFilePath.getAbsolutePath()); > Document doc = parser.getDocument(); > NodeList titleElements = doc.getElementsByTagName("title"); > Node titleElement = titleElements.item(0); > > titleElement is empty, i.e., titleElement.hasChildNodes() returns false. > However, if I remove the square brackets from the doctype declaration, i.e., > > <!DOCTYPE book PUBLIC "-//The Mathworks//DTD axdocbook variant//" ""> > > or remove the doctype declaration itself, the above code works perfectly, > i.e., > titleElement.hasChildNodes() returns true and the child node is a text > element > that contains "Using Simulink." > > I'd appreciate any help you can give me. > > Paul > > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] > --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]