Hi,
I would like to round-trip XML document content as Unicode.
However, I have noticed a problem when using Xerces 2.
For the prolog (and similarly the epilog) XML 1.0 has:
prolog ::= XMLDecl? Misc* (doctypedecl Misc*)?
Misc ::= Comment | PI | S
(as defined in http://www.w3.org/TR/REC-xml#NT-prolog)
Xerces2 reports Comment and PI but not S, even though S could
naturally be reported as ignorableWhitespace (at least in XNI).
I think the same is true for the epilog (Misc*).
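
To make this easy to reproduce, here is a minimal sketch. It goes through plain SAX2 via JAXP rather than XNI (an assumption made for brevity), and the class name PrologEvents and the sample document are made up for illustration. It records every comment, PI, element start, and whitespace callback so that one can see which parts of the prolog and epilog are reported at all.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.XMLReader;
import org.xml.sax.ext.DefaultHandler2;

// Hypothetical helper: collects the SAX events seen while parsing a document.
class PrologEvents {

    static List<String> collect(String xml) throws Exception {
        final List<String> events = new ArrayList<>();

        // DefaultHandler2 implements both ContentHandler and LexicalHandler.
        DefaultHandler2 handler = new DefaultHandler2() {
            @Override
            public void comment(char[] ch, int start, int length) {
                events.add("comment:" + new String(ch, start, length));
            }
            @Override
            public void processingInstruction(String target, String data) {
                events.add("pi:" + target);
            }
            @Override
            public void ignorableWhitespace(char[] ch, int start, int length) {
                events.add("ignorableWhitespace");
            }
            @Override
            public void characters(char[] ch, int start, int length) {
                events.add("characters");
            }
            @Override
            public void startElement(String uri, String local, String qName, Attributes atts) {
                events.add("start:" + qName);
            }
        };

        XMLReader reader = SAXParserFactory.newInstance().newSAXParser().getXMLReader();
        reader.setContentHandler(handler);
        // Comments are only delivered through the lexical-handler property.
        reader.setProperty("http://xml.org/sax/properties/lexical-handler", handler);
        reader.parse(new InputSource(new StringReader(xml)));
        return events;
    }

    public static void main(String[] args) throws Exception {
        // Whitespace (S) separates every Misc item in the prolog and epilog.
        String xml = "<?xml version=\"1.0\"?>\n<!-- c -->\n<?pi data?>\n<root/>\n<!-- tail -->\n";
        for (String event : collect(xml)) {
            System.out.println(event);
        }
    }
}
```

If the line breaks between the Misc items were reported, they would show up as ignorableWhitespace or characters entries in the printed list.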
I looked into org.apache.xerces.impl.XMLDocumentScannerImpl.PrologDispatcher
and it seems to skip over whitespace without reporting it as ignorable
whitespace (it calls fEntityScanner.skipSpaces()). Is this the right place
to change, or can reporting of S be turned on somewhere by some feature?
Also, is there a way to get the exact content of doctypedecl or XMLDecl?
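
For comparison, the parsed pieces of both declarations (not their exact text) can be read back through the standard DOM Level 3 accessors. The sketch below assumes a JAXP parser with DOM Level 3 support; the class name PrologInfo and the sample document are hypothetical. It does not preserve the original quoting, attribute order, or whitespace, which is exactly what a true round-trip would need.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.DocumentType;

// Hypothetical helper: shows what a DOM exposes about XMLDecl and doctypedecl.
class PrologInfo {

    static Document parse(String xml) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        // Parse from bytes so the declared encoding is actually applied.
        byte[] bytes = xml.getBytes(StandardCharsets.UTF_8);
        return factory.newDocumentBuilder().parse(new ByteArrayInputStream(bytes));
    }

    public static void main(String[] args) throws Exception {
        String xml = "<?xml version=\"1.0\" encoding=\"UTF-8\"?>\n"
                   + "<!DOCTYPE root [\n  <!ELEMENT root EMPTY>\n]>\n"
                   + "<root/>\n";
        Document doc = parse(xml);

        // Parsed values from the XML declaration.
        System.out.println("version    = " + doc.getXmlVersion());
        System.out.println("encoding   = " + doc.getXmlEncoding());
        System.out.println("standalone = " + doc.getXmlStandalone());

        // Parsed values from the document type declaration.
        DocumentType doctype = doc.getDoctype();
        System.out.println("doctype name    = " + doctype.getName());
        System.out.println("public id       = " + doctype.getPublicId());
        System.out.println("system id       = " + doctype.getSystemId());
        // The internal subset is returned as a string; it may be a
        // reconstruction by the parser rather than the original characters.
        System.out.println("internal subset = " + doctype.getInternalSubset());
    }
}
```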
thanks,
alek