is full UNICODE input roundtrip possible with Xerces 2?

Aleksander Slominski Wed, 31 Jul 2002 13:30:23 -0700

hi,

i would like to round-trip XML document content as UNICODE.


however i have noticed a problem when using Xerces 2:
in prolog or epilog in XML 1.0 we have

  prolog ::=   XMLDecl? Misc* (doctypedecl Misc*)?
  Misc ::=   Comment | PI | S
    (as defined in http://www.w3.org/TR/REC-xml#NT-prolog)

and Xerces2 reports Comment and PI but not S though it could
naturally be reported as ignorableWhitespace (at least in XNI).
i think that the same is true for epilog (Misc*).

i looked into org.apache.xerces.impl.XMLDocumentScannerImpl.PrologDispatcher
and it seems to skip over spaces without reporting them as ignorable
whitespaces (fEntityScanner.skipSpaces()) - is it right place or maybe
reporting S can be turned on somewhere by some feature?

also is there a way to get exact content  of doctypedecl or XMLDecl?

thanks,

alek



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

is full UNICODE input roundtrip possible with Xerces 2?

Reply via email to