All,
It looks like the Xerces parser is converting incoming UTF-8 to
UTF-16 automatically during the parse.
Consider the following:
<abc>*****</abc> where "*****" is raw UTF-8 text
The parser decodes the raw UTF-8 bytes, and the Java String
holding the "abc" value contains UTF-16 code units.
Is this proper behavior? Can I turn this behavior off?
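For reference, here is a minimal sketch of what I am seeing. It is not my actual code: it assumes the standard JAXP DocumentBuilder API (which may or may not be backed by Xerces on a given JDK), and the class name Utf8ParseDemo, the helper rootText, and the sample text "café" are made up for illustration.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;

public class Utf8ParseDemo {

    // Hypothetical helper: parses XML bytes and returns the root element's text.
    // With no XML declaration, the parser treats the input as UTF-8.
    static String rootText(byte[] xmlBytes) {
        try {
            DocumentBuilder builder =
                DocumentBuilderFactory.newInstance().newDocumentBuilder();
            Document doc = builder.parse(new ByteArrayInputStream(xmlBytes));
            return doc.getDocumentElement().getTextContent();
        } catch (Exception e) {
            throw new IllegalStateException("XML parse failed", e);
        }
    }

    public static void main(String[] args) {
        // "\u00e9" (e with acute accent) takes two bytes in UTF-8: 0xC3 0xA9.
        byte[] xml = "<abc>caf\u00e9</abc>".getBytes(StandardCharsets.UTF_8);

        String value = rootText(xml);

        // A Java String stores UTF-16 code units, one per BMP character here.
        System.out.println(value.length());   // prints 4

        // Re-encoding the String yields UTF-8 bytes again.
        byte[] utf8 = value.getBytes(StandardCharsets.UTF_8);
        System.out.println(utf8.length);      // prints 5
    }
}
```

As far as I can tell, the UTF-16 representation comes from java.lang.String itself, and the last two lines show that UTF-8 bytes can be obtained again by re-encoding the String with getBytes(StandardCharsets.UTF_8).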
Any help is MUCH appreciated.
thanks,
-- John