All,

        It looks like the Xerces parser is converting incoming UTF-8 to
UTF-16 automatically during the parse.

        Consider the following:
                <abc>******</abc>   where "*****" is raw UTF-8

        The parser is converting the raw UTF-8 to UTF-16 and the Java string
containing the "abc" value contains UTF-16 codepoints.
        Is this proper behavior?  Can I turn this behavior off?

        Any help is MUCH appreciated.

thanks,
-- John

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to