Hello,

What do you mean by "convert it into InputSource"?  Did you use a byte
input stream or a character input stream?  And precisely how did the
parser crash?

Xerces can certainly read UTF-8 encoded XML documents directly; it should
also have no problems with character streams, so long as the JVM had
enough information to decode the characters correctly.
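
To make the byte stream versus character stream distinction concrete,
here is a minimal sketch, not a diagnosis of the reported crash.  It uses
the standard JAXP DocumentBuilder rather than the Xerces DOMParser class
mentioned in the question, so that it runs without extra jars.  The class
name, method names and sample document are all hypothetical.

```java
import java.io.ByteArrayInputStream;
import java.io.StringReader;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.xml.sax.InputSource;

public class Utf8ParseSketch {
    // Hypothetical sample document. The non-ASCII text uses Unicode escapes
    // so the encoding of this source file does not matter.
    static final String XML =
        "<?xml version=\"1.0\" encoding=\"UTF-8\"?>"
        + "<greeting>gr\u00fc\u00df dich</greeting>";

    static DocumentBuilder newBuilder() throws Exception {
        return DocumentBuilderFactory.newInstance().newDocumentBuilder();
    }

    // Byte stream: the parser sees raw bytes and decodes them itself,
    // guided by the encoding declaration in the document.
    static String parseFromBytes(byte[] utf8Bytes) throws Exception {
        InputSource source = new InputSource(new ByteArrayInputStream(utf8Bytes));
        Document doc = newBuilder().parse(source);
        return doc.getDocumentElement().getTextContent();
    }

    // Character stream: the text is already decoded, so the parser reads
    // the characters as given and disregards the encoding declaration.
    static String parseFromString(String xml) throws Exception {
        InputSource source = new InputSource(new StringReader(xml));
        Document doc = newBuilder().parse(source);
        return doc.getDocumentElement().getTextContent();
    }

    public static void main(String[] args) throws Exception {
        byte[] utf8Bytes = XML.getBytes(StandardCharsets.UTF_8);
        String fromBytes = parseFromBytes(utf8Bytes);
        String fromChars = parseFromString(XML);
        // Reports whether both routes produced the same element text.
        System.out.println(fromBytes.equals(fromChars));
    }
}
```

With the character-stream route, any decoding has to be done correctly
before the parser is involved.  A file read into a String with the
platform default encoding instead of UTF-8 may already hold the wrong
characters by the time it reaches the parser.  Whether that is what
happened here is only a guess; the thread does not say.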

Cheers,
Neil Graham
XML Parser Development
IBM Toronto Lab
Phone:  905-413-3519, T/L 969-3519
E-mail:  [EMAIL PROTECTED]



"himanshu" <[EMAIL PROTECTED]> on 09/25/2001 09:51:26 AM

Please respond to [EMAIL PROTECTED]

To:   <[EMAIL PROTECTED]>
cc:
Subject:  A query regarding Unicode support in Xerces-J


Hi All,
   I am wondering whether all the XML parsers available today are
internally Unicode based or not. I am working with the Apache Xerces-J
parser and found that it cannot handle Unicode characters in an XML
document (or there is some hidden way of working around them). What I
tried was simple - create a UTF-8 encoded XML file, read it into a
String, convert this String object into an InputSource and try to parse
it using the Apache DOMParser - it just crashes!!

Any insight will be a great help.

Thanks and regards,

Himanshu


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]




