I think they come in as UTF-8 from the server, all I need is to parse them.
On Fri, 2008-09-19 at 16:07 -0700, David Bertoni wrote: > Anna Simbirtsev wrote: > > Do you know if I receive utf-8 string, can I just take out s.transcode > > completely and keep the string in utf-8? DOMString is capable of > > containing utf-8 strings? > No, Xerces-C always uses UTF-16 internally to encode character data. > When you supply a document that is not encoded in UTF-16, it uses a > transcoder to convert the byte stream to UTF-16 before parsing it. > > You seemed to be confused about the differences between UTF-8 and > UTF-16. Both are encodings that can represent all of the characters in > Unicode. UTF-8 is an 8-bit encoding that is compatible with the char > data type in C. UTF-16 is a 16-bit encoding, so it's not compatible > with the char data type. > > Is there some reason you need strings encoded in UTF-8? > > Dave
