Anna Simbirtsev wrote:
I just pass a plain XML string to the DOMParser, so I don't use the
transcode function.

[...]
I just copy UTF-8 strings from wikipedia.org and paste them right
into the code to test. After I compiled the parser with ICU, it returns
the string, but shorter. My XML has UTF-8 encoding set:
<?xml version='1.0' encoding='UTF-8'?>.

If you just used cut & paste from your browser to your C++ code editor, I would bet you are not pasting UTF-8 bytes, but characters encoded in your local code page. Can you attach your source code to this e-mail (attached, not copied into the body)?
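
One way to check this is to look at the raw bytes of the literal. The following is only a minimal sketch: looksLikeUtf8 is a hypothetical helper, not part of Xerces-C or ICU, and it tests lead and continuation bytes only (it does not catch every overlong or surrogate sequence). For example, in Latin-1 or Windows-1252 the character "é" is the single byte 0xE9, while in UTF-8 it is the two bytes 0xC3 0xA9.

```cpp
#include <cstddef>
#include <string>

// Hypothetical helper: returns true if the bytes of `s` form a plausible
// UTF-8 sequence. Simplified check on lead and continuation bytes only.
bool looksLikeUtf8(const std::string& s) {
    std::size_t i = 0;
    while (i < s.size()) {
        const unsigned char lead = static_cast<unsigned char>(s[i]);
        std::size_t extra = 0;  // number of continuation bytes expected
        if (lead < 0x80) {
            extra = 0;          // plain ASCII
        } else if (lead >= 0xC2 && lead <= 0xDF) {
            extra = 1;          // 2-byte sequence
        } else if (lead >= 0xE0 && lead <= 0xEF) {
            extra = 2;          // 3-byte sequence
        } else if (lead >= 0xF0 && lead <= 0xF4) {
            extra = 3;          // 4-byte sequence
        } else {
            return false;       // stray continuation byte or invalid lead
        }
        if (i + extra >= s.size()) {
            return false;       // sequence is truncated
        }
        for (std::size_t k = 1; k <= extra; ++k) {
            const unsigned char c = static_cast<unsigned char>(s[i + k]);
            if ((c & 0xC0) != 0x80) {
                return false;   // not a 10xxxxxx continuation byte
            }
        }
        i += extra + 1;
    }
    return true;
}
```

Calling it on "caf\xC3\xA9" (the UTF-8 bytes) and on "caf\xE9" (the Latin-1 byte) shows the difference. Writing the non-ASCII bytes as hex escapes also keeps the literal independent of whatever encoding the editor saves the file in.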

Alberto
