Anna Simbirtsev wrote:
I pass just plain xml string to the DOMParser, so I don't use the
transcode function.
[...]
I just copy utf-8 strings from wikipedia.org and paste it right
into the code to test. After I compiled the parser with ICU, it returns
the string, but shorter. My xml has UTF-8 encoding set: <?xml
version='1.0' encoding='UTF-8'?>.
If you just used cut & paste from your browser to your C++ code editor,
I can bet you are not pasting UTF-8 codepoints, but something in your
local code page. Can you attach your source code to this e-mail
(attached, not copied)?
Alberto