I am attaching a sample XML that contains a U+200D character between a
--| and |-- pattern; I modified DOMPrint to issue a
const XMLCh* data=doc->getDocumentElement()->getTextContent();
and in the debugger I see that data[4] is \x200D
Have you checked your source XML really has that character? Also, is
the representation of the ZWJ character in the XML file valid according
to the specified encoding (e.g. in UTF-8, it's 0xE2 0x80 0x8D)?
Alberto
jinesh kj wrote:
hi,
Actually, getTextContent is not returning any value when there is a Zero
width joiner.
cheers
Jinesh K J
On Nov 28, 2007 3:28 PM, Alberto Massari <[EMAIL PROTECTED]> wrote:
Hi Jinesh,
which kind of issues are you having? The text returned by getTextContent
should contain a \x200D value inside. Or have you transcoded it into
chars?
Alberto
jinesh kj wrote:
hi all,
I was trying to read from an XML file where some data have ZERO Width
Joiner
in it. I used the getTextContent in DOMNode. I was able to read the
contents
without Zero width joiner, but there are some issues with these special
characters. What do i have to change? Do i have to make any special
settings? Or do i have to use any other function insttead?
cheers
Jinesh K J
<?xml version="1.0"?>
<root>---|â|---</root>