Hi,
References to surrogates are not allowed in XML documents.
Here's the range of allowed characters in the XML 1.0 specification [1]:
Char ::= #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] |
[#x1-#x10] /* any Unicode character, excluding the surrogate
blocks, FFFE, and . */
Hi!
I have an issue parsing XML containing Unicode strings with surrogate
characters (Xerces 2.11.0). The following exception is thrown:
org.xml.sax.SAXParseException; lineNumber: 1; columnNumber: 18;
Character reference "�" is an invalid XML character.
at org.apache.xerces.pa