Re: Problem with surrogate characters

2014-08-26 Thread Michael Glavassevich
Hi, References to surrogates are not allowed in XML documents. Here's the range of allowed characters in the XML 1.0 specification [1]: Char ::= #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x1-#x10] /* any Unicode character, excluding the surrogate blocks, FFFE, and . */

Problem with surrogate characters

2014-08-22 Thread Ilya Sokolov
Hi! I have an issue parsing XML containing Unicode strings with surrogate characters (Xerces 2.11.0). The following exception is thrown: org.xml.sax.SAXParseException; lineNumber: 1; columnNumber: 18; Character reference "�" is an invalid XML character. at org.apache.xerces.pa