We are attempting to store data in an XML file.  This data is encoded ascii
text, and because of this, some of the characters end up falling outside the
legal limits for XML characters.  Specifically, I am getting this error:

Fatal Error at file "C:\natemail.xml", line 2, column 3275(4/23/2002
22:27:53):
Invalid character (Unicode: 0x1F)

The FAQ states that these are illegal values for XML, but it is unclear
whether this is for UTF-8 specifically, or for all encodings.

I have tried UTF-8, US-ASCII, and windows-1252 for encodings.  Additionally,
I have played around with the XMLFormatter::EscapeFlags with no success at
getting the parser to accept the characters after writing the xml with those
settings.

Additionally, I can get XMLFormatter::UnRepFlags UnRep_Fail to report the
illegal character and terminate output, but I can't seem to get a usable
input using UnRep_CharRef or UnRep_Replace both of these values usages are
very poorly documented.

My question is, how can I set Xerces C++ up to encode or translate these
"illegal" characters for output and input so that the encoding can be
transparent from the perspective of the user of the parser.  I can imagine
using a XMLFormatTarget to convert characters that are "illegal" to the file
output on export, but I don't know how I could re-assemble these encodings
back to their ASCII values at parse time so that when I read the values with
the IDOM the strings are usable.

I can't help but think I have not been the first person with the need for
representing these characters in XML.

Any help would be appreciated,
Nate Pitzer

Owner/Engineer
Deep Fried Software.
www.dfsoftware.com


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to