If you need to handle these characters and never use Unicode, put a declaration like this at the very top of each document:
<?xml version="1.0" encoding="ISO-8859-1"?>
But you'd have to add that declaration to every document in your database, unless there's a way to force the parser to treat the input as that encoding. (I can't remember.)
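On forcing the encoding: one approach that should work with any JAXP parser is to decode the bytes yourself and hand the parser a character stream, since a parser given a Reader does not need to guess the byte encoding. This is only a sketch, assuming the stored bytes really are ISO-8859-1; the class and method names are made up for illustration. (SAX's InputSource also has a setEncoding method for byte streams, which I have not used here.)

```java
import java.io.ByteArrayInputStream;
import java.io.InputStreamReader;
import java.io.Reader;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.xml.sax.InputSource;

// Hypothetical helper, not part of Xerces or JAXP.
class ForceEncoding {
    // Assumption: xmlBytes holds ISO-8859-1 text, with or without an XML declaration.
    static Document parseAsLatin1(byte[] xmlBytes) throws Exception {
        DocumentBuilder builder = DocumentBuilderFactory.newInstance().newDocumentBuilder();
        // Decode the bytes ourselves so the parser receives characters, not raw bytes.
        Reader reader = new InputStreamReader(
                new ByteArrayInputStream(xmlBytes), StandardCharsets.ISO_8859_1);
        return builder.parse(new InputSource(reader));
    }

    public static void main(String[] args) throws Exception {
        // A document with no encoding declaration; a parser reading the raw bytes would assume UTF-8.
        byte[] bytes = "<doc>caf\u00e9</doc>".getBytes(StandardCharsets.ISO_8859_1);
        Document doc = parseAsLatin1(bytes);
        System.out.println(doc.getDocumentElement().getTextContent());
    }
}
```

If the documents are actually in some other single-byte encoding (windows-1252, say), the charset passed to InputStreamReader would have to change to match.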
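Otherwise you should convert those characters to numeric character references (for example &#233;; named HTML entities such as &eacute; are not defined in plain XML) or to proper UTF-8 multi-byte sequences. There are many programs that will automate this.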
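A rough sketch of the character-reference conversion, in case you want to do it in your own code before saving. It assumes the non-ASCII characters appear only in text content or attribute values (character references are not allowed in element names and are not expanded inside comments or CDATA sections). The class and method names are invented for this example.

```java
// Hypothetical helper: rewrites every non-ASCII character as a decimal
// numeric character reference so the result is pure ASCII.
class NumericRefs {
    static String escapeNonAscii(String text) {
        StringBuilder out = new StringBuilder(text.length());
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);          // handles surrogate pairs
            if (cp < 0x80) {
                out.append((char) cp);             // ASCII passes through unchanged
            } else {
                out.append("&#").append(cp).append(';');
            }
            i += Character.charCount(cp);
        }
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(escapeNonAscii("caf\u00e9")); // prints caf&#233;
    }
}
```

Since the output is plain ASCII, it reads the same whether the parser assumes UTF-8 or ISO-8859-1.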
Swiftpaw Foxyshadis, wildlife artist [EMAIL PROTECTED] | http://foxyshadis.dyndns.org/
From: "susan atmaja" <[EMAIL PROTECTED]>
I have a question about which characters are supported when an XML document is saved to and retrieved from a database as a BLOB.
I saved an XML Document as a BLOB in db2. I can retrieve the BLOB from the db as an array of bytes.
I aim to regenerate the Document from the array of bytes.
The XML document contains characters such as "�" and "�". I can save the document in the db without error. I can also retrieve it as an array of bytes successfully. However, when I tried to regenerate the Document from it, I got the following error: invalid byte 2 of 3-byte UTF-8 sequence (0x61)
Is there any limitation on the character set supported in an XML document retrieved from the db? If there is, what is the supported set? Please help.
FYI, I am using Xerces 1.2.3 with Xalan 1.2.2. Following is the code I use to regenerate the Document from the array of bytes:
ByteArrayInputStream bInStream = new ByteArrayInputStream(cxmlBytes);
DocumentBuilderFactory dFactory = DocumentBuilderFactory.newInstance();
DocumentBuilder dBuilder = dFactory.newDocumentBuilder();
Document doc = dBuilder.parse(bInStream);
Please advise. Thanks.
-Susan-
