Xerces C++ defines an encoding-string that Xerces/Java refuses to parse
-----------------------------------------------------------------------

         Key: XERCESJ-1041
         URL: http://issues.apache.org/jira/browse/XERCESJ-1041
     Project: Xerces2-J
        Type: Bug
    Versions: 2.4.0    
 Environment: XercesC-2.3, XalanJ 2.4, Solaris 6
    Reporter: Dominik Stadler


We are using Xerces C++ to create XML messages that are later parsed by 
Xerces/Java.

XercesC provides a define, XMLUni::fgISO88591EncodingString, for setting the 
encoding. When it is used, the XML message declares "ISO8859-1" as its 
encoding.

When we later use Xerces/Java to parse this file, we get the following error:

[Fatal Error] :1:43: Invalid encoding name "ISO8859-1".

It seems that Xerces/Java only knows the encoding "ISO-8859-1" (with a dash), 
but not "ISO8859-1" (without dash).
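
To make the difference easy to check, here is a minimal sketch that parses the 
same tiny document once with each spelling. The class name EncodingRepro and 
the <msg/> payload are made up for illustration, and it goes through the 
generic JAXP API, so the result for "ISO8859-1" depends on which parser 
implementation and version is on the classpath.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilderFactory;
import org.xml.sax.SAXParseException;

// Hypothetical helper, not part of Xerces: parses a one-element document
// whose XML declaration carries the given encoding name.
public class EncodingRepro {

    public static String tryParse(String encodingName) throws Exception {
        String xml = "<?xml version=\"1.0\" encoding=\"" + encodingName + "\"?><msg/>";
        // The sample is pure ASCII, so these bytes are valid ISO-8859-1.
        byte[] bytes = xml.getBytes(StandardCharsets.ISO_8859_1);
        try {
            DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new ByteArrayInputStream(bytes));
            return "parsed";
        } catch (SAXParseException e) {
            // A parser that does not know the name reports a fatal error here.
            return "rejected: " + e.getMessage();
        }
    }

    public static void main(String[] args) throws Exception {
        System.out.println("ISO-8859-1 -> " + tryParse("ISO-8859-1"));
        System.out.println("ISO8859-1  -> " + tryParse("ISO8859-1"));
    }
}
```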

The XML specification states that "ISO-8859-1" (with a dash) SHOULD be used; 
see http://www.w3.org/TR/2004/REC-xml-20040204/#charencoding

So in my opinion, either Xerces C++ should no longer provide that define, or 
Xerces/Java should be enhanced to accept that encoding string. Otherwise 
XercesC and XercesJ differ on this point, although until now we assumed their 
parsing behavior was the same.
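
Until one of the two sides changes, a consumer could rewrite the declaration 
before handing the bytes to the parser. The following is only a sketch of that 
workaround, not a proposed patch: the class EncodingDeclFixer is hypothetical, 
and it assumes the declaration sits at the very start of the message and that 
only the "ISO8859-1" spelling needs fixing.

```java
import java.nio.charset.StandardCharsets;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical workaround: rewrites encoding="ISO8859-1" in the XML
// declaration to the dashed form "ISO-8859-1" and leaves the rest untouched.
public class EncodingDeclFixer {

    // Matches a leading XML declaration whose encoding is spelled "ISO8859-1".
    // [^>]*? keeps the match inside the declaration itself.
    private static final Pattern DECL = Pattern.compile(
            "^(<\\?xml\\s[^>]*?encoding\\s*=\\s*[\"'])ISO8859-1([\"'])",
            Pattern.CASE_INSENSITIVE);

    public static byte[] normalize(byte[] xml) {
        // ISO-8859-1 maps every byte to one char, so this round trip is lossless.
        String text = new String(xml, StandardCharsets.ISO_8859_1);
        Matcher m = DECL.matcher(text);
        if (!m.find()) {
            return xml; // no such declaration: return the input unchanged
        }
        String fixed = m.group(1) + "ISO-8859-1" + m.group(2)
                + text.substring(m.end());
        return fixed.getBytes(StandardCharsets.ISO_8859_1);
    }
}
```

The result of normalize(...) can then be wrapped in a ByteArrayInputStream and 
passed to the parser as usual.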

I have already reported this for XercesC as 
http://issues.apache.org/jira/browse/XERCESC-1336.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
If you want more information on JIRA, or have a bug to report see:
   http://www.atlassian.com/software/jira
