[ https://issues.apache.org/jira/browse/XERCESC-1663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Scott Cantor updated XERCESC-1663: ---------------------------------- Fix Version/s: (was: 2.9.0) > IconvGNU and IconvFBSD based transcoders assume UCS-2 as XMLCh encoding > ----------------------------------------------------------------------- > > Key: XERCESC-1663 > URL: https://issues.apache.org/jira/browse/XERCESC-1663 > Project: Xerces-C++ > Issue Type: Bug > Components: Utilities > Affects Versions: 2.7.0 > Environment: any > Reporter: Boris Kolpackov > Assignee: Alberto Massari > Priority: Blocker > Fix For: 3.0.0 > > > I was studying the code in IconvGNU and IconvFBSD transcoders and it appears > that they assume UCS-2 is the encoding for XMLCh when it's actually UTF-16. I > believe this can result in the loss of data. > The encoding that is used for XMLCh is stored in the fUnicodeCP variable > which is initialized in the Iconv{GNU,FBSD}TransServices c-tor. The > initialization code just tries all encodings from the > gIconv{GNU,FBSD}Encodings array which for GNU contains only UCS-2 and for > FreeBSD contains UCS-2 and UCS-4 encodings. > I tried to add a UTF-16LE to this array (as a first item) and it works fine > for GNU (I double checked that UTF-16LE ends up in fUnicodeCP). Does anybody > knows what's going on here? Should we add UTF-16 to these arrays? -- This message was sent by Atlassian JIRA (v7.6.3#76005) --------------------------------------------------------------------- To unsubscribe, e-mail: c-dev-unsubscr...@xerces.apache.org For additional commands, e-mail: c-dev-h...@xerces.apache.org