[ https://issues.apache.org/jira/browse/COCOON3-83?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13167455#comment-13167455 ]
JBert commented on COCOON3-83: ------------------------------ The problem would appear to be in the StringGenerator inner class, in version 3.0.0-alpha 3 the problem is on line 322: {code} XMLUtils.toSax(new ByteArrayInputStream(this.xmlString.getBytes()), XMLGenerator.this.getSAXConsumer()); {code} This encodes the String to bytes using the JRE platform encoding. A SAX parser will later have trouble parsing this bytestream when the XML declares an encoding different from the platform encoding. If I look at line 149, there's this: {code} XMLUtils.toSax(new String(this.bytes, this.encoding), XMLGenerator.this.getSAXConsumer()); {code} This issue could thus be fixed by simply passing the XML String into the toSax function, like so: {code} XMLUtils.toSax(this.xmlString, XMLGenerator.this.getSAXConsumer()); {code} > Encoding bug in XMLGenerator (SAX version) when passing a String to the > constructor > ----------------------------------------------------------------------------------- > > Key: COCOON3-83 > URL: https://issues.apache.org/jira/browse/COCOON3-83 > Project: Cocoon 3 > Issue Type: Bug > Components: cocoon-sax > Affects Versions: 3.0.0-alpha-3 > Reporter: JBert > > When creating a new org.apache.cocoon.sax.component.XMLGenerator instance > with a String argument, the execute method will fail when the XML uses > non-ASCII characters. > See http://stackoverflow.com/a/8471886/983949 for more info with regards to > the code. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira