ISO-8859-1 - Invalid byte 2 of 3-byte UTF-8 sequence error with Doxia (maven plugin)
Hello, I'm trying to use the Doxia Maven plugin to automatically render .pdf from APT and i'm encountering this problem, does any of you have an idea about this ? The .apt files are encoded in ISO-8859-1 and contain various french characters (like accentued characters). I got the Invalid byte 2 of 3-byte UTF-8 sequence. error and i really don't know how to deal with ISO-8859-1 with this plugin, what should I add and where ? I've tried to add in my pom.xml the inputEncoding and outputEncoding lines but the behaviour is just the same with or without. My Doxia example.xml file doesn't contain at this time any special characters. If i set the book configuration to point to plain .apt without any special characters only, everything works fine, but as soon as I have french national characters in the apt files, i get this error. I'm using (as far as i know) the last version of the plugin (1.0-alpha-10). My POM looks like that : plugin groupIdorg.apache.maven.doxia/groupId artifactIddoxia-maven-plugin/artifactId version1.0-alpha-10/version executions execution phasepre-site/phase goals goalrender-books/goal /goals /execution /executions configuration inputEncodingiso-8859-1/inputEncoding !-- added but useless -- outputEncodingiso-8859-1/outputEncoding !-- added but useless -- books book directorysite//directory descriptorpdf/example.xml/descriptor formats format idpdf/id /format /formats /book /books /configuration /plugin Result stacktrace : [INFO] [ERROR] FATAL ERROR [INFO] [INFO] Invalid byte 2 of 3-byte UTF-8 sequence. [INFO] [INFO] Trace ExceptionConverter: org.xml.sax.SAXParseException: Invalid byte 2 of 3-byte UTF-8 sequence. at com.sun.org.apache.xerces.internal.util.ErrorHandlerWrapper.createSAXParseException(ErrorHandlerWrapper.java:236) at com.sun.org.apache.xerces.internal.util.ErrorHandlerWrapper.fatalError(ErrorHandlerWrapper.java:215) at com.sun.org.apache.xerces.internal.impl.XMLErrorReporter.reportError(XMLErrorReporter.java:386) at com.sun.org.apache.xerces.internal.impl.XMLErrorReporter.reportError(XMLErrorReporter.java:316) at com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerImpl$FragmentContentDispatcher.dispatch(XMLDocumentFragmentScannerImpl.java:1810) at com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerImpl.scanDocument(XMLDocumentFragmentScannerImpl.java:368) at com.sun.org.apache.xerces.internal.parsers.XML11Configuration.parse(XML11Configuration.java:834) at com.sun.org.apache.xerces.internal.parsers.XML11Configuration.parse(XML11Configuration.java:764) at com.sun.org.apache.xerces.internal.parsers.XMLParser.parse(XMLParser.java:148) at com.sun.org.apache.xerces.internal.parsers.AbstractSAXParser.parse(AbstractSAXParser.java:1242) at javax.xml.parsers.SAXParser.parse(SAXParser.java:375) at com.lowagie.text.xml.XmlParser.go(Unknown Source) at com.lowagie.text.xml.XmlParser.parse(Unknown Source) at com.lowagie.text.xml.XmlToXXX.parse(Unknown Source) at org.apache.maven.doxia.module.itext.ITextUtil.writePdf(ITextUtil.java:118) at org.apache.maven.doxia.book.services.renderer.PdfBookRenderer.renderXML(PdfBookRenderer.java:50) at org.apache.maven.doxia.book.services.renderer.AbstractITextBookRenderer.renderBook(AbstractITextBookRenderer.java:162) at org.apache.maven.doxia.book.DefaultBookDoxia.renderBook(DefaultBookDoxia.java:126) at org.apache.maven.doxia.plugin.DoxiaRenderBooksMojo.execute(DoxiaRenderBooksMojo.java:214) at org.apache.maven.plugin.DefaultPluginManager.executeMojo(DefaultPluginManager.java:443) at org.apache.maven.lifecycle.DefaultLifecycleExecutor.executeGoals(DefaultLifecycleExecutor.java:539) at org.apache.maven.lifecycle.DefaultLifecycleExecutor.executeStandaloneGoal(DefaultLifecycleExecutor.java:493) at org.apache.maven.lifecycle.DefaultLifecycleExecutor.executeGoal(DefaultLifecycleExecutor.java:463) at org.apache.maven.lifecycle.DefaultLifecycleExecutor.executeGoalAndHandleFailures(DefaultLifecycleExecutor.java:311) at org.apache.maven.lifecycle.DefaultLifecycleExecutor.executeTaskSegments(DefaultLifecycleExecutor.java:278) at
Re: ISO-8859-1 - Invalid byte 2 of 3-byte UTF-8 sequence error with Doxia (maven plugin)
Hi Serge, It is a known limitation in the iText plugin. The implementation uses the deprecated itext xml and the xml encoding is hardcoded to utf-8 Cheers, Vincent [1] http://www.lowagie.com/iText/tutorial/ch07.html 2007/11/9, [EMAIL PROTECTED] [EMAIL PROTECTED]: Hello, I'm trying to use the Doxia Maven plugin to automatically render .pdf from APT and i'm encountering this problem, does any of you have an idea about this ? The .apt files are encoded in ISO-8859-1 and contain various french characters (like accentued characters). I got the Invalid byte 2 of 3-byte UTF-8 sequence. error and i really don't know how to deal with ISO-8859-1 with this plugin, what should I add and where ? I've tried to add in my pom.xml the inputEncoding and outputEncoding lines but the behaviour is just the same with or without. My Doxia example.xml file doesn't contain at this time any special characters. If i set the book configuration to point to plain .apt without any special characters only, everything works fine, but as soon as I have french national characters in the apt files, i get this error. I'm using (as far as i know) the last version of the plugin (1.0-alpha-10). My POM looks like that : plugin groupIdorg.apache.maven.doxia/groupId artifactIddoxia-maven-plugin/artifactId version1.0-alpha-10/version executions execution phasepre-site/phase goals goalrender-books/goal /goals /execution /executions configuration inputEncodingiso-8859-1/inputEncoding !-- added but useless -- outputEncodingiso-8859-1/outputEncoding !-- added but useless -- books book directorysite//directory descriptorpdf/example.xml/descriptor formats format idpdf/id /format /formats /book /books /configuration /plugin Result stacktrace : [INFO] [ERROR] FATAL ERROR [INFO] [INFO] Invalid byte 2 of 3-byte UTF-8 sequence. [INFO] [INFO] Trace ExceptionConverter: org.xml.sax.SAXParseException: Invalid byte 2 of 3-byte UTF-8 sequence. at com.sun.org.apache.xerces.internal.util.ErrorHandlerWrapper.createSAXParseException(ErrorHandlerWrapper.java:236) at com.sun.org.apache.xerces.internal.util.ErrorHandlerWrapper.fatalError(ErrorHandlerWrapper.java:215) at com.sun.org.apache.xerces.internal.impl.XMLErrorReporter.reportError(XMLErrorReporter.java:386) at com.sun.org.apache.xerces.internal.impl.XMLErrorReporter.reportError(XMLErrorReporter.java:316) at com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerImpl$FragmentContentDispatcher.dispatch(XMLDocumentFragmentScannerImpl.java:1810) at com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerImpl.scanDocument(XMLDocumentFragmentScannerImpl.java:368) at com.sun.org.apache.xerces.internal.parsers.XML11Configuration.parse(XML11Configuration.java:834) at com.sun.org.apache.xerces.internal.parsers.XML11Configuration.parse(XML11Configuration.java:764) at com.sun.org.apache.xerces.internal.parsers.XMLParser.parse(XMLParser.java:148) at com.sun.org.apache.xerces.internal.parsers.AbstractSAXParser.parse(AbstractSAXParser.java:1242) at javax.xml.parsers.SAXParser.parse(SAXParser.java:375) at com.lowagie.text.xml.XmlParser.go(Unknown Source) at com.lowagie.text.xml.XmlParser.parse(Unknown Source) at com.lowagie.text.xml.XmlToXXX.parse(Unknown Source) at org.apache.maven.doxia.module.itext.ITextUtil.writePdf(ITextUtil.java:118) at org.apache.maven.doxia.book.services.renderer.PdfBookRenderer.renderXML(PdfBookRenderer.java:50) at org.apache.maven.doxia.book.services.renderer.AbstractITextBookRenderer.renderBook(AbstractITextBookRenderer.java:162) at org.apache.maven.doxia.book.DefaultBookDoxia.renderBook(DefaultBookDoxia.java:126) at org.apache.maven.doxia.plugin.DoxiaRenderBooksMojo.execute(DoxiaRenderBooksMojo.java:214) at org.apache.maven.plugin.DefaultPluginManager.executeMojo(DefaultPluginManager.java:443) at org.apache.maven.lifecycle.DefaultLifecycleExecutor.executeGoals(DefaultLifecycleExecutor.java:539) at org.apache.maven.lifecycle.DefaultLifecycleExecutor.executeStandaloneGoal(DefaultLifecycleExecutor.java:493)