[ https://issues.apache.org/jira/browse/PDFBOX-3338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15271099#comment-15271099 ]
Tilman Hausherr commented on PDFBOX-3338: ----------------------------------------- I thought he was using a JDK7 or JDK8 concept. So no problem there. The encodedByteAlign option means that after reading all the bits of a line, you have to skip bits until reaching a byte boundary. Maybe this can be done in decodeRow(). > CCITT Fax decoder fails > ----------------------- > > Key: PDFBOX-3338 > URL: https://issues.apache.org/jira/browse/PDFBOX-3338 > Project: PDFBox > Issue Type: Bug > Affects Versions: 1.8.12, 2.0.1 > Reporter: Petr Slaby > Attachments: 1.tiff, TestCCITTFaxDecoder.java > > > I have a PDF which does not render in PDFBox. It contains pages from a > scanner, encoded as CCITT Fax Tiffs. On each page, the decoder always runs > into IOException("TIFFFaxDecoder: EOL encountered in black run.") (or the > same message just with "white" instead of "black"). Unfortunately, the PDF > contains sensitive data and I cannot share it. > As a test, I have replaced the TIFFFaxDecoder by the class > CCITTFaxDecoderStream from the Twelve Monkeys ImageIO library. All worked > fine after that and PDFToImage produced the expected result. > I have extracted the first few bytes of the TIFF to show the problem without > sharing the confidential content. See the attached test program and test file. > I have tested this against latest trunk version of PDFBox, but I think the > decoder implementation is basically the same in all versions. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org