[
https://issues.apache.org/jira/browse/PDFBOX-1729?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13894364#comment-13894364
]
Jr. John commented on PDFBOX-1729:
----------------------------------
Current version has same problem 1.8.4
D:\Software\pdfbox>java -jar pdfbox-app-1.8.4.jar ConvertColorspace
rpt1390780234888753.pdf test.pdf
二月 07, 2014 4:59:11 下午 org.apache.pdfbox.pdfparser.BaseParser parseCOSStream
警告: Specified stream length 15353 is wrong. Fall back to reading stream until
'endstream'.
二月 07, 2014 4:59:11 下午 org.apache.pdfbox.pdfparser.BaseParser parseCOSStream
警告: Specified stream length 12156 is wrong. Fall back to reading stream until
'endstream'.
二月 07, 2014 4:59:11 下午 org.apache.pdfbox.pdfparser.XrefTrailerResolver
setStartxref
警告: Did not found XRef object at specified startxref position 83636
ConvertColorspace failed with the following exception:
java.io.IOException: Missing closing bracket for hex string. Reached EOS.
at
org.apache.pdfbox.pdfparser.BaseParser.parseCOSHexString(BaseParser.java:1023)
at
org.apache.pdfbox.pdfparser.BaseParser.parseCOSString(BaseParser.java:816)
at
org.apache.pdfbox.pdfparser.PDFStreamParser.parseNextToken(PDFStreamParser.java:259)
at
org.apache.pdfbox.pdfparser.PDFStreamParser.parse(PDFStreamParser.java:133)
at
org.apache.pdfbox.ConvertColorspace.replaceColors(ConvertColorspace.java:88)
at org.apache.pdfbox.ConvertColorspace.main(ConvertColorspace.java:385)
at org.apache.pdfbox.PDFBox.main(PDFBox.java:46)
> java.util.zip.DataFormatException: incorrect header check
> ---------------------------------------------------------
>
> Key: PDFBOX-1729
> URL: https://issues.apache.org/jira/browse/PDFBOX-1729
> Project: PDFBox
> Issue Type: Bug
> Components: Parsing
> Affects Versions: 2.0.0
> Environment: XP, W7
> Reporter: Tilman Hausherr
> Assignee: Andreas Lehmkühler
> Priority: Minor
>
> After parsing the file from PDFBOX-1606 with the "old" parser, and doing
> RenderUtil.convertToImage() I get this:
> 27.09.2013 16:56:06.685 ERROR [main] org.apache.pdfbox.filter.FlateFilter:134
> - FlateFilter: stop reading corrupt stream due to a DataFormatException
> 27.09.2013 16:56:06.691 ERROR [main] org.apache.pdfbox.filter.FlateFilter:134
> - FlateFilter: stop reading corrupt stream due to a DataFormatException
> 27.09.2013 16:56:06.692 ERROR [main] org.apache.pdfbox.filter.FlateFilter:134
> - FlateFilter: stop reading corrupt stream due to a DataFormatException
> 27.09.2013 16:56:06.692 ERROR [main] org.apache.pdfbox.filter.FlateFilter:134
> - FlateFilter: stop reading corrupt stream due to a DataFormatException
> IOException for file PDFBOX-1606.pdf
> 27.09.2013 16:56:06.693 ERROR [main] org.apache.pdfbox.filter.FlateFilter:134
> - FlateFilter: stop reading corrupt stream due to a DataFormatException
> java.io.IOException
> 27.09.2013 16:56:06.693 ERROR [main] org.apache.pdfbox.filter.FlateFilter:134
> - FlateFilter: stop reading corrupt stream due to a DataFormatException
> 27.09.2013 16:56:06.693 ERROR [main] org.apache.pdfbox.filter.FlateFilter:134
> - FlateFilter: stop reading corrupt stream due to a DataFormatException
> 27.09.2013 16:56:06.694 ERROR [main] org.apache.pdfbox.filter.FlateFilter:134
> - FlateFilter: stop reading corrupt stream due to a DataFormatException
> 27.09.2013 16:56:06.694 ERROR [main] org.apache.pdfbox.filter.FlateFilter:134
> - FlateFilter: stop reading corrupt stream due to a DataFormatException
> 27.09.2013 16:56:06.695 ERROR [main] org.apache.pdfbox.filter.FlateFilter:134
> - FlateFilter: stop reading corrupt stream due to a DataFormatException
> at org.apache.pdfbox.filter.FlateFilter.decode(FlateFilter.java:136)
> at org.apache.pdfbox.cos.COSStream.doDecode(COSStream.java:317)
> at org.apache.pdfbox.cos.COSStream.doDecode(COSStream.java:237)
> at
> org.apache.pdfbox.cos.COSStream.getUnfilteredStream(COSStream.java:170)
> at
> org.apache.pdfbox.pdfparser.PDFStreamParser.<init>(PDFStreamParser.java:108)
> at
> org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:235)
> at
> org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:219)
> at
> org.apache.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:199)
> at org.apache.pdfbox.pdfviewer.PageDrawer.drawPage(PageDrawer.java:153)
> at org.apache.pdfbox.util.RenderUtil.renderPage(RenderUtil.java:221)
> at org.apache.pdfbox.util.RenderUtil.convertToImage(RenderUtil.java:185)
> at org.apache.pdfbox.util.RenderUtil.convertToImage(RenderUtil.java:143)
> at pdfboxpageimageextraction.ExtractImages.doPdf(ExtractImages.java:162)
> at pdfboxpageimageextraction.ExtractImages.main(ExtractImages.java:98)
> Caused by: java.util.zip.DataFormatException: incorrect header check
> at java.util.zip.Inflater.inflateBytes(Native Method)
> at java.util.zip.Inflater.inflate(Inflater.java:259)
> at java.util.zip.Inflater.inflate(Inflater.java:280)
> at org.apache.pdfbox.filter.FlateFilter.decompress(FlateFilter.java:167)
> at org.apache.pdfbox.filter.FlateFilter.decode(FlateFilter.java:96)
> ... 13 more
> It does not happen when I use the nonSequential parser. I don't really care
> much about this, except that I can't use the file for tests because these use
> the "old" parser. I therefore suggest:
> 1) @deprecate the load() call
> 2) change TestImageIOUtils.java so that it uses the loadNonSeq call
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)