[ https://issues.apache.org/jira/browse/PDFBOX-562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Matt Hicks updated PDFBOX-562:
------------------------------
Description:
The following exception is thrown when attempting to process certain PDFs.
This occurs in the latest code from SVN and causes PDFBox to be useless in any
production processing system since not all PDFs are supported. I have uploaded
an example PDF that reproduces the problem here:
http://captiveimagination.com/download/test02.pdf
This occurs upon invocation of PDDocument.load(...):
Exception in thread "main" org.apache.pdfbox.exceptions.WrappedIOException
at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:237)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:841)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:808)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:733)
at test.Test.main(Test.java:21)
Caused by: java.lang.ClassCastException: org.apache.pdfbox.cos.COSDictionary cannot be cast to org.apache.pdfbox.cos.COSStream
at org.apache.pdfbox.cos.COSDocument.dereferenceObjectStreams(COSDocument.java:456)
at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:207)
... 4 more
was:
The following exception is thrown when attempting to process certain PDFs.
This occurs in the latest code from SVN and causes PDFBox to be useful in any
production processing system since not all PDFs are supported. I have uploaded
an example PDF that reproduces the problem here:
http://captiveimagination.com/download/test02.pdf
This occurs upon invocation of PDFDocument.load(...):
Exception in thread "main" org.apache.pdfbox.exceptions.WrappedIOException
at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:237)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:841)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:808)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:733)
at test.Test.main(Test.java:21)
Caused by: java.lang.ClassCastException: org.apache.pdfbox.cos.COSDictionary cannot be cast to org.apache.pdfbox.cos.COSStream
at org.apache.pdfbox.cos.COSDocument.dereferenceObjectStreams(COSDocument.java:456)
at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:207)
... 4 more
Correction
> Some PDFs cannot be processed yet are valid PDFs
> ------------------------------------------------
>
> Key: PDFBOX-562
> URL: https://issues.apache.org/jira/browse/PDFBOX-562
> Project: PDFBox
> Issue Type: Bug
> Components: Parsing
> Affects Versions: 0.7.0, 0.7.1, 0.7.2, 0.7.3, 0.8.0-incubator, 1.0.0
> Environment: Windows 7, JDK 1.6, Eclipse
> Reporter: Matt Hicks
> Priority: Critical
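For readers following the trace: the ClassCastException suggests that an object was cast to COSStream when it was only a COSDictionary. The sketch below is not PDFBox code and not a proposed patch; it only illustrates that failure mode and one possible guard. Dict and Stream are hypothetical stand-ins for COSDictionary and COSStream (in PDFBox, COSStream extends COSDictionary), and whether such objects should be skipped or reported is an assumption, not something this issue establishes.

```java
import java.util.ArrayList;
import java.util.List;

public class ObjectStreamGuard {
    // Hypothetical stand-ins for COSDictionary and COSStream (not the PDFBox classes).
    static class Dict { }
    static class Stream extends Dict { }

    /** Returns only the objects that really are streams, skipping plain dictionaries. */
    static List<Stream> streamsOnly(List<Dict> objects) {
        List<Stream> result = new ArrayList<Stream>();
        for (Dict obj : objects) {
            // Check the runtime type before casting instead of casting blindly.
            if (obj instanceof Stream) {
                result.add((Stream) obj);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        List<Dict> objects = new ArrayList<Dict>();
        objects.add(new Stream());
        objects.add(new Dict());
        // A blind cast such as (Stream) objects.get(1) would throw ClassCastException.
        System.out.println(streamsOnly(objects).size()); // prints 1
    }
}
```

A guard like this avoids the crash but may hide a malformed or misread object, so logging the skipped entry would probably be worth considering in a real fix.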