Some PDFs cannot be processed yet are valid PDFs
------------------------------------------------

                 Key: PDFBOX-562
                 URL: https://issues.apache.org/jira/browse/PDFBOX-562
             Project: PDFBox
          Issue Type: Bug
          Components: Parsing
    Affects Versions: 0.8.0-incubator, 0.7.3, 0.7.2, 0.7.1, 0.7.0, 1.0.0
         Environment: Windows 7, JDK 1.6, Eclipse
            Reporter: Matt Hicks
            Priority: Critical


The following exception is thrown when attempting to process certain PDFs.
This occurs in the latest code from SVN and makes PDFBox unusable in any
production processing system, since not all valid PDFs are supported.  I have
uploaded an example PDF that reproduces the problem here:
http://captiveimagination.com/download/test02.pdf

This occurs upon invocation of PDDocument.load(...):

Exception in thread "main" org.apache.pdfbox.exceptions.WrappedIOException
        at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:237)
        at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:841)
        at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:808)
        at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:733)
        at test.Test.main(Test.java:21)
Caused by: java.lang.ClassCastException: org.apache.pdfbox.cos.COSDictionary cannot be cast to org.apache.pdfbox.cos.COSStream
        at org.apache.pdfbox.cos.COSDocument.dereferenceObjectStreams(COSDocument.java:456)
        at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:207)
        ... 4 more
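
The "Caused by" line points at a downcast applied to an object that is only a
dictionary.  The sketch below reproduces that failure mode in isolation and
shows a guarded alternative.  It is an illustration under stated assumptions,
not the PDFBox code: Dictionary, Stream, uncheckedCast and guardedCast are
hypothetical stand-ins for COSDictionary, COSStream and whatever
dereferenceObjectStreams does at COSDocument.java:456, and it relies only on
the fact that COSStream is a subclass of COSDictionary.

```java
class CastGuardSketch {
    // Hypothetical stand-ins for the COS types named in the trace.
    // Assumption mirrored from PDFBox: the stream type extends the dictionary type.
    static class Dictionary {}
    static class Stream extends Dictionary {}

    // Unchecked downcast: throws ClassCastException when d is a plain Dictionary.
    static Stream uncheckedCast(Dictionary d) {
        return (Stream) d;
    }

    // Guarded variant: returns null instead of throwing, so the caller can
    // skip the object or report a clearer error.
    static Stream guardedCast(Dictionary d) {
        return (d instanceof Stream) ? (Stream) d : null;
    }

    // Helper that reports whether the unchecked cast fails for a given object.
    static boolean uncheckedCastThrows(Dictionary d) {
        try {
            uncheckedCast(d);
            return false;
        } catch (ClassCastException e) {
            return true;
        }
    }

    public static void main(String[] args) {
        Dictionary plain = new Dictionary();
        Dictionary stream = new Stream();
        System.out.println("unchecked cast of plain dictionary throws: "
                + uncheckedCastThrows(plain));
        System.out.println("guarded cast of plain dictionary is null: "
                + (guardedCast(plain) == null));
        System.out.println("guarded cast of stream is non-null: "
                + (guardedCast(stream) != null));
    }
}
```

Whether skipping such an object or raising a more descriptive error is the
right behavior for the parser is left open here; the sketch only shows where a
type check would stop the ClassCastException.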


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
