[ 
https://issues.apache.org/jira/browse/PDFBOX-5405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17514865#comment-17514865
 ] 

Tilman Hausherr commented on PDFBOX-5405:
-----------------------------------------

Your PDF is truncated.

> "Page tree root must be a dictionary" when attempting to parse pdf 
> -------------------------------------------------------------------
>
>                 Key: PDFBOX-5405
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5405
>             Project: PDFBox
>          Issue Type: Bug
>    Affects Versions: 2.0.25
>            Reporter: Johannes Wirkkala Westlund
>            Priority: Minor
>         Attachments: Grafiska riktlinjer, fordon LRV.pdf
>
>
> Hi,
> I have a PDF file that throws the following error when I try to parse it:
> {code:java}
> Caused by: java.io.IOException: Page tree root must be a dictionary
>     at org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:198)
>     at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:226)
>     at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1228)
>     at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1202)
>     at org.apache.tika.parser.pdf.PDFParser.getPDDocument(PDFParser.java:191)
>     at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:149)
>     at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:289)
>     ... 5 more {code}
> I have attached the file in question with this issue.
> Might be related to https://issues.apache.org/jira/browse/PDFBOX-4915



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org

Reply via email to