Johannes Wirkkala Westlund created PDFBOX-5405:
--------------------------------------------------
Summary: "Page tree root must be a dictionary" when attempting to
parse pdf
Key: PDFBOX-5405
URL: https://issues.apache.org/jira/browse/PDFBOX-5405
Project: PDFBox
Issue Type: Bug
Affects Versions: 2.0.25
Reporter: Johannes Wirkkala Westlund
Attachments: Grafiska riktlinjer, fordon LRV.pdf
Hi,
I have a PDF file that throws the following error when I try to parse it:
{{
}}
{code:java}
Caused by: java.io.IOException: Page tree root must be a dictionary
at org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:198)
at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:226)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1228)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1202)
at org.apache.tika.parser.pdf.PDFParser.getPDDocument(PDFParser.java:191)
at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:149)
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:289)
... 5 more {code}
{{}}
I have attached the file in question with this issue.[^Grafiska riktlinjer,
fordon LRV.pdf]
--
This message was sent by Atlassian Jira
(v8.20.1#820001)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]