[
https://issues.apache.org/jira/browse/PDFBOX-2887?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14630323#comment-14630323
]
Tilman Hausherr edited comment on PDFBOX-2887 at 7/16/15 9:13 PM:
------------------------------------------------------------------
The xref value points to a page object, where it expects an object stream. We
have a repair mechanism, but that fails because xref points to a valid object
and not "at some random stuff".
was (Author: tilman):
The xref value points to a page object, where it expects an object stream.
> NPE in PDFXrefStreamParser in 2.0 trunk
> ---------------------------------------
>
> Key: PDFBOX-2887
> URL: https://issues.apache.org/jira/browse/PDFBOX-2887
> Project: PDFBox
> Issue Type: Bug
> Components: Parsing
> Affects Versions: 2.0.0
> Reporter: Tim Allison
> Priority: Trivial
> Attachments: 746140.pdf
>
>
> {noformat}
> java.lang.NullPointerException
> at
> org.apache.pdfbox.pdfparser.PDFXrefStreamParser.parse(PDFXrefStreamParser.java:89)
> at
> org.apache.pdfbox.pdfparser.COSParser.parseXrefStream(COSParser.java:1970)
> at
> org.apache.pdfbox.pdfparser.COSParser.parseXrefObjStream(COSParser.java:322)
> at org.apache.pdfbox.pdfparser.COSParser.parseXref(COSParser.java:282)
> at
> org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:184)
> at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:225)
> at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:850)
> at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:795)
> at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:121)
> {noformat}
> Content was extracted with 1.8.9.
> Might be too rare to bother fixing.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]