[ 
https://issues.apache.org/jira/browse/PDFBOX-5283?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17419793#comment-17419793
 ] 

Michael Klink commented on PDFBOX-5283:
---------------------------------------

[~Schmidor],

the PDF error at the heart of this issue is a mismatch between the object 
number written in front of an indirect object and the object number implied by 
the position of the cross reference table entry that points to that object.
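
To make the mismatch concrete, here is a minimal Java sketch of such a check. 
It is not PDFBox code: the class XrefMismatchCheck and its methods are 
hypothetical names chosen for illustration, and it assumes the cross reference 
entry has already been parsed into an object number and a byte offset. Note 
that it only detects the disagreement. It cannot tell which of the two numbers 
is the correct one.

```java
import java.nio.charset.StandardCharsets;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper for illustration only; not part of PDFBox.
class XrefMismatchCheck {
    // An indirect object header such as "8 0 obj" (object number, generation, keyword).
    private static final Pattern OBJ_HEADER =
            Pattern.compile("\\s*(\\d{1,10})\\s+(\\d{1,5})\\s+obj");

    /**
     * Returns the object number written at the given byte offset,
     * or -1 if no "N G obj" header starts there.
     */
    static long objectNumberAt(byte[] pdf, int offset) {
        if (offset < 0 || offset >= pdf.length) {
            return -1;
        }
        // A short window is enough to hold the header.
        int end = Math.min(pdf.length, offset + 32);
        String head = new String(pdf, offset, end - offset, StandardCharsets.ISO_8859_1);
        Matcher m = OBJ_HEADER.matcher(head);
        return m.lookingAt() ? Long.parseLong(m.group(1)) : -1;
    }

    /**
     * True if an object header exists at xrefOffset but its written number
     * differs from the number implied by the cross reference entry.
     */
    static boolean isMismatch(byte[] pdf, long xrefObjectNumber, int xrefOffset) {
        long written = objectNumberAt(pdf, xrefOffset);
        return written != -1 && written != xrefObjectNumber;
    }

    public static void main(String[] args) {
        // Toy data: the header "8 0 obj" starts at byte offset 9.
        byte[] pdf = "%PDF-1.4\n8 0 obj\n<< >>\nendobj\n"
                .getBytes(StandardCharsets.ISO_8859_1);
        System.out.println(isMismatch(pdf, 8, 9)); // entry for object 8: consistent
        System.out.println(isMismatch(pdf, 7, 9)); // entry for object 7: mismatch
    }
}
```

Whether the written number or the position in the table should win once 
isMismatch reports true is exactly the open question in this issue.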

In the case of your file, the cross reference table position implies the 
_correct_ object number while the explicitly given one is _wrong_.

In the past, though, other broken files with such mismatches have given rise 
to issues in which the cross reference table position implied the _wrong_ 
object number while the explicitly given one was _correct_. For an example, see 
PDFBOX-2679. I've also seen that other PDF libraries have made provisions for 
such errors in the cross references.

Understandably, you'd like PDFBox to be changed to properly read your file. On 
the other hand, doing so would result in regressions for those other files.

So unless you find an easy way to determine which kind of error is present in 
a given file, and thus how to fix it correctly, PDFBox can only fix one of 
these kinds of issues. In that case it might be of interest how the error came 
to be in your PDF. Do you have any information?

In particular, on the one hand, the creator tool is claimed to be Necrona Rave. 
Have you often seen such mismatch errors in Rave PDFs?

On the other hand, the PDF contains a comment that the PDF was _Modified by 
WPCubed PDFWorkBench_. Might the error be due to that tool? And have you often 
been confronted with such mismatch errors in PDFs modified by PDFWorkBench?

> No Content - xRef / Obj Parsing
> -------------------------------
>
>                 Key: PDFBOX-5283
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5283
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 2.0.24, 3.0.0 PDFBox
>            Reporter: Oliver Schmidtmer
>            Priority: Major
>         Attachments: Lieferschein_110300.pdf
>
>
> There seems to be an issue with xRef / object reading when parsing the 
> attached pdf.
> The PDF itself has for example two objects with the ref "8 0 R":
> One at position 1967 with a "/Content" entry.
> One at position 7782 without a "/Content" entry.
> Both are referenced in the XRef Table, so there seems to be something off. 
> Probably Acrobat, etc. are using the first object, while PDFBox is using the 
> second one.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
