[ 
https://issues.apache.org/jira/browse/PDFBOX-5178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17655671#comment-17655671
 ] 

Andreas Lehmkühler commented on PDFBOX-5178:
--------------------------------------------

I've added support for the index of the objects within a compressed object 
stream. But it is limited to those streams where the object numbers aren't 
unique as in the given file. But this doesn't fix the issue. PDFBox 
accidentally refers to the correct object. The issue was introduced when I 
"optimized" the parser to stop reading malformed dictionaries to avoid endless 
loops. I'm still searching for the correct ticket. BTW, 2.0.24 is affected as 
well.

> Parsing differences between 2.0.23 and 2.0.24/3.0
> -------------------------------------------------
>
>                 Key: PDFBOX-5178
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5178
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 2.0.23, 3.0.0 PDFBox
>            Reporter: Tilman Hausherr
>            Assignee: Andreas Lehmkühler
>            Priority: Major
>         Attachments: poppler-704-0.pdf
>
>
> There are some weird differences in parsing the attached file, 2.0.23 shows 
> "BigTIFF.tif" in the /Contents of the first annotation and a loop at 
> Root/Pages/Kids/[0]/Annots/[0]/FS (always 14 0 R), while 3.0 doesn't have 
> that, but doesn't have "BigTIFF.tif". I'm not sure which one (if any) is 
> wrong.
>  
> UPDATE
> 2.0.24 shows the same behaviour as 3.0



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org

Reply via email to