[ 
https://issues.apache.org/jira/browse/PDFBOX-5216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17364651#comment-17364651
 ] 

Tilman Hausherr commented on PDFBOX-5216:
-----------------------------------------

The images on the first page are the same, but it's the same reference (6635) 
so removing one reference doesn't change much. I have no idea what Adobe did to 
optimize. You could also try to split both files (with the PDFSplit tool), then 
compare the size of the individual pages to see if there is a difference there. 
(Might not be if they did indeed found duplicate stuff that goes on different 
pages)

> Is there a way to optimize by cleaning up duplicate objects?
> ------------------------------------------------------------
>
>                 Key: PDFBOX-5216
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5216
>             Project: PDFBox
>          Issue Type: Wish
>            Reporter: yoonho
>            Priority: Major
>         Attachments: samepage.png, 스크린샷 2021-06-15 오후 2.02.21.png
>
>
> Is there a way to clean up duplicate objects using PDFBox?
> [http://gofile.me/4hSqO/Cis33w0Sa] - Original
> [http://gofile.me/4hSqO/7XKmWqUBB]  - Clean version
> I applied the Adobe DC's Optimize option (relevant in the attached file). As 
> a result, a 48mb PDF file was reduced to 19mb. I think this is due to 
> cleaning up duplicate objects in the PDF.
> Am I right? I would like to implement this process with PDFBox. How should I 
> approach it?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to