[
https://issues.apache.org/jira/browse/PDFBOX-4943?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17187915#comment-17187915
]
Tilman Hausherr edited comment on PDFBOX-4943 at 8/31/20, 5:48 PM:
-------------------------------------------------------------------
No, I don't.
Another idea: open the PDF in an editor like Notepad++ and look for
"/StructTreeRoot ". If it is there (i.e. it isn't encoded itself) and is
followed by text like "<number> 0 R", then replace the entire entry with blanks
of the same length. If the modified PDF file opens without trouble, then
do this overwriting programmatically.
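The programmatic overwrite could look roughly like the minimal Java sketch
below. It is not part of PDFBox: the class name StructTreeRootBlanker, the
method blankStructTreeRoot and the command-line arguments are hypothetical.
It assumes the entry is stored unencoded as "/StructTreeRoot <number>
<generation> R" and that the file fits in memory. The entry is overwritten
with spaces of the same length, so the byte offsets in the file do not move.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class StructTreeRootBlanker {

    // Assumed form of an unencoded entry, e.g. "/StructTreeRoot 123 0 R".
    private static final Pattern ENTRY =
            Pattern.compile("/StructTreeRoot\\s+\\d+\\s+\\d+\\s+R");

    /**
     * Overwrites every matching entry with spaces, in place, and returns
     * the number of entries found. The array length never changes.
     */
    public static int blankStructTreeRoot(byte[] pdf) {
        // ISO-8859-1 maps each byte to exactly one char, so the match
        // indices in the string are also valid indices into the byte array.
        String text = new String(pdf, StandardCharsets.ISO_8859_1);
        Matcher m = ENTRY.matcher(text);
        int count = 0;
        while (m.find()) {
            for (int i = m.start(); i < m.end(); i++) {
                pdf[i] = ' ';
            }
            count++;
        }
        return count;
    }

    public static void main(String[] args) throws IOException {
        // Hypothetical usage: java StructTreeRootBlanker in.pdf out.pdf
        Path in = Paths.get(args[0]);
        Path out = Paths.get(args[1]);
        byte[] pdf = Files.readAllBytes(in);
        int n = blankStructTreeRoot(pdf);
        // Write to a separate file so the original stays untouched.
        Files.write(out, pdf);
        System.out.println("Blanked entries: " + n);
    }
}
```

If the count is 0, the entry is probably stored in encoded form and this
approach does not apply. As with the manual test, check that the output file
still opens before relying on it.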
> PDF Merge of large document, memory usage?
> ------------------------------------------
>
> Key: PDFBOX-4943
> URL: https://issues.apache.org/jira/browse/PDFBOX-4943
> Project: PDFBox
> Issue Type: Bug
> Components: Utilities
> Affects Versions: 2.0.21
> Environment: Windows, java x64
> Reporter: Richard Stafford
> Priority: Major
>
> We are trying to use PDFMergerUtility to merge a number of PDF files, but
> one specific PDF file exhausts the Java heap. This file is about 50 MB, but
> merging with it requires a Java heap of 8 GB. We've tried using
> setupTempFileOnly(), but that doesn't seem to help.
> Looking further, just doing a PDDocument.load() operation for this file uses
> 3.5 GB of heap, regardless of the MemoryUsageSetting values.
> For instance, take the following main():
> public static void main(String arg[])
> {
>     try
>     {
>         // load the document
>         PDDocument sourceDoc = PDDocument.load(
>                 new File("c:\\tmp\\testfile.PDF"),
>                 MemoryUsageSetting.setupTempFileOnly().setTempDir(new File("c:\\tmp")));
>         sourceDoc.close();
>         LogMessage.log("Completed");
>     }
>     catch (Exception e)
>     {
>         LogMessage.log("Exception in document load: " + e.toString());
>     }
> }
> With a breakpoint set at sourceDoc.close(), the heap has already grown to
> 3.5 GB.
>
> Our test case can be downloaded from:
> [https://s3.amazonaws.com/webdl.equorum.com/misc/testfile.pdf]
>
> Thanks,
> Rich Stafford
> Chief Scientist
> eQuorum Corporation