[ https://issues.apache.org/jira/browse/PDFBOX-5372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tilman Hausherr updated PDFBOX-5372:
------------------------------------
    Affects Version/s: 2.0.25

> *LOADS of* "WARNING: key node000xxxxx already exists in destination IDTree"
> ---------------------------------------------------------------------------
>
>                 Key: PDFBOX-5372
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5372
>             Project: PDFBox
>          Issue Type: Bug
>    Affects Versions: 2.0.25
>            Reporter: nunop5
>            Priority: Major
>
> Hello,
> I've been using PDFBox for a very long time, and it's been working very well 👍
>
> However, for 2 sets of PDFs I've been trying to merge over the past day, I
> always get LOADS of these warnings, followed by an "out of memory" crash at
> the end.
> I believe (but could be wrong) that the "out of memory" is due to these
> endless warnings.
> (heap is currently 15g)
>
> But does anyone know what could be causing these warnings? (Is there anything
> specific you'd suggest I look at in these PDFs?)
>
> Thank you very much!
>
> (... LOADS of these, redacted ...)
> Jan 31, 2022 2:41:48 AM org.apache.pdfbox.multipdf.PDFMergerUtility mergeIDTree
> WARNING: key node00018714 already exists in destination IDTree
> Jan 31, 2022 2:41:48 AM org.apache.pdfbox.multipdf.PDFMergerUtility mergeIDTree
> WARNING: key node00018715 already exists in destination IDTree
> Jan 31, 2022 2:41:48 AM org.apache.pdfbox.multipdf.PDFMergerUtility mergeIDTree
> WARNING: key node00018716 already exists in destination IDTree
> Jan 31, 2022 2:41:48 AM org.apache.pdfbox.multipdf.PDFMergerUtility mergeIDTree
> WARNING: key node00018717 already exists in destination IDTree
> Jan 31, 2022 2:41:48 AM org.apache.pdfbox.multipdf.PDFMergerUtility mergeIDTree
> WARNING: key node00018718 already exists in destination IDTree
> Exception in thread "main" java.lang.OutOfMemoryError: Java heap space
>         at org.apache.pdfbox.io.ScratchFileBuffer.addPage(ScratchFileBuffer.java:132)
>         at org.apache.pdfbox.io.ScratchFileBuffer.ensureAvailableBytesInPage(ScratchFileBuffer.java:184)
>         at org.apache.pdfbox.io.ScratchFileBuffer.write(ScratchFileBuffer.java:236)
>         at org.apache.pdfbox.io.RandomAccessOutputStream.write(RandomAccessOutputStream.java:46)
>         at org.apache.pdfbox.cos.COSStream$2.write(COSStream.java:281)
>         at org.apache.pdfbox.io.IOUtils.copy(IOUtils.java:70)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:127)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:109)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:146)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:146)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:146)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:109)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:117)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:146)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:109)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:117)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:146)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:109)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:117)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:146)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:109)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:146)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:109)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:146)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:109)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:146)
>         at org.apache.pdfbox.multipdf.PDFMergerUtility.appendDocument(PDFMergerUtility.java:800)
>         at org.apache.pdfbox.multipdf.PDFMergerUtility.legacyMergeDocuments(PDFMergerUtility.java:459)
>         at org.apache.pdfbox.multipdf.PDFMergerUtility.mergeDocuments(PDFMergerUtility.java:346)
>         at org.apache.pdfbox.tools.PDFMerger.merge(PDFMerger.java:70)
>         at org.apache.pdfbox.tools.PDFMerger.main(PDFMerger.java:49)
>         at org.apache.pdfbox.tools.PDFBox.main(PDFBox.java:81)
>
> –
>
> EDIT - I increased the heap, and it seems I got something new too:
>
> Jan 31, 2022 10:50:46 AM org.apache.pdfbox.multipdf.PDFMergerUtility mergeIDTree
> WARNING: key node00018714 already exists in destination IDTree
> Jan 31, 2022 10:51:29 AM org.apache.pdfbox.cos.COSDocument finalize
> *WARNING: Warning: You did not close a PDF Document*
> Exception in thread "main" java.lang.OutOfMemoryError: Java heap space
>
> I'm not sure whether this means something is actually wrong with one of the
> PDFs, or whether it is a consequence of running out of memory.
> (And how do I know which PDF it's referring to? :) )
> I'm using Puppeteer to generate the PDFs, so I assume it generated them
> correctly... (also, it fails consistently even if I re-generate all of them)

--
This message was sent by Atlassian Jira
(v8.20.1#820001)
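
One way to test the reporter's guess that the warnings themselves contribute to the crash is to silence them and see whether the merge still runs out of memory. The sketch below is only an illustration under two assumptions: that the warnings reach the console through java.util.logging (the timestamp layout in the quoted log looks like its default formatter), and that the logger is named after the class shown in those log lines. The class name `QuietMergerLogging` and the method `silenceMergerWarnings` are hypothetical. It does not address the duplicate IDTree keys or the memory use of the merge itself.

```java
import java.util.logging.Level;
import java.util.logging.Logger;

public class QuietMergerLogging {
    // Assumption: the logger name equals the class name printed in the quoted log lines.
    static final String MERGER_LOGGER_NAME = "org.apache.pdfbox.multipdf.PDFMergerUtility";

    // Keep a strong reference: java.util.logging holds loggers weakly, so a level set
    // on an otherwise unreferenced logger can be lost after garbage collection.
    private static final Logger MERGER_LOGGER = Logger.getLogger(MERGER_LOGGER_NAME);

    // Raise the threshold so WARNING records from this logger are dropped.
    static void silenceMergerWarnings() {
        MERGER_LOGGER.setLevel(Level.SEVERE);
    }

    public static void main(String[] args) {
        silenceMergerWarnings();
        // The merge would be started here, after the level has been raised.
        System.out.println(MERGER_LOGGER_NAME + " level: " + MERGER_LOGGER.getLevel());
    }
}
```

When the merge is started from the command line, as in the quoted stack trace, the same threshold could instead be set in a logging properties file containing `org.apache.pdfbox.multipdf.PDFMergerUtility.level = SEVERE`, passed to the JVM with `-Djava.util.logging.config.file=...`. If the OutOfMemoryError persists with the warnings silenced, the logging is unlikely to be the cause.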