[ https://issues.apache.org/jira/browse/PDFBOX-5801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17835007#comment-17835007 ]
Tilman Hausherr edited comment on PDFBOX-5801 at 4/8/24 6:00 PM:
-----------------------------------------------------------------

I got it to work by using the option "-Xss50m". I tried that after noticing that the error also happens with 2.0.31, and after adding some debug output and seeing that the numbers kept changing, i.e. they did not repeat or form a cycle, which would be typical for a stack overflow. So this is just a very complex file, and not a bug. Please try it yourself.

was (Author: tilman):
I got it to work by using the option "-Xss50m". I tried that after noticing it also happened with 2.0.31 and adding some debug output and seeing different numbers, i.e. not the same or a cycle which would be typical for a stack overflow. So this is just a very complex file.

> StackOverflowError with specific document
> -----------------------------------------
>
>                 Key: PDFBOX-5801
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5801
>             Project: PDFBox
>          Issue Type: Bug
>    Affects Versions: 2.0.31, 3.0.2 PDFBox
>         Environment: Windows 10 Pro - 19045.3208
>                      openjdk version "13.0.2" 2020-01-14
>                      Memory: -Xmx4G -Xms100m
>            Reporter: Rodrigo Schonardt
>            Priority: Major
>              Labels: stack-overflow
>         Attachments: arquivo.pdf
>
>
> When I try to merge the attached file with a cover, the following error happens:
> {quote}
> >>java.lang.StackOverflowError
> >> at java.base/sun.nio.cs.UTF_8$Decoder.decodeLoop(UTF_8.java:414)
> >> at java.base/java.nio.charset.CharsetDecoder.decode(CharsetDecoder.java:578)
> >> at java.base/java.nio.charset.CharsetDecoder.decode(CharsetDecoder.java:801)
> >> at org.apache.pdfbox.pdfparser.BaseParser.decodeBuffer(BaseParser.java:885)
> >> at org.apache.pdfbox.pdfparser.BaseParser.parseCOSName(BaseParser.java:872)
> >> at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryNameValuePair(BaseParser.java:368)
> >> at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary(BaseParser.java:289)
> >> at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:915)
> >> at org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:737)
> >> at org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:671)
> >> at org.apache.pdfbox.pdfparser.COSParser.dereferenceCOSObject(COSParser.java:625)
> >> at org.apache.pdfbox.cos.COSObject.getObject(COSObject.java:121)
> >> at org.apache.pdfbox.multipdf.PDFCloneUtility.hasSelfReference(PDFCloneUtility.java:249)
> >> at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneCOSDictionary(PDFCloneUtility.java:176)
> >> at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneCOSBaseForNewDocument(PDFCloneUtility.java:122)
> >> at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:100)
> >> at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneCOSBaseForNewDocument(PDFCloneUtility.java:110)
> >> at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:100)
> {quote}
> The code is the following; I tried different configs but none helped:
> {code:java}
> PDFMergerUtility merger = new PDFMergerUtility();
> merger.addSource( cover );
> merger.addSource( file );
> merger.setDestinationFileName( location );
> merger.mergeDocuments( null );
> {code}
> A similar error happens if I try to extract a page of the file and save it in
> a new file:
> {quote}
> >>java.lang.StackOverflowError
> >> at java.base/java.util.HashMap.putVal(HashMap.java:631)
> >> at java.base/java.util.HashMap.put(HashMap.java:612)
> >> at org.apache.pdfbox.pdfwriter.compress.COSObjectPool.put(COSObjectPool.java:80)
> >> at org.apache.pdfbox.pdfwriter.compress.COSWriterCompressionPool.addObjectToPool(COSWriterCompressionPool.java:141)
> >> at org.apache.pdfbox.pdfwriter.compress.COSWriterCompressionPool.addStructure(COSWriterCompressionPool.java:179)
> >> at org.apache.pdfbox.pdfwriter.compress.COSWriterCompressionPool.addElements(COSWriterCompressionPool.java:219)
> >> at org.apache.pdfbox.pdfwriter.compress.COSWriterCompressionPool.addStructure(COSWriterCompressionPool.java:188)
> >> at org.apache.pdfbox.pdfwriter.compress.COSWriterCompressionPool.addElements(COSWriterCompressionPool.java:219)
> {quote}
> The code is the following:
> {code:java}
> PDDocument pdf = Loader.loadPDF( file );
>
> int pageCount = pdf.getNumberOfPages();
>
> for ( int i = 0; i < pageCount; i++) {
>     PDDocument singlePageDocument = new PDDocument();
>     PDPage page = pdf.getPage(i);
>     singlePageDocument.addPage(page);
>     singlePageDocument.save( location + (i + 1) + ".pdf");
>     singlePageDocument.close();
> }
>
> pdf.close();
> {code}
> I believe this has something to do with the specific structure of the file,
> considering I was able to process much bigger files, both in page count and
> in size, without problems.
> I also tried the same file with iText and found no problem.
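
For anyone who cannot change the JVM command line: "-Xss50m" raises the default stack size for new threads, and a roughly similar effect can be had for a single task by running it in a dedicated thread created with an explicit stack size. The following is only a minimal sketch of that idea and was not discussed in the issue. The class LargeStackRunner, the helper callWithStack and the demo method depth are hypothetical names made up for this illustration, and the JVM treats the requested stack size as a hint that some platforms may ignore.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.FutureTask;

// Hypothetical helper, not part of PDFBox: runs one task in a thread
// that requests a larger stack than the JVM default.
public class LargeStackRunner {

    // Deliberately deep recursion, standing in for the deep object
    // graph traversal seen in the stack traces above.
    static int depth(int n) {
        return n == 0 ? 0 : 1 + depth(n - 1);
    }

    // Runs the task in a new thread with the requested stack size (in bytes)
    // and returns its result. The stack size is only a hint to the JVM.
    static <T> T callWithStack(Callable<T> task, long stackBytes) {
        FutureTask<T> future = new FutureTask<>(task);
        Thread worker = new Thread(null, future, "large-stack-worker", stackBytes);
        worker.start();
        try {
            return future.get();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while waiting", e);
        } catch (ExecutionException e) {
            Throwable cause = e.getCause();
            // Errors such as StackOverflowError are rethrown unchanged.
            if (cause instanceof Error) {
                throw (Error) cause;
            }
            if (cause instanceof RuntimeException) {
                throw (RuntimeException) cause;
            }
            throw new IllegalStateException(cause);
        }
    }

    public static void main(String[] args) {
        // 256 MB is an arbitrary value chosen for this demo only.
        int result = callWithStack(() -> depth(100_000), 256L * 1024 * 1024);
        System.out.println(result);
    }
}
```

With the reporter's merge code, the task would presumably be a lambda that calls merger.mergeDocuments( null ) and returns null, with a stack size of about 50L * 1024 * 1024 to mirror "-Xss50m". That combination is untested against the attached file.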