[ https://issues.apache.org/jira/browse/PDFBOX-1498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13559514#comment-13559514 ]
Andreas Lehmkühler commented on PDFBOX-1498: -------------------------------------------- The described IndexOutOfBounds issue isn't related to the size of the pdf (see PDFBOX-1490, the pdf in question has a size of 102Kb) Are you sure that - you have the latest code? - you are really using the new compiled code? - the issue/stacktrace is the same? > Index Out Of Bounds Exception while reading large PDF Document > --------------------------------------------------------------- > > Key: PDFBOX-1498 > URL: https://issues.apache.org/jira/browse/PDFBOX-1498 > Project: PDFBox > Issue Type: Bug > Reporter: Manoj Patel > Assignee: Andreas Lehmkühler > > I am getting java.lang.IndexOutOfBoundsException while reading large PDF > document (800 mb). > Below is the full stack > Exception in thread "main" org.apache.pdfbox.exceptions.WrappedIOException > at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:243) > at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1071) > at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1038) > at imageData.AddFooter.main(AddFooter.java:26) > Caused by: java.lang.IndexOutOfBoundsException: Index: 3377, Size: 3377 > at java.util.ArrayList.RangeCheck(ArrayList.java:547) > at java.util.ArrayList.get(ArrayList.java:322) > at > org.apache.pdfbox.io.RandomAccessBuffer.seek(RandomAccessBuffer.java:84) > at > org.apache.pdfbox.io.RandomAccessFileOutputStream.write(RandomAccessFileOutputStream.java:106) > at > java.io.BufferedOutputStream.flushBuffer(BufferedOutputStream.java:65) > at java.io.BufferedOutputStream.flush(BufferedOutputStream.java:123) > at java.io.FilterOutputStream.close(FilterOutputStream.java:140) > at > org.apache.pdfbox.pdfparser.BaseParser.parseCOSStream(BaseParser.java:606) > at org.apache.pdfbox.pdfparser.PDFParser.parseObject(PDFParser.java:566) > at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:187) > ... 3 more -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira