[ https://issues.apache.org/jira/browse/PDFBOX-1498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Manoj Patel reopened PDFBOX-1498: --------------------------------- Thanks for fast reply. I am still getting same error, i have added latest files from svn and still issue is continue. Below is my code public class ReadLargeFile { public static void main(String[] args) throws Exception{ File file = new File("C:\\TESTFILE.pdf"); FileInputStream fin = new FileInputStream(file); PDDocument doc = PDDocument.load(fin); List allPages = doc.getDocumentCatalog().getAllPages(); //Need some operation on Pages. System.out.println("Pages Count::" + allPages.size()); doc.close(); } } > Index Out Of Bounds Exception while reading large PDF Document > --------------------------------------------------------------- > > Key: PDFBOX-1498 > URL: https://issues.apache.org/jira/browse/PDFBOX-1498 > Project: PDFBox > Issue Type: Bug > Reporter: Manoj Patel > Assignee: Andreas Lehmkühler > > I am getting java.lang.IndexOutOfBoundsException while reading large PDF > document (800 mb). > Below is the full stack > Exception in thread "main" org.apache.pdfbox.exceptions.WrappedIOException > at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:243) > at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1071) > at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1038) > at imageData.AddFooter.main(AddFooter.java:26) > Caused by: java.lang.IndexOutOfBoundsException: Index: 3377, Size: 3377 > at java.util.ArrayList.RangeCheck(ArrayList.java:547) > at java.util.ArrayList.get(ArrayList.java:322) > at > org.apache.pdfbox.io.RandomAccessBuffer.seek(RandomAccessBuffer.java:84) > at > org.apache.pdfbox.io.RandomAccessFileOutputStream.write(RandomAccessFileOutputStream.java:106) > at > java.io.BufferedOutputStream.flushBuffer(BufferedOutputStream.java:65) > at java.io.BufferedOutputStream.flush(BufferedOutputStream.java:123) > at java.io.FilterOutputStream.close(FilterOutputStream.java:140) > at > org.apache.pdfbox.pdfparser.BaseParser.parseCOSStream(BaseParser.java:606) > at org.apache.pdfbox.pdfparser.PDFParser.parseObject(PDFParser.java:566) > at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:187) > ... 3 more -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira