[jira] [Reopened] (PDFBOX-1498) Index Out Of Bounds Exception while reading large PDF Document

Manoj Patel (JIRA) Mon, 21 Jan 2013 22:38:16 -0800

     [ 
https://issues.apache.org/jira/browse/PDFBOX-1498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Manoj Patel reopened PDFBOX-1498:
---------------------------------


Thanks for fast reply. I am still getting same error, i have added latest files 
from svn and still issue is continue.

Below is my code

public class ReadLargeFile {
        public static void main(String[] args) throws Exception{
                File file = new File("C:\\TESTFILE.pdf");               
                FileInputStream fin = new FileInputStream(file);                
                
                PDDocument doc = PDDocument.load(fin);
                List allPages = doc.getDocumentCatalog().getAllPages();
                //Need some operation on Pages.
                System.out.println("Pages Count::" + allPages.size());
                doc.close();            
        }
}

                
> Index Out Of Bounds Exception while reading large PDF Document 
> ---------------------------------------------------------------
>
>                 Key: PDFBOX-1498
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1498
>             Project: PDFBox
>          Issue Type: Bug
>            Reporter: Manoj Patel
>            Assignee: Andreas Lehmkühler
>
> I am getting java.lang.IndexOutOfBoundsException while reading large PDF 
> document (800 mb). 
> Below is the full stack
> Exception in thread "main" org.apache.pdfbox.exceptions.WrappedIOException
>       at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:243)
>       at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1071)
>       at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1038)
>       at imageData.AddFooter.main(AddFooter.java:26)
> Caused by: java.lang.IndexOutOfBoundsException: Index: 3377, Size: 3377
>       at java.util.ArrayList.RangeCheck(ArrayList.java:547)
>       at java.util.ArrayList.get(ArrayList.java:322)
>       at 
> org.apache.pdfbox.io.RandomAccessBuffer.seek(RandomAccessBuffer.java:84)
>       at 
> org.apache.pdfbox.io.RandomAccessFileOutputStream.write(RandomAccessFileOutputStream.java:106)
>       at 
> java.io.BufferedOutputStream.flushBuffer(BufferedOutputStream.java:65)
>       at java.io.BufferedOutputStream.flush(BufferedOutputStream.java:123)
>       at java.io.FilterOutputStream.close(FilterOutputStream.java:140)
>       at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSStream(BaseParser.java:606)
>       at org.apache.pdfbox.pdfparser.PDFParser.parseObject(PDFParser.java:566)
>       at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:187)
>       ... 3 more

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Reopened] (PDFBOX-1498) Index Out Of Bounds Exception while reading large PDF Document

Reply via email to