Hi,

I am facing issue while reading larger size pdf document which is around 700 
mb. I am using latest build and its giving
below mentioned error

Exception in thread "main" org.apache.pdfbox.exceptions.WrappedIOException
    at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:243)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1071)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1038)
    at imageData.ReadLargeFile.main(ReadLargeFile.java:13)
Caused by: java.lang.OutOfMemoryError: Java heap space
    at java.io.BufferedOutputStream.<init>(BufferedOutputStream.java:59)
    at org.apache.pdfbox.cos.COSStream.createFilteredStream(COSStream.java:415)
    at 
org.apache.pdfbox.pdfparser.BaseParser.parseCOSStream(BaseParser.java:452)
    at org.apache.pdfbox.pdfparser.PDFParser.parseObject(PDFParser.java:566)
    at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:187)
    ... 3 more


If i use loadNonSeq to load document its gives error 


Exception in thread "main" java.lang.OutOfMemoryError: Java heap space
    at java.lang.String.substring(String.java:1940)
    at java.lang.String.subSequence(String.java:1973)
    at java.util.regex.Pattern.split(Pattern.java:1002)
    at java.lang.String.split(String.java:2293)
    at java.lang.String.split(String.java:2335)
    at org.apache.pdfbox.pdfparser.PDFParser.parseXrefTable(PDFParser.java:725)
    at 
org.apache.pdfbox.pdfparser.NonSequentialPDFParser.initialParse(NonSequentialPDFParser.java:296)
    at 
org.apache.pdfbox.pdfparser.NonSequentialPDFParser.parse(NonSequentialPDFParser.java:617)
    at org.apache.pdfbox.pdmodel.PDDocument.loadNonSeq(PDDocument.java:1124)
    at org.apache.pdfbox.pdmodel.PDDocument.loadNonSeq(PDDocument.java:1107)
    at imageData.ReadLargeFile.main(ReadLargeFile.java:13)


Thanks
                                          

Reply via email to