Hi,
I am facing an issue while reading a large PDF document, around 700 MB in
size. I am using the latest build, and it gives the error below:
Exception in thread "main" org.apache.pdfbox.exceptions.WrappedIOException
at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:243)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1071)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1038)
at imageData.ReadLargeFile.main(ReadLargeFile.java:13)
Caused by: java.lang.OutOfMemoryError: Java heap space
at java.io.BufferedOutputStream.<init>(BufferedOutputStream.java:59)
at org.apache.pdfbox.cos.COSStream.createFilteredStream(COSStream.java:415)
at org.apache.pdfbox.pdfparser.BaseParser.parseCOSStream(BaseParser.java:452)
at org.apache.pdfbox.pdfparser.PDFParser.parseObject(PDFParser.java:566)
at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:187)
... 3 more
If I use loadNonSeq to load the document, it gives this error instead:
Exception in thread "main" java.lang.OutOfMemoryError: Java heap space
at java.lang.String.substring(String.java:1940)
at java.lang.String.subSequence(String.java:1973)
at java.util.regex.Pattern.split(Pattern.java:1002)
at java.lang.String.split(String.java:2293)
at java.lang.String.split(String.java:2335)
at org.apache.pdfbox.pdfparser.PDFParser.parseXrefTable(PDFParser.java:725)
at org.apache.pdfbox.pdfparser.NonSequentialPDFParser.initialParse(NonSequentialPDFParser.java:296)
at org.apache.pdfbox.pdfparser.NonSequentialPDFParser.parse(NonSequentialPDFParser.java:617)
at org.apache.pdfbox.pdmodel.PDDocument.loadNonSeq(PDDocument.java:1124)
at org.apache.pdfbox.pdmodel.PDDocument.loadNonSeq(PDDocument.java:1107)
at imageData.ReadLargeFile.main(ReadLargeFile.java:13)
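Both traces end in "Java heap space", so the heap limit the JVM is running with is probably relevant. A small check like the one below would print it. This is only a sketch: the class name HeapInfo and the method maxHeapMiB are made up for illustration and are not part of my ReadLargeFile program.

```java
public class HeapInfo {
    /** Returns the maximum heap the JVM will attempt to use, in MiB. */
    static long maxHeapMiB() {
        // Runtime.maxMemory() reports the upper bound in bytes
        // (Long.MAX_VALUE if there is no inherent limit).
        return Runtime.getRuntime().maxMemory() / (1024L * 1024L);
    }

    public static void main(String[] args) {
        System.out.println("Max heap (MiB): " + maxHeapMiB());
    }
}
```

The limit can be changed with the standard -Xmx option when starting the JVM (for example -Xmx2g); whether that is enough for a 700 MB file is something I have not confirmed.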
Thanks