Re: PDFParser Error Caused by: org.apache.pdfbox.exceptions.WrappedIOException

Tilman Hausherr Sat, 07 Mar 2015 05:22:57 -0800

The best approach would be to test whether that file can be handled by newer versions of PDFBox (1.8.9 and 2.0):


https://repository.apache.org/content/groups/snapshots/org/apache/pdfbox/pdfbox/1.8.9-SNAPSHOT/
https://repository.apache.org/content/groups/snapshots/org/apache/pdfbox/pdfbox/2.0.0-SNAPSHOT/


Download the jar files and, for each one:

    - run java -jar <jarfile> ExtractText <yourfile>
    - see what happens
    - report the result here
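
If there are several jars to try, the steps above could be scripted. The following is only a rough sketch using plain JDK classes: the class name TryExtractText and its helper methods are made up for illustration, and the jar and PDF paths are whatever you pass on the command line. It runs the same "java -jar <jarfile> ExtractText <yourfile>" command once per jar and prints the exit code.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of PDFBox: runs the ExtractText
// command line from the steps above against each given jar.
public class TryExtractText {

    // Builds the command: java -jar <jarfile> ExtractText <yourfile>
    static List<String> buildCommand(String jarFile, String pdfFile) {
        return new ArrayList<>(
                Arrays.asList("java", "-jar", jarFile, "ExtractText", pdfFile));
    }

    // Starts the command, forwards its output and errors to this console,
    // and returns the exit code of the child process.
    static int run(List<String> command) throws IOException, InterruptedException {
        Process process = new ProcessBuilder(command).inheritIO().start();
        return process.waitFor();
    }

    public static void main(String[] args) throws Exception {
        if (args.length < 2) {
            System.err.println(
                    "usage: java TryExtractText <yourfile> <jarfile> [<jarfile> ...]");
            return;
        }
        String pdfFile = args[0];
        // Try every jar in turn so the results can be compared.
        for (int i = 1; i < args.length; i++) {
            List<String> command = buildCommand(args[i], pdfFile);
            System.out.println("Running: " + String.join(" ", command));
            int exitCode = run(command);
            System.out.println(args[i] + " exited with code " + exitCode);
        }
    }
}
```

Any stack trace printed by a run is the useful part to report back, together with the version of the jar that produced it.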

Your paste indicates a problem in RandomAccessBuffer.java.

Tilman

On 06.03.2015 at 21:05, ganesh.ya...@sungard.com wrote:

Hello,
I am getting a PDFParser error, caused by:
org.apache.pdfbox.exceptions.WrappedIOException
The complete stack trace is at the following link:
( http://apaste.info/DRD )

I am trying to import a 4 GB PDF into Solr using Tika. So far I have only been
able to import files up to 500 MB.

Please let me know if there is a workaround.

Thanks
G

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: users-h...@pdfbox.apache.org


