Hi,

I am using tika 1.3 for parsing the pdf but I am getting error for one of
my pdf file. below is the error.
pdfbox 1.3.1
java.io.IOException: expected='obj' actual='655'
org.apache.pdfbox.io.PushBackInputStream@fe7591
at org.apache.pdfbox.pdfparser.PDFParser.parseObject(PDFParser.java:511)
at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:172)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:859)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:826)
at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:53)

Please help me to solve this issue.

Thanks
Uday Venkatadasari
Senior Consultant | Avalon Consulting, LLC
<http://www.avalonconsult.com/>P: 703 635 3302 | M: 631 332 1595
LinkedIn <http://www.linkedin.com/company/avalon-consulting-llc> | Google+
<http://www.google.com/+AvalonConsultingLLC> | Twitter
<https://twitter.com/avalonconsult>
-------------------------------------------------------------------------------------------------------------
This message (including any attachments) contains confidential information
intended for a specific individual and purpose, and is protected by law. If
you are not the intended recipient, you should delete this message. Any
disclosure, copying, or distribution of this message, or the taking of any
action based on it, is strictly prohibited.

Reply via email to