Hi, I am using tika 1.3 for parsing the pdf but I am getting error for one of my pdf file. below is the error. pdfbox 1.3.1 java.io.IOException: expected='obj' actual='655' org.apache.pdfbox.io.PushBackInputStream@fe7591 at org.apache.pdfbox.pdfparser.PDFParser.parseObject(PDFParser.java:511) at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:172) at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:859) at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:826) at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:53)
Please help me to solve this issue. Thanks Uday Venkatadasari Senior Consultant | Avalon Consulting, LLC <http://www.avalonconsult.com/>P: 703 635 3302 | M: 631 332 1595 LinkedIn <http://www.linkedin.com/company/avalon-consulting-llc> | Google+ <http://www.google.com/+AvalonConsultingLLC> | Twitter <https://twitter.com/avalonconsult> ------------------------------------------------------------------------------------------------------------- This message (including any attachments) contains confidential information intended for a specific individual and purpose, and is protected by law. If you are not the intended recipient, you should delete this message. Any disclosure, copying, or distribution of this message, or the taking of any action based on it, is strictly prohibited.