[ https://issues.apache.org/jira/browse/PDFBOX-1541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13605021#comment-13605021 ]
Maruan Sahyoun commented on PDFBOX-1541:
----------------------------------------

I agree that a recovery mode is needed. On the other hand, there are issues like PDFBOX-474 (invalid xref entry) that are currently handled directly in the parser. Those are the kind of issues I thought a callback could handle.

> expected='endstream' actual='' failure to parse
> -----------------------------------------------
>
>                 Key: PDFBOX-1541
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1541
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Text extraction
>    Affects Versions: 1.7.1
>         Environment: Ubuntu 12.04, JDK 1.7
>            Reporter: Jinder Aujla
>         Attachments: exporeal09_flyer_email3.pdf
>
>
> Following exception thrown when parsing attached PDF
> Caused by: java.io.IOException: expected='endstream' actual='' org.apache.pdfbox.io.PushBackInputStream@2a789924
>     at org.apache.pdfbox.pdfparser.BaseParser.parseCOSStream(BaseParser.java:597)
>     at org.apache.pdfbox.pdfparser.PDFParser.parseObject(PDFParser.java:575)
>     at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:187)