[ 
https://issues.apache.org/jira/browse/PDFBOX-1541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13605021#comment-13605021
 ] 

Maruan Sahyoun commented on PDFBOX-1541:
----------------------------------------

I agree that a recovery mode is needed. On the other hand, there are issues like 
PDFBOX-474 (invalid xref entry) that are currently handled directly in the 
parser. That's the type of issue I thought a callback could handle.
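
As a rough sketch of what such a callback might look like: every name below 
(ParseErrorHandler, ToyParser, LenientHandler, the "INVALID_XREF_ENTRY" code) 
is hypothetical and not part of the PDFBox API. The toy parser only 
illustrates the control flow, with the xref entry check simplified to the 
"10-digit offset, 5-digit generation, n or f" shape.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical callback; NOT an existing PDFBox interface. */
interface ParseErrorHandler {
    /** Return true to skip the problem and continue, false to abort parsing. */
    boolean onError(String code, int entryIndex, String detail);
}

/** Hypothetical lenient handler: records every problem and keeps going. */
class LenientHandler implements ParseErrorHandler {
    final List<String> seen = new ArrayList<String>();

    @Override
    public boolean onError(String code, int entryIndex, String detail) {
        seen.add(code + " at entry " + entryIndex + ": " + detail);
        return true;
    }
}

/** Toy stand-in for a parser; it only shows where the callback is consulted. */
class ToyParser {
    private final ParseErrorHandler handler;

    ToyParser(ParseErrorHandler handler) {
        this.handler = handler;
    }

    /** Returns the well-formed entries; malformed ones are routed to the handler. */
    List<String> parseXref(List<String> entries) {
        List<String> accepted = new ArrayList<String>();
        for (int i = 0; i < entries.size(); i++) {
            String entry = entries.get(i);
            if (entry.matches("\\d{10} \\d{5} [nf]")) {
                accepted.add(entry);
            } else if (!handler.onError("INVALID_XREF_ENTRY", i, entry)) {
                // A real parser would more likely throw java.io.IOException here;
                // an unchecked exception keeps this sketch short.
                throw new IllegalStateException("invalid xref entry " + i + ": " + entry);
            }
        }
        return accepted;
    }
}
```

With something along these lines, the parser itself would no longer decide 
whether an invalid xref entry is fatal. A strict caller returns false from 
onError and gets the exception; a lenient caller returns true and gets 
whatever could be read.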
                
> expected='endstream' actual='' failure to parse
> -----------------------------------------------
>
>                 Key: PDFBOX-1541
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1541
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Text extraction
>    Affects Versions: 1.7.1
>         Environment: Ubuntu 12.04, JDK 1.7
>            Reporter: Jinder Aujla
>         Attachments: exporeal09_flyer_email3.pdf
>
>
> The following exception is thrown when parsing the attached PDF:
> Caused by: java.io.IOException: expected='endstream' actual='' org.apache.pdfbox.io.PushBackInputStream@2a789924
>       at org.apache.pdfbox.pdfparser.BaseParser.parseCOSStream(BaseParser.java:597)
>       at org.apache.pdfbox.pdfparser.PDFParser.parseObject(PDFParser.java:575)
>       at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:187)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
