[ https://issues.apache.org/jira/browse/PDFBOX-2163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tilman Hausherr resolved PDFBOX-2163. ------------------------------------- Resolution: Fixed > inline image with EI in the middle incorrectly parsed > ----------------------------------------------------- > > Key: PDFBOX-2163 > URL: https://issues.apache.org/jira/browse/PDFBOX-2163 > Project: PDFBox > Issue Type: Bug > Components: Parsing > Affects Versions: 1.8.6, 1.8.7, 2.0.0 > Reporter: Tilman Hausherr > Assignee: Tilman Hausherr > Labels: inline > Fix For: 1.8.7, 2.0.0 > > > This PDF > http://digitalcorpora.org/corp/nps/files/govdocs1/876/876636.pdf > has an exception because the end of an inline image is improperly detected. > The stream looks like this: > {code} > BI > /W 452 > /H 169 > /BPC 8 > /CS /RGB > /D [0.0 1.0 0.0 1.0 0.0 1.0] > /F [/A85 /Fl] > ID > ...................................................... > ....................................................EI > ...................................................... > ... > .... > EI Q > {code} > The inline images are handled in PDFStreamParser. This is tricky, we look for > followup bin data to check that it isn't an EI in the middle, but here it > isn't bin data, but ascii85 stuff. We also can't request that there be a LF > before the EI, because I remember that I had a PDF at work created by a well > known company that doesn't use it. -- This message was sent by Atlassian JIRA (v6.2#6252)