[
https://issues.apache.org/jira/browse/PDFBOX-462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jeremias Maerki updated PDFBOX-462:
-----------------------------------
Attachment: partial-false-fix.diff
I've managed to isolate and fix the problem. Please see the attached patch. The
problem is that InputStream.read() does not guarantee that it reads exactly the
amount of bytes specified. It can also be less. Somehow, this file triggered a
border case together with my PDF/FOP plug-in where this rule was invoked. I'm
really tired right now so I'll stop looking for other possible failure points
due to this bug. I'll also need to finish my work on an updated FOP plug-in
later. But at least, this should be a step forward.
> Problems parsing a PDF
> ----------------------
>
> Key: PDFBOX-462
> URL: https://issues.apache.org/jira/browse/PDFBOX-462
> Project: PDFBox
> Issue Type: Bug
> Components: Parsing
> Environment: Latest version from svn, Windows XP 64, IntelliJ Idea
> 8.0.1
> Reporter: Georg Datterl
> Attachments: P0737387.pdf, partial-false-fix.diff,
> PDFBOX462-P0737387.ps.gz, PDFBOX462-P07373871.png
>
>
> I have a PDF file which I want to load using PDFParser.load(InputStream,
> null), but some way through the parsing process BaseParser.parseDirObject()
> throws an IOException("expected false actual='fa'"). I downloaded the latest
> source code for the class and in line 871 indeed the string "false" is
> expected, but "fa" followed by three empty bytes is received.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.