[ 
https://issues.apache.org/jira/browse/PDFBOX-2527?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler resolved PDFBOX-2527.
----------------------------------------
    Resolution: Fixed

I'm finished at this point. I am discontinuing the work on rebuilding corrupt 
files that are also encrypted, as it is far more complicated than expected. We 
can open a new issue if someone comes up with a real sample (I created mine by 
manipulating a well-formed one).
Thanks to everybody for the help, input and reports.

> IOException: Negative seek offset in NonSequentialPDFParser
> -----------------------------------------------------------
>
>                 Key: PDFBOX-2527
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-2527
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 1.8.8, 2.0.0
>            Reporter: Tilman Hausherr
>            Assignee: Andreas Lehmkühler
>            Priority: Minor
>             Fix For: 2.0.0
>
>         Attachments: PDFBOX-2527-069020.pdf
>
>
> {code}
> Exception in thread "main" java.io.IOException: Negative seek offset
>       at java.io.RandomAccessFile.seek(Native Method)
>       at org.apache.pdfbox.io.RandomAccessBufferedFileInputStream.seek(RandomAccessBufferedFileInputStream.java:116)
>       at org.apache.pdfbox.io.PushBackInputStream.seek(PushBackInputStream.java:234)
>       at org.apache.pdfbox.pdfparser.NonSequentialPDFParser.initialParse(NonSequentialPDFParser.java:492)
>       at org.apache.pdfbox.pdfparser.NonSequentialPDFParser.parse(NonSequentialPDFParser.java:1013)
>       at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:951)
>       at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:897)
>       at org.apache.pdfbox.tools.PDFReader.parseDocument(PDFReader.java:375)
>       at org.apache.pdfbox.tools.PDFReader.openPDFFile(PDFReader.java:340)
>       at org.apache.pdfbox.tools.PDFReader.main(PDFReader.java:326)
>       at org.apache.pdfbox.tools.PDFBox.main(PDFBox.java:80)
> {code}
> This happens with several malformed PDFs from the test set in TIKA-1442. 
> These files (303385, 069020, 303385, 742141, 982996) all have some trash at 
> the end.
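
For readers unfamiliar with this failure mode, here is a minimal sketch of a
defensive "startxref" lookup that tolerates trailing garbage. It is not the
PDFBox code or the fix applied for this issue; the class StartXrefLocator and
its methods are hypothetical names used only for illustration. The assumption
is that the whole file is available as a byte array, that the last "startxref"
keyword is the relevant one, and that an offset outside the file should be
rejected (reported as -1) rather than passed on to a seek call.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of PDFBox.
class StartXrefLocator {
    private static final byte[] MARKER =
            "startxref".getBytes(StandardCharsets.ISO_8859_1);

    // Scan backwards over the whole buffer so that any amount of trailing
    // garbage after %%EOF does not hide the keyword. Returns -1 if absent.
    static int lastIndexOfMarker(byte[] data) {
        for (int i = data.length - MARKER.length; i >= 0; i--) {
            boolean match = true;
            for (int j = 0; j < MARKER.length; j++) {
                if (data[i + j] != MARKER[j]) {
                    match = false;
                    break;
                }
            }
            if (match) {
                return i;
            }
        }
        return -1;
    }

    private static boolean isWhitespace(byte b) {
        return b == ' ' || b == '\n' || b == '\r' || b == '\t' || b == '\f' || b == 0;
    }

    // Read the decimal offset that follows the keyword. Returns -1 when the
    // keyword or the number is missing, or when the offset lies outside the
    // file, so the caller never seeks to an invalid position.
    static long readXrefOffset(byte[] data) {
        int pos = lastIndexOfMarker(data);
        if (pos < 0) {
            return -1;
        }
        int i = pos + MARKER.length;
        while (i < data.length && isWhitespace(data[i])) {
            i++;
        }
        long value = 0;
        int digits = 0;
        // Cap the digit count so the long cannot overflow.
        while (i < data.length && data[i] >= '0' && data[i] <= '9' && digits < 18) {
            value = value * 10 + (data[i] - '0');
            i++;
            digits++;
        }
        if (digits == 0 || value >= data.length) {
            return -1;
        }
        return value;
    }

    public static void main(String[] args) {
        byte[] sample = "%PDF-1.4\nxref\nstartxref\n9\n%%EOF\ntrash trash"
                .getBytes(StandardCharsets.ISO_8859_1);
        System.out.println(readXrefOffset(sample)); // prints 9
    }
}
```

A caller would treat -1 as "no usable pointer found" and fall back to some
other strategy (for example, rebuilding the cross-reference table by scanning
for objects) instead of seeking.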



