[ https://issues.apache.org/jira/browse/PDFBOX-5398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17511722#comment-17511722 ]
Michael Klink commented on PDFBOX-5398: --------------------------------------- Yes, that file is _kaputt_. It suffices to make sure that PDFBox does not seriously hang up or kill the VM for it. An exception during parsing is completely appropriate. I would prefer a declared one, though, not a RuntimeException or Error. > Parsing fails in 2.0.26 that worked in 2.0.25 > --------------------------------------------- > > Key: PDFBOX-5398 > URL: https://issues.apache.org/jira/browse/PDFBOX-5398 > Project: PDFBox > Issue Type: Bug > Components: Parsing > Affects Versions: 2.0.26, 3.0.0 PDFBox > Reporter: Tilman Hausherr > Assignee: Andreas Lehmkühler > Priority: Major > Labels: regression > Attachments: 077867.pdf, 392443.pdf, > crash-024bde7e01045bb3a6ab9d86ccccb13cf411bc35.pdf > > > {noformat} > März 23, 2022 4:14:13 AM org.apache.pdfbox.pdfparser.BaseParser > parseCOSDictionaryNameValuePair > WARNUNG: Empty COSName at offset 12313 > Exception in thread "main" java.io.IOException: Unknown dir object c='>' > cInt=62 peek='>' peekInt=62 at offset 12326 (start offset: 12326) > at > org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:928) > at > org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryValue(BaseParser.java:154) > at > org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryNameValuePair(BaseParser.java:303) > at > org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary(BaseParser.java:228) > at > org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:872) > at > org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryValue(BaseParser.java:154) > at > org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryNameValuePair(BaseParser.java:303) > at > org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary(BaseParser.java:228) > at > org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:872) > at > org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:916) > at > org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:883) > at > org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:796) > at > org.apache.pdfbox.pdfparser.COSParser.parseDictObjects(COSParser.java:756) > at > org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:187) > at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:226) > {noformat} > The cause is not PDFBOX-5283. -- This message was sent by Atlassian Jira (v8.20.1#820001) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org