[ https://issues.apache.org/jira/browse/PDFBOX-3004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14941357#comment-14941357 ]
Arkady Zalkowitsch commented on PDFBOX-3004:
--------------------------------------------

Example of a PDF file for which parsing fails. Any font that comes after the "/Encoding" entry in the PDF object is not considered (just Arial is considered).

name             type          encoding    emb sub uni object ID
---------------- ------------- ----------- --- --- --- ---------
Arial            CID TrueType  Identity-H  yes no  yes     12  0
[none]           Type 3        Custom      yes no  yes     17  0
[none]           Type 3        Custom      yes no  yes    274  0
Source Sans Pro  CID TrueType  Identity-H  yes no  yes    531  0
[none]           Type 3        Custom      yes no  yes    536  0
Source Sans Pro  CID TrueType  Identity-H  yes no  yes    793  0

> PDF fulltext index fails.
> -------------------------
>
>                 Key: PDFBOX-3004
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-3004
>             Project: PDFBox
>          Issue Type: Bug
>            Reporter: Arkady Zalkowitsch
>
> PDF fulltext indexing fails when the font dictionary contains one entry for
> the font Helvetica and an entry for Encoding whose value does not represent a
> font at all.
> The PDF object looks like this:
> obj = {
>   "/Fields": [ 12 0 R ],
>   "/DA": "/Helvetica 0 Tf 0 g",
>   "/DR": {
>     "/Font": {
>       "/Helvetica": "11 0 R",
>       "/Encoding": {
>         "/PDFDocEncoding": "10 0 R"
>       }
>     }
>   },
>   "/NeedAppearances": true
> }
> PDFBox tries to parse that "font" called Encoding and fails to do so, but
> PDResources.getFonts() only logs the resulting exception:
> try {
>     newFont = PDFontFactory.createFont( (COSDictionary)font );
> } catch (IOException exception) {
>     LOG.error("error while creating a font", exception);
> }
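
For illustration only, here is a minimal sketch of one way to skip the non-font "/Encoding" entry described in the issue before attempting to create a font from it. It does not use the PDFBox API: the class FontEntryFilter, its methods, and the Map-based model of the dictionary are all hypothetical, and it assumes that indirect references such as "11 0 R" have already been resolved to dictionaries. The only PDF-specific fact it relies on is that a font dictionary carries a /Type entry with the value /Font.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for a resolved /Font resource dictionary.
// This is NOT PDFBox code; plain Maps are used to keep the sketch self-contained.
class FontEntryFilter {

    // Returns the keys whose values look like font dictionaries (/Type == /Font).
    // Entries such as "/Encoding" in the reported file are skipped, not parsed.
    static List<String> fontKeys(Map<String, Object> fontResources) {
        List<String> keys = new ArrayList<>();
        for (Map.Entry<String, Object> entry : fontResources.entrySet()) {
            Object value = entry.getValue();
            if (value instanceof Map && "Font".equals(((Map<?, ?>) value).get("Type"))) {
                keys.add(entry.getKey());
            }
        }
        return keys;
    }

    // Builds a dictionary shaped like the one in the issue description.
    // The contents of the resolved objects are assumptions made for the example.
    static Map<String, Object> sampleResources() {
        Map<String, Object> helvetica = new LinkedHashMap<>();
        helvetica.put("Type", "Font");
        helvetica.put("Subtype", "Type1");
        helvetica.put("BaseFont", "Helvetica");

        Map<String, Object> pdfDocEncoding = new LinkedHashMap<>();
        pdfDocEncoding.put("Type", "Encoding");

        Map<String, Object> encoding = new LinkedHashMap<>();
        encoding.put("PDFDocEncoding", pdfDocEncoding);

        Map<String, Object> fonts = new LinkedHashMap<>();
        fonts.put("Helvetica", helvetica);
        fonts.put("Encoding", encoding);
        return fonts;
    }

    public static void main(String[] args) {
        // Prints [Helvetica]
        System.out.println(fontKeys(sampleResources()));
    }
}
```

Checking the entry type up front means a stray non-font entry is skipped deliberately instead of surfacing as a logged exception, and the remaining entries are still processed.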