[ https://issues.apache.org/jira/browse/PDFBOX-3024?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15024296#comment-15024296 ]
Guillaume Monteils commented on PDFBOX-3024: -------------------------------------------- Sorry for not answering sooner but i wait for the 2.0 Prefligth. I did the test yesterday with the new version, and the problem still occurs. And the validation gives always the same answer : "3.3.1 : Glyph error, The character code 0 in the font program "XYNCVU+TimesNewRomanPSMT" is missing from the Character Encoding" (attached PDFBox.png) I did the Test in PDF-Tools too, the file is valid (PDF-Tools.png) I really think the problem is still the test for glyph presence. @Override public boolean hasGlyph(int code) throws IOException { return font.codeToGID(code) != 0; } When i look at this site (http://www.readytext.co.uk/?p=1375), it says that GID ranges from 0 to N-1 where N is the glyph count. So i don't understand why the GID must be different from 0. I undestand that my previous modification GID >= 0 was wrong due to another test. I will try sending back a null value in case of error. Get back when it is done. > Preflight validation call PDType0Font.clear at the wrong time > ------------------------------------------------------------- > > Key: PDFBOX-3024 > URL: https://issues.apache.org/jira/browse/PDFBOX-3024 > Project: PDFBox > Issue Type: Bug > Components: Preflight > Affects Versions: 1.8.10 > Reporter: Guillaume Monteils > Attachments: 004973.pdf, PDF-Tools.png, PDFBox.png, eclipse-1.jpg, > eclipse-2.jpg > > > I used the algorythm here to test PDF / A compliance : > https://pdfbox.apache.org/1.8/cookbook/pdfavalidation.html > With one pdf document (which i cant give you due to confidentiality), an > NullPointerException occur here : > {code} > java.lang.NullPointerException > at > org.apache.pdfbox.pdmodel.font.PDType0Font.getFontWidth(PDType0Font.java:188) > at > org.apache.pdfbox.preflight.font.container.FontContainer.checkGlyphWith(FontContainer.java:114) > at > org.apache.pdfbox.preflight.content.ContentStreamWrapper.validText(ContentStreamWrapper.java:372)... > {code} > As i dug deeper, i found that preflight loads a font context where it puts > all pdf fonts. The PDType0Font is also created and put in this context. > {code} > (CSObject : > COSDictionary{(COSName{BaseFont}:COSName{INWHIX+TimesNewRomanPSMT}) > (COSName{DescendantFonts}:COSArray{[COSObject{349, 0}]}) > (COSName{Encoding}:COSName{Identity-H}) > (COSName{Subtype}:COSName{Type0}) > (COSName{ToUnicode}:COSDictionary{(COSName{Filter}:COSName{FlateDecode}) > (COSName{Length}:COSInt{260}) }) (COSName{Type}:COSName{Font}) }) > {code} > The problem is that at the end of one step of the analysis, the clear method > is called on the PDType0Font (see eclipse-1.jpg), but the font is still > present in the context. On a second step, the same font is retrieved from the > context, with no data in it, and the NullPointerException occurs (see > eclipse-2.jpg). > I tried the validation after removing the clear method from PDType0Font and > it works just fine. > I think the problem comes from this context, and a clear on a font should > also trigger a deletion in this map. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org