[ 
https://issues.apache.org/jira/browse/TIKA-2862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16837432#comment-16837432
 ] 

Sergey Beryozkin edited comment on TIKA-2862 at 5/10/19 4:37 PM:
-----------------------------------------------------------------

The call path from PDType1Font to RAFDataStream:

{noformat}
        at org.apache.fontbox.ttf.TTFParser.parse(TTFParser.java:132)
        at org.apache.fontbox.ttf.TTFParser.parse(TTFParser.java:87)
        at 
org.apache.pdfbox.pdmodel.font.FileSystemFontProvider.readTrueTypeFont(FileSystemFontProvider.java:731)
        at 
org.apache.pdfbox.pdmodel.font.FileSystemFontProvider.getTrueTypeFont(FileSystemFontProvider.java:696)
        at 
org.apache.pdfbox.pdmodel.font.FileSystemFontProvider.access$200(FileSystemFontProvider.java:55)
        at 
org.apache.pdfbox.pdmodel.font.FileSystemFontProvider$FSFontInfo.getFont(FileSystemFontProvider.java:132)
        at 
org.apache.pdfbox.pdmodel.font.FontMapperImpl.findFont(FontMapperImpl.java:436)
        at 
org.apache.pdfbox.pdmodel.font.FontMapperImpl.findFontBoxFont(FontMapperImpl.java:382)
        at 
org.apache.pdfbox.pdmodel.font.FontMapperImpl.getFontBoxFont(FontMapperImpl.java:359)
        at 
org.apache.pdfbox.pdmodel.font.PDType1Font.<init>(PDType1Font.java:146)
        at 
org.apache.pdfbox.pdmodel.font.PDType1Font.<clinit>(PDType1Font.java:91)
{noformat}


was (Author: sergey_beryozkin):
The call path from PDType1Font to RAFDataStream:

{noformat}
17:31:09,714 ERROR [org.apa.pdf.pdm.fon.FileSystemFontProvider] Could not load 
font file: /usr/share/fonts/liberation/LiberationSans-Regular.ttf: 
java.lang.NullPointerException
        at 
org.apache.fontbox.ttf.RAFDataStream.readSignedShort(RAFDataStream.java:77)
        at 
org.apache.fontbox.ttf.TTFDataStream.read32Fixed(TTFDataStream.java:50)
        at org.apache.fontbox.ttf.TTFParser.parse(TTFParser.java:132)
        at org.apache.fontbox.ttf.TTFParser.parse(TTFParser.java:87)
        at 
org.apache.pdfbox.pdmodel.font.FileSystemFontProvider.readTrueTypeFont(FileSystemFontProvider.java:731)
        at 
org.apache.pdfbox.pdmodel.font.FileSystemFontProvider.getTrueTypeFont(FileSystemFontProvider.java:696)
        at 
org.apache.pdfbox.pdmodel.font.FileSystemFontProvider.access$200(FileSystemFontProvider.java:55)
        at 
org.apache.pdfbox.pdmodel.font.FileSystemFontProvider$FSFontInfo.getFont(FileSystemFontProvider.java:132)
        at 
org.apache.pdfbox.pdmodel.font.FontMapperImpl.findFont(FontMapperImpl.java:436)
        at 
org.apache.pdfbox.pdmodel.font.FontMapperImpl.findFontBoxFont(FontMapperImpl.java:382)
        at 
org.apache.pdfbox.pdmodel.font.FontMapperImpl.getFontBoxFont(FontMapperImpl.java:359)
        at 
org.apache.pdfbox.pdmodel.font.PDType1Font.<init>(PDType1Font.java:146)
        at 
org.apache.pdfbox.pdmodel.font.PDType1Font.<clinit>(PDType1Font.java:91)
{noformat}

> Make PDF Parser Graal native mode ready 
> ----------------------------------------
>
>                 Key: TIKA-2862
>                 URL: https://issues.apache.org/jira/browse/TIKA-2862
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.20
>            Reporter: Sergey Beryozkin
>            Assignee: Sergey Beryozkin
>            Priority: Major
>
> PDF Parser is not Graal native mode ready yet, the following is reported when 
> it is processed as part of Quarkus native mode build:
> Error: Detected a FileDescriptor in the image heap. ....You can manually 
> delay class initialization to image run time by using the option 
> --delay-class-initialization-to-runtime=<class-name>. ...
> Detailed message:
> Trace:     object org.apache.fontbox.ttf.BufferedRandomAccessFile
>     object org.apache.fontbox.ttf.RAFDataStream
>     object org.apache.fontbox.ttf.TrueTypeFont
>     object org.apache.pdfbox.pdmodel.font.PDType1Font
>     method 
> org.apache.pdfbox.pdmodel.interactive.form.PDAcroForm.verifyOrCreateDefaults()
> Call path from entry point to 
> org.apache.pdfbox.pdmodel.interactive.form.PDAcroForm.verifyOrCreateDefaults():
>  
>     at 
> org.apache.pdfbox.pdmodel.interactive.form.PDAcroForm.verifyOrCreateDefaults(PDAcroForm.java:106)
>     at 
> org.apache.pdfbox.pdmodel.interactive.form.PDAcroForm.<init>(PDAcroForm.java:93)
>     at 
> org.apache.pdfbox.pdmodel.PDDocumentCatalog.getAcroForm(PDDocumentCatalog.java:108)
>     at org.apache.tika.parser.pdf.PDFParser.handleXFAOnly(PDFParser.java:534)
>     at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:164)
>  
> See also 
> [https://medium.com/graalvm/understanding-class-initialization-in-graalvm-native-image-generation-d765b7e4d6ed]



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to