[ https://issues.apache.org/jira/browse/TIKA-3683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17496285#comment-17496285 ]
Tilman Hausherr commented on TIKA-3683: --------------------------------------- I don't know about the first and the last, but the other three are font packages so PDFs will look better when rendering when fonts are not embedded in a PDF. > Documentation of native dependencies per module > ----------------------------------------------- > > Key: TIKA-3683 > URL: https://issues.apache.org/jira/browse/TIKA-3683 > Project: Tika > Issue Type: Wish > Components: tika-docker, tika-server > Reporter: dataminer.accolade > Priority: Minor > > I created a custom Docker image using the latest Tesseract release. I came > across the tika > [Dockerfile|https://github.com/apache/tika-docker/blob/master/full/Dockerfile] > file which installs the following dependencies: > xfonts-utils > fonts-freefont-ttf > fonts-liberation > ttf-mscorefonts-installer > cabextract > I have not found any documetation yet about those dependencies in > [https://cwiki.apache.org/confluence/display/tika] and > [https://github.com/apache/tika]. I can only guess that those dependencies > might impact PDF content handling. -- This message was sent by Atlassian Jira (v8.20.1#820001)