[ https://issues.apache.org/jira/browse/TIKA-3874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17616949#comment-17616949 ]

Tim Allison commented on TIKA-3874:
-----------------------------------

Not clear how we want to do this. The simplest method would be a percentage, but it feels like we should have a sense of scale as well. If one PDF has only 10 characters and 9 of them lack mappings, is that a greater loss of information than a PDF with 10000 characters and missing mappings for 1000?

Perhaps one field for the overall average and one for the sum of missing?

> Add summary of missing unicode mappings for PDF
> -----------------------------------------------
>
>                 Key: TIKA-3874
>                 URL: https://issues.apache.org/jira/browse/TIKA-3874
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Minor
>


--
This message was sent by Atlassian Jira
(v8.20.10#820010)
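
For illustration only, here is a minimal sketch of the two-field idea raised in the comment: report a ratio and an absolute count side by side, so that scale is not lost. The class and method names (`MissingMappingSummary`, `missingCount`, `missingRatio`) are hypothetical and are not part of Tika or PDFBox; the numbers are the two cases from the comment, and treating an empty document as ratio 0.0 is an assumption.

```java
// Hypothetical holder for a per-document summary of missing unicode mappings.
// Not a Tika API; a sketch of "one field for the ratio, one for the sum".
final class MissingMappingSummary {
    private final long totalChars;
    private final long missingChars;

    MissingMappingSummary(long totalChars, long missingChars) {
        if (totalChars < 0 || missingChars < 0 || missingChars > totalChars) {
            throw new IllegalArgumentException("expected 0 <= missing <= total");
        }
        this.totalChars = totalChars;
        this.missingChars = missingChars;
    }

    // Absolute number of characters without a unicode mapping (sense of scale).
    long missingCount() {
        return missingChars;
    }

    // Fraction of characters without a mapping; 0.0 for an empty document
    // (an assumption, chosen to avoid dividing by zero).
    double missingRatio() {
        return totalChars == 0 ? 0.0 : (double) missingChars / totalChars;
    }

    public static void main(String[] args) {
        MissingMappingSummary small = new MissingMappingSummary(10, 9);
        MissingMappingSummary large = new MissingMappingSummary(10000, 1000);
        // The small file has the higher ratio; the large file has the higher count.
        System.out.println(small.missingRatio() + " / " + small.missingCount());
        System.out.println(large.missingRatio() + " / " + large.missingCount());
    }
}
```

The two cases rank in opposite order depending on which field is used, which is the argument for reporting both rather than a percentage alone.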