parse Unicode glyph names
-------------------------
Key: PDFBOX-433
URL: https://issues.apache.org/jira/browse/PDFBOX-433
Project: PDFBox
Issue Type: Improvement
Components: Parsing, Text extraction
Affects Versions: 0.8.0-incubator
Reporter: Timo Boehme
Priority: Minor
Adobe has specified (http://www.adobe.com/devnet/opentype/archives/glyph.html)
how glyph names should be constructed to easily convert them (to Unicode). What
is currently missing in PDFBox is the handling of suffixes (NAME.SUFFIX) and
Unicode names (uniXXXX). I have therefore attached an updated method
getCharacter( COSName name ) for class org.apache.pdfbox.encoding.Encoding.
It first strips off suffix and tests later on for names starting with 'uni'.
Timo
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.