Hi,
Gesendet: Mi, 20. Jan 2010 Von: tsuraan<[email protected]> > Has there been any progress towards Asian language support since the > last ticket updates from last september? I'm afraid there is still no full support for asian languages. To be more specific the problem is the missing support for CID-coded fonts and unicode mappings, which is especially needed for pdfs based on non latin based characters sets like those heavily used whithin asian languages. > I've been using the standard > unix pdftotext tool for pdf processing, and while that extracts utf-8 > text from Asian pdfs perfectly, I don't particularly like shelling out > to an external program. Completed Asian support for pdfbox would be > really nice. I guess that everyone knows that the support for CID-code fonts and unicode mappings is one of the needed keyfeatures. But it's not that easy to implement, especially if you are not able to read those (asian) texts. ;-) It is one of the issues on my "want-todo-list" .... BR Andreas Lehmkühler

