Hi,

Sent: Wed, 20 Jan 2010 From: tsuraan <[email protected]>

> Has there been any progress towards Asian language support since the
> last ticket updates from last September?
I'm afraid there is still no full support for Asian languages. To be more
specific, the problem is the missing support for CID-coded fonts and Unicode
mappings, which is especially needed for PDFs that use non-Latin character
sets, like those heavily used in Asian languages.

> I've been using the standard
> unix pdftotext tool for pdf processing, and while that extracts utf-8
> text from Asian pdfs perfectly, I don't particularly like shelling out
> to an external program.  Completed Asian support for pdfbox would be
> really nice.
I guess everyone knows that support for CID-coded fonts and Unicode mappings
is one of the key features still needed. But it's not that easy to implement,
especially if you are not able to read those (Asian) texts. ;-) It is one of
the issues on my "want-to-do list" ...

BR
Andreas Lehmkühler
