This is a difficult one. Currently, only CJK fonts with a ToUnicode cmap or a
Unicode cmap are supported. Other encodings, as is the case here, will require
a serious rewrite of DocumentFont and CMapAwareDocumentFont.
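
To illustrate why the mapping matters, here is a minimal sketch of what a
ToUnicode lookup does for a font that uses 2-byte character codes. This is not
iText code: the class ToUnicodeSketch and its methods are hypothetical, and the
code values are made up for the example. It only shows that a character code
without a mapping cannot be turned into readable text.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical illustration only; this class is not part of iText.
class ToUnicodeSketch {
    // Plays the role of the bfchar entries in a ToUnicode cmap:
    // 2-byte character code -> Unicode text.
    private final Map<Integer, String> codeToUnicode = new HashMap<>();

    void addMapping(int code, String unicode) {
        codeToUnicode.put(code, unicode);
    }

    // Decodes a string made of 2-byte codes. Codes without a mapping are
    // replaced by U+FFFD, since there is no way to know what they stand for.
    String decode(byte[] raw) {
        StringBuilder out = new StringBuilder();
        for (int i = 0; i + 1 < raw.length; i += 2) {
            int code = ((raw[i] & 0xFF) << 8) | (raw[i + 1] & 0xFF);
            String mapped = codeToUnicode.get(code);
            out.append(mapped != null ? mapped : "\uFFFD");
        }
        return out.toString();
    }

    public static void main(String[] args) {
        ToUnicodeSketch cmap = new ToUnicodeSketch();
        cmap.addMapping(0x0001, "\u4E2D"); // made-up code for a Chinese character
        cmap.addMapping(0x0002, "\u6587"); // made-up code for a Chinese character
        // The third code (0x0003) has no mapping and decodes to U+FFFD.
        byte[] shown = {0x00, 0x01, 0x00, 0x02, 0x00, 0x03};
        System.out.println(cmap.decode(shown));
    }
}
```

A font that ships neither kind of cmap leaves the extractor in the position of
the third code above for every character, which is why supporting other
encodings needs more than a small patch.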
Paulo
----- Original Message -----
From: 1T3XT BVBA
To: Post all your questions about iText here
Sent: Sunday, August 21, 2011 2:43 PM
Subject: Re: [iText-questions] Problem when extracting CJK chars from PDF files
On 21/08/2011 4:12, Mophy Xiong wrote:
> Hi all,
>
> I'm using iText 5.1.2 to extract text from PDF files, but it just returns
> two spaces (#32#32) whenever it encounters a Chinese character. An example
> PDF file is attached.
You're right. I was able to reproduce the problem.
As soon as I find the time, I'll look into it.
Unfortunately, I'm almost fully booked next week.
Maybe next weekend.
_______________________________________________
iText-questions mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/itext-questions
iText(R) is a registered trademark of 1T3XT BVBA.
Many questions posted to this list can (and will) be answered with a
reference to the iText book: http://www.itextpdf.com/book/
Please check the keywords list before you ask for examples:
http://itextpdf.com/themes/keywords.php