This is a difficult one. Currently only CJK fonts with a ToUnicode cmap or an 
Unicode cmap is supported. Other encodings, as it's the case, will require a 
serious rerwrite of DocumentFont and CMapAwareDocumentFont.

Paulo
  ----- Original Message ----- 
  From: 1T3XT BVBA 
  To: Post all your questions about iText here 
  Sent: Sunday, August 21, 2011 2:43 PM
  Subject: Re: [iText-questions] Problem when extracting CJK chars from PDF 
files


  On 21/08/2011 4:12, Mophy Xiong wrote:
  > Hi all,
  >
  > I'm using iText 5.1.2 to extract text from PDF files. But it just returns me
  > two spaces (#32#32) when it encounters a chinese char. An example PDF file
  > is attached.
  You're right. I was able to reproduce the problem.
  As soon as I find the time, I'll look into it.
  Unfortunately, I'm almost fully booked next week.
  Maybe next weekend.

  ------------------------------------------------------------------------------
  Get a FREE DOWNLOAD! and learn more about uberSVN rich system, 
  user administration capabilities and model configuration. Take 
  the hassle out of deploying and managing Subversion and the 
  tools developers use with it. http://p.sf.net/sfu/wandisco-d2d-2
  _______________________________________________
  iText-questions mailing list
  [email protected]
  https://lists.sourceforge.net/lists/listinfo/itext-questions

  iText(R) is a registered trademark of 1T3XT BVBA.
  Many questions posted to this list can (and will) be answered with a 
reference to the iText book: http://www.itextpdf.com/book/
  Please check the keywords list before you ask for examples: 
http://itextpdf.com/themes/keywords.php
------------------------------------------------------------------------------
uberSVN's rich system and user administration capabilities and model 
configuration take the hassle out of deploying and managing Subversion and 
the tools developers use with it. Learn more about uberSVN and get a free 
download at:  http://p.sf.net/sfu/wandisco-dev2dev
_______________________________________________
iText-questions mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/itext-questions

iText(R) is a registered trademark of 1T3XT BVBA.
Many questions posted to this list can (and will) be answered with a reference 
to the iText book: http://www.itextpdf.com/book/
Please check the keywords list before you ask for examples: 
http://itextpdf.com/themes/keywords.php

Reply via email to