On Jun 2, 2008, at 4:49 AM, Cam Bazz wrote:

> Well, this is a very common problem. If the pdf has embedded fonts,  
> many software can not extract the font correctly - especially if  
> the embedded font has custom encoding.
>

        Any software that can't extract text with embedded fonts is BROKEN  
and should be tossed in the trash where it belongs!   PDF has  
supported embedded fonts and custom encodings since version 1.0 (yes,  
it's ENTIRE LIFE!).   Embedding of fonts is a requirement for all of  
the ISO "subset" standards of PDF (PDF/X, PDF/A and PDF/E).   To not  
support it correctly and properly means stupid programmers and bad  
software.


> I am interested in a solution for changing the font of the pdf  
> document as well, programatically.
>
        Changing the font for non-embeds has already been discussed.    
Changing it for embedded fonts is difficult->impossible... There are  
some tools out there that attempt to it - but not all succeed...


> The only solution I found, is to use pdflib TET, to extract word  
> based - or glyph based info from the pdf, and use itext to form, a  
> new, restructured pdf document. depending on the detail of the  
> original pdf your milage might vary.
>
        OUCH!!!


Leonard


-------------------------------------------------------------------------
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/
_______________________________________________
iText-questions mailing list
iText-questions@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/itext-questions

Do you like iText?
Buy the iText book: http://www.1t3xt.com/docs/book.php
Or leave a tip: https://tipit.to/itexttipjar

Reply via email to