Thank to both of you.

You are right: when coping to text there is nothing but random characters
because the font (namely the differences array) is wrong.

But I have discovered why is wrong:  the character g3 in the vector , for
instance, means the Ascii code 29+3=32 which is an space. All characters
follow the same patern gnn (the letter g followed by an integer). the Ascii
code is always 29+nn

Therefore I made a little program that edits the pdf, gets the differences
array, compute the right caracter and then rebuilds the array back. Now I
can read the pdf, once is beeing rebuilt in this fashion.

I know I should not spend so much time correcting somebody else's mistakes,
but I receive plenty of pdf like this...

--
View this message in context: 
http://itext-general.2136553.n4.nabble.com/Unreadable-Pdf-with-PdfTextExtractor-tp3345219p3347943.html
Sent from the iText - General mailing list archive at Nabble.com.

------------------------------------------------------------------------------
Colocation vs. Managed Hosting
A question and answer guide to determining the best fit
for your organization - today and in the future.
http://p.sf.net/sfu/internap-sfd2d
_______________________________________________
iText-questions mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/itext-questions

iText(R) is a registered trademark of 1T3XT BVBA.
Many questions posted to this list can (and will) be answered with a reference 
to the iText book: http://www.itextpdf.com/book/
Please check the keywords list before you ask for examples: 
http://itextpdf.com/themes/keywords.php

Reply via email to