Thank to both of you. You are right: when coping to text there is nothing but random characters because the font (namely the differences array) is wrong.
But I have discovered why is wrong: the character g3 in the vector , for instance, means the Ascii code 29+3=32 which is an space. All characters follow the same patern gnn (the letter g followed by an integer). the Ascii code is always 29+nn Therefore I made a little program that edits the pdf, gets the differences array, compute the right caracter and then rebuilds the array back. Now I can read the pdf, once is beeing rebuilt in this fashion. I know I should not spend so much time correcting somebody else's mistakes, but I receive plenty of pdf like this... -- View this message in context: http://itext-general.2136553.n4.nabble.com/Unreadable-Pdf-with-PdfTextExtractor-tp3345219p3347943.html Sent from the iText - General mailing list archive at Nabble.com. ------------------------------------------------------------------------------ Colocation vs. Managed Hosting A question and answer guide to determining the best fit for your organization - today and in the future. http://p.sf.net/sfu/internap-sfd2d _______________________________________________ iText-questions mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/itext-questions iText(R) is a registered trademark of 1T3XT BVBA. Many questions posted to this list can (and will) be answered with a reference to the iText book: http://www.itextpdf.com/book/ Please check the keywords list before you ask for examples: http://itextpdf.com/themes/keywords.php
