1. you are still posting to me personally instead of to the mailing list, even after you said you had overlooked my message about my mail policy. I'm sorry, but I'll have to blacklist you for this. It's nothing personal. It's standard policy. 2. You have sent me a VB.NET example. I have never worked with VB .NET; I only do JAVA 3. I had a look at your PDF. Open it in Adobe Reader and select the 'Select' option in the toolbar. Using your mouse pointer you can select the French text (and do copy/paste). You can't do that with the arabic text. That's why iText and PDFBox produce rubbish.
There is no way to solve this problem with iText. Please forget what I told you earlier about encodings, I was confused. When a large font is subsetted characters are mapped to glyphs (in your case arabic glyphs). What you get are these characters. They are IDs of a map. The map can be different for every different PDF, therefore it is very difficult to retrieve 'text' from a PDF. iText and/or PDFBox will NOT be able to help you with this problem. best regards, Bruno ------------------------------------------------------------------------- Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT & business topics through brief surveys -- and earn cash http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV _______________________________________________ iText-questions mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/itext-questions
