Hi, It happens that I just committed a patch that addresses similar issues a few minutes ago. Would you mind checking the current trunk to see if your approach is the same as mine?
You can email your patches directly to this list. Merci -- Palmer Zent On September 27, 2015 at 10:14:37 AM, Hugues LEFEBVRE (hugues.lefeb...@lusis.fr) wrote: Hi, I'm was using PoDoFo to extract text from a PDF. When trying to get the unicode characters from glyphs (ie: for Tj command) it was not working in some cases. When a TrueType font has no Encoding but a ToUnicode map then it's not read. The ToUnicode CMAP parser (from PdfIdentityEncoding and PdfCMapEncoding) has some also bugs (like the value of loop variable that is not reset between sections) loading only partial informations from the CMap. I've fixed this points and now I'm able to get all the text in the PDF file with PoDoFo. I'm new to PoDoFo and I don't know how to submit a patch for these corrections (if there is a way) in case it helps other people having the same problems. Meanwhile or if patch are not accepted/reviewed people having same issue can ask me for the patch. Regards, Hugues ------------------------------------------------------------------------------ _______________________________________________ Podofo-users mailing list Podofo-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/podofo-users
------------------------------------------------------------------------------
_______________________________________________ Podofo-users mailing list Podofo-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/podofo-users