I discovered that in 
     PdfTokenizer::ReadString() 
there is 
     PdfImputDevice::GetChar();
which returns characters ‘wrong’, so decoding have to take place erlier.
Any hint where can I look for more? I’m new with PoDoFo and I’m little bit 
confused.
Maybe PdfDictionary is read wrong? 

Best regards, 
Fryderyk


Od: frydery...@gmail.com
Wysłano: poniedziałek, 6 lutego 2017 19:37
Do: podofo-users@lists.sourceforge.net
Temat: ODP: [Podofo-users] reading polish characters using PoDoFo

I was able to extract text by making ‘m_pEncoding’ public in PdfFont.h and 
changing encoding to Iso88592(which contains polish characters):
PdfEncoding* pIsoEncoding = new PdfIso88592Encoding();
pCurFont->m_pEncoding = pIsoEncoding;
I have text now, but still have not polish characters.
I think it may be bug in 
PdfContsntsTokenizer.ReadNext() 
which doesn’t read polish chars in the right way, as PoDoFo writes unknown 
chars like ‘\200’, ‘\201’.
Will try to figure it out, but if someone has any hint, pls let me know😊

Best regards, 
Fryderyk

------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, SlashDot.org! http://sdm.link/slashdot
_______________________________________________
Podofo-users mailing list
Podofo-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/podofo-users

Reply via email to