Fulvio,

This is a mapping problem: some character sequences have been combined
into single ligature glyphs (fi, ff, fl, ft, Th).

Have a look in the file (Resources.afm.Times-Roman.afm) at the codes above
126.

You might need to change the PDFont encoding to produce the result you require.
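
If the extracted text does contain the Unicode ligature code points
(U+FB00 to U+FB04) rather than "?", one text-side workaround is to expand
them after extraction. The following is only a minimal sketch under that
assumption: LigatureExpander is a hypothetical helper, not part of PDFBox,
and it does not touch the font encoding at all. Once the stripper has
already emitted "?", the original character is lost and this will not
help. Note also that ft and Th have no code point in that Unicode block,
so they are not covered.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper (not a PDFBox class): expands Unicode ligature
// code points into their plain-letter spellings.
public class LigatureExpander {

    // Ligatures from the Unicode "Alphabetic Presentation Forms" block.
    private static final Map<Character, String> LIGATURES = new LinkedHashMap<>();
    static {
        LIGATURES.put('\uFB00', "ff");
        LIGATURES.put('\uFB01', "fi");
        LIGATURES.put('\uFB02', "fl");
        LIGATURES.put('\uFB03', "ffi");
        LIGATURES.put('\uFB04', "ffl");
    }

    // Returns a copy of text with each known ligature replaced by its letters;
    // all other characters are copied unchanged.
    public static String expand(String text) {
        StringBuilder out = new StringBuilder(text.length());
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            String plain = LIGATURES.get(c);
            out.append(plain != null ? plain : String.valueOf(c));
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // "of" + U+FB01 + "ce" and U+FB02 + "ow"
        System.out.println(expand("of\uFB01ce \uFB02ow")); // prints: office flow
    }
}
```

You would call LigatureExpander.expand(...) on the string returned by the
text stripper before using it.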

Cheers --- Iain

Fulvio D'Antonio wrote:
Hello everybody,
I'm using PDFTextStripper to extract plain text from a PDF.
The problem I encounter is that every occurrence of "fi", "ffi", etc. is
replaced by a "?".
I think it is an encoding problem, but I can't figure out how to solve it.

Thank you in advance for your help.

Fulvio

