'fi' getting converted to '?' ----------------------------- Key: PDFBOX-860 URL: https://issues.apache.org/jira/browse/PDFBOX-860 Project: PDFBox Issue Type: Bug Components: Text extraction Affects Versions: 1.2.1 Environment: Solaris 10 Reporter: Saurabh Mehrotra
Hi I am trying to use PDF box 1.2.1 version to extract text from PDF files. The following issue is observed in the extracted text: 1. Combination of the characters 'fi' is converted to a '?' example: first becomes ?rst classifier becomes classi?er find becomes ?nd Is this a known bug? Can some setting of the PDF box be turned of to prevent this? Thanks & Regards Saurabh -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.