Encoding in PDFTextStripperByArea

2020-02-02 Thread Athanasios Viennas
Hello! I am looking for the encoding functionality in org.apache.pdfbox.util.PDFTextStripperByArea. The encoding option in the constructor was removed after release 1.8.13, and I now want to use it in release 2.0.18. I want to have control over parsing a PDF document of the research-paper type

Re: question about LegacyPDFStreamEngine

2020-02-02 Thread Tilman Hausherr
I had a look... the calculations were indeed in PDFStreamEngine, but in processEncodedText(). ShowGlyph() didn't exist previously, and ever since it was introduced it has been mostly empty. I'll adjust the comment within a few days (I'll wait for feedback here first) to replace "good" with "mostly empty". With "