On Tue, Apr 12, 2022 at 5:24 PM Francesco Pretto <[email protected]> wrote:
> On Tue, 12 Apr 2022 at 14:50, zyx <[email protected]> wrote:
> > there exists a text extract tool [1], which is supposed to, well, extract
> > text from the PDF files.
> >
> > [1] https://sourceforge.net/p/podofo/code/HEAD/tree/podofo/branches/PODOFO_0_9_7_BRANCH/tools/podofotxtextract/
>
> Correct: albeit many text related operators are not handled, that is
> the code to look in PoDoFo.

Just note that text position really does not depend on "m" or "l" operators like that code may misleadingly suggest (correct me if I am wrong):

    if( strcmp( pszToken, "l" ) == 0 ||
        strcmp( pszToken, "m" ) == 0 )
    {
        if( stack.size() == 2 )
        {
            dCurPosX = stack.top().GetReal();
            stack.pop();
            dCurPosY = stack.top().GetReal();
            stack.pop();

> Cheers,
> Francesco
>
> _______________________________________________
> Podofo-users mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/podofo-users
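
To illustrate the point about "m" and "l": in the PDF content stream model these two are path construction operators (moveto and lineto), while the text position comes from the text matrix, which is reset by BT and changed by Td, TD, Tm and T*. Below is a rough sketch of tracking only that state. TextPosTracker and its members are hypothetical names of mine, not PoDoFo API, and the sketch assumes operands arrive as plain doubles in the order they are written in the stream. It does not apply the CTM and does not advance the position for the text-showing operators (Tj, TJ, ', "), since that needs font metrics.

```cpp
#include <array>
#include <cassert>
#include <string>
#include <vector>

// Hypothetical helper (not part of PoDoFo): tracks the text matrix and
// text line matrix as modified by the PDF text-positioning operators.
struct TextPosTracker {
    using Matrix = std::array<double, 6>; // [a b c d e f]

    Matrix textMatrix{1.0, 0.0, 0.0, 1.0, 0.0, 0.0};
    Matrix lineMatrix{1.0, 0.0, 0.0, 1.0, 0.0, 0.0};
    double leading = 0.0; // set by TL, or implicitly by TD

    // Feed one operator with its numeric operands in written order.
    void Apply(const std::string& op, const std::vector<double>& args) {
        if (op == "BT") {
            // A text object starts with both matrices set to identity.
            textMatrix = lineMatrix = Matrix{1.0, 0.0, 0.0, 1.0, 0.0, 0.0};
        } else if (op == "Td" && args.size() == 2) {
            MoveLine(args[0], args[1]);
        } else if (op == "TD" && args.size() == 2) {
            leading = -args[1]; // TD also sets the leading to -ty
            MoveLine(args[0], args[1]);
        } else if (op == "Tm" && args.size() == 6) {
            for (std::size_t i = 0; i < 6; ++i) {
                lineMatrix[i] = args[i];
            }
            textMatrix = lineMatrix;
        } else if (op == "TL" && args.size() == 1) {
            leading = args[0];
        } else if (op == "T*") {
            MoveLine(0.0, -leading);
        }
        // "m" and "l" build paths; they are deliberately ignored here.
    }

    double X() const { return textMatrix[4]; }
    double Y() const { return textMatrix[5]; }

private:
    // lineMatrix = [1 0 0 1 tx ty] x lineMatrix, then textMatrix = lineMatrix.
    void MoveLine(double tx, double ty) {
        lineMatrix[4] += tx * lineMatrix[0] + ty * lineMatrix[2];
        lineMatrix[5] += tx * lineMatrix[1] + ty * lineMatrix[3];
        textMatrix = lineMatrix;
    }
};
```

With this, feeding "BT", then "72 700 Td", then any number of "m" or "l" operators leaves X() and Y() at 72 and 700; only a later Td, TD, Tm or T* moves them.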
