I think this is better done with Poppler. PoDoFo does not offer you
any information about page contents above of PdfContentsTokenizer. So,
you will have to collect all the geometric information by yourself, do
all the matrix transformations etc .... even before you can start to
do any comparisons.

Regards,
  Dom

On Mon, Oct 10, 2011 at 2:13 PM, Alec Taylor <[email protected]> wrote:
> Good afternoon,
>
> Do you have some recommends and/or sample code for comparing textual
> and geometric layout information across pages?
>
> Basically I'm trying to realise patterns within documents, e.g., page
> numbers, header and footers, title, column information &etc; using the
> capabilities of the PoDoFo PDF library.
>
> Thanks for all suggestions,
>
> Alec Taylor
>
> ------------------------------------------------------------------------------
> All the data continuously generated in your IT infrastructure contains a
> definitive record of customers, application performance, security
> threats, fraudulent activity and more. Splunk takes this data and makes
> sense of it. Business sense. IT sense. Common sense.
> http://p.sf.net/sfu/splunk-d2dcopy1
> _______________________________________________
> Podofo-users mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/podofo-users
>

------------------------------------------------------------------------------
All the data continuously generated in your IT infrastructure contains a
definitive record of customers, application performance, security
threats, fraudulent activity and more. Splunk takes this data and makes
sense of it. Business sense. IT sense. Common sense.
http://p.sf.net/sfu/splunk-d2dcopy1
_______________________________________________
Podofo-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/podofo-users

Reply via email to