> On Oct 7, 2016, at 11:35 AM, Christopher Begley > <[email protected]> wrote: > […] My task to to basically map ALL elements on a page of a pdf document. > This includes text, color boxes, highlights, underlines, lines, curves, > images, etc. > > Does there exist a way to dump all objects on a page and then retrieve > information about each object? (Specifically, coordinates that can then be > mapped to page coordinates in another file format).
Start by looking at PDFGraphicsStreamEngine. There are usage examples in the source tree. -- Manuel Aristarán http://jazzido.com

