> On Dec 28, 2016, at 8:18 AM, Tilman Hausherr <[email protected]> wrote:
> 
> […]
> Try also https://github.com/tabulapdf/ <https://github.com/tabulapdf/> , I 
> wonder how they handle this problem.

Hi, main author of Tabula here.

We've come across that case many times. Some spreadsheet->PDF generators clip a 
cell's content to the extent of its container. We handle it by simply detecting 
whether a character is inside the current clipping path [1].

Cheers,

[1] 
https://github.com/tabulapdf/tabula-java/blob/master/src/main/java/technology/tabula/ObjectExtractor.java#L342

—
Manuel Aristarán <[email protected]>
http://jazzido.com



Reply via email to