[
https://issues.apache.org/jira/browse/PDFBOX-832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14108692#comment-14108692
]
Frank van der Hulst commented on PDFBOX-832:
--------------------------------------------
Hi John,
Thanks for your help.
I'm a newbie with open-source projects like this, and don't know how to do
the subversion stuff. So I've just attached all the files to the new ticket.
I have seen Tabula, and it sort of does what I want, but not quite, hence
my own code.
Frank
> Extract text from table, or find table co-ordinates from page. If there is no
> way to find out table, then just give co-ordinates of rectangle.
> ----------------------------------------------------------------------------------------------------------------------------------------------
>
> Key: PDFBOX-832
> URL: https://issues.apache.org/jira/browse/PDFBOX-832
> Project: PDFBox
> Issue Type: Improvement
> Components: Text extraction
> Affects Versions: 1.2.1
> Reporter: Pratik Thaker
>
> Please provide some mechanism to extract text from a table. If it is not
> possible to find out table in pdf then just provide co-ordinates of outer
> rectangle.
--
This message was sent by Atlassian JIRA
(v6.2#6252)