Re: Limit on input PDF file size in Tika?

2017-06-08 Thread tesm...@gmail.com
) { System.out.println("Exception caught:"); } //Convert the body handler to string and return the string to the calling function return handler.toString(); } Regards, On Thu, Jun 8, 2017 at 4:29 PM, Nick Burch <apa...@gagravarr.org> wrote: > On Thu, 8 Jun 2017, tesm...@gmail.com

Grobid with TXT and HTML files

2017-06-08 Thread tesm...@gmail.com
https://github.com/USCDataScience/parser-indexer- > py/tree/master/parser-server > [4] https://github.com/USCDataScience/parser-indexer- > py/blob/master/docs/parser-index-journals.md > > *--* > *Thamme Gowda* > TG | @thammegowda <https://twitter.com/thammegowda> >

Re: Analysing a document sections with Apache Tika

2017-05-04 Thread tesm...@gmail.com
> Thamme > > [1] http://grobid.readthedocs.io/en/latest/Introduction/ > [2] https://wiki.apache.org/tika/GrobidJournalParser > [3] https://github.com/USCDataScience/parser-indexer- > py/tree/master/parser-server > [4] https://github.com/USCDataScience/parser-indexer- > py