On Thu, 8 Jun 2017, tesm...@gmail.com wrote:
Thanks for your reply. I am calling Apache Tika in Java code like this:
public String extractPDFText(String faInputFileName) throws IOException, TikaException {
    // Handler for body text of the PDF article
    BodyContentHandler handler = new
>
>> My Tika code is not extracting the full body text of larger PDF files.
>>
>> Files larger than 1 MB and around 20 pages long are only partially extracted.
>> Is there any limit on input PDF file size in Tika?
>>
>
> How are you calling Apache Tika? Direct Java calls
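
On the size-limit question above: one common cause of partial output is not a file-size limit but the write limit of the content handler. A BodyContentHandler created with the no-argument constructor stops collecting text after 100,000 characters by default, and passing -1 to the constructor (new BodyContentHandler(-1)) disables that limit. The code quoted in this thread is cut off before the constructor call, so whether this applies here is an assumption. The sketch below is a plain-Java diagnostic, not part of Tika; the class and method names are hypothetical. It flags extracted text whose length sits exactly at the limit, which typically means the handler stopped early.

```java
// Hypothetical helper (not part of Tika): flags extracted text whose length
// sits exactly at a write limit, which suggests the handler stopped early.
final class WriteLimitCheck {

    // Default write limit of Tika's no-argument BodyContentHandler, in characters.
    static final int DEFAULT_WRITE_LIMIT = 100_000;

    // True when the text length equals the limit; a negative limit means "unlimited".
    static boolean looksTruncated(String extractedText, int writeLimit) {
        if (writeLimit < 0) {
            return false;
        }
        return extractedText.length() == writeLimit;
    }

    // Convenience overload that assumes the default limit was in effect.
    static boolean looksTruncated(String extractedText) {
        return looksTruncated(extractedText, DEFAULT_WRITE_LIMIT);
    }
}
```

If WriteLimitCheck.looksTruncated(text) returns true for the string returned by extractPDFText, constructing the handler with -1 is a reasonable first thing to try. A document whose text happens to be exactly 100,000 characters long would be a false positive.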