[ https://issues.apache.org/jira/browse/TIKA-2496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

chelambarasan updated TIKA-2496:
--------------------------------
    Description: 
We're using TIKA embedded in a webcrawler and today I've encountered a PDF that 
results in OutOfMemory errors while being processed by TIKA.

Error as below:

  was:
We're using TIKA embedded in a webcrawler and today I've encountered a PDF that 
results in OutOfMemory errors while being processed by TIKA.

It's a small, 1 page PDF file, so I don't think that it should consume that 
much memory.

I verified the problem by using the GUI from the tika-app-1.13.jar file and 
that results in the same error on the same file. The file can be found at:

http://www.spesmea.nl/pdf/algemene_voorwaarden_bbztcn_2010_nl.pdf

If I can help by providing any additional information, please let me know.


> TIKA crashes / runs out of memory on simple PDF
> -----------------------------------------------
>
>                 Key: TIKA-2496
>                 URL: https://issues.apache.org/jira/browse/TIKA-2496
>             Project: Tika
>          Issue Type: Bug
>          Components: core
>    Affects Versions: 1.13
>         Environment: Linux, Java 8
>            Reporter: chelambarasan
>             Fix For: 2.0, 1.14
>
>
> We're using TIKA embedded in a webcrawler and today I've encountered a PDF 
> that results in OutOfMemory errors while being processed by TIKA.
> Error as below:



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
