chelambarasan created TIKA-2496:
-----------------------------------
Summary: TIKA crashes / runs out of memory on simple PDF
Key: TIKA-2496
URL: https://issues.apache.org/jira/browse/TIKA-2496
Project: Tika
Issue Type: Bug
Components: core
Affects Versions: 1.13
Environment: Linux, Java 8
Reporter: chelambarasan
Fix For: 2.0, 1.14
We're using TIKA embedded in a webcrawler and today I've encountered a PDF that
results in OutOfMemory errors while being processed by TIKA.
It's a small, 1 page PDF file, so I don't think that it should consume that
much memory.
I verified the problem by using the GUI from the tika-app-1.13.jar file and
that results in the same error on the same file. The file can be found at:
http://www.spesmea.nl/pdf/algemene_voorwaarden_bbztcn_2010_nl.pdf
If I can help by providing any additional information, please let me know.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)