On Thu, 17 Jul 2014, Shannon Brown wrote:
Problem: How to avoid Out of Memory errors during Tika parsing.
Typical approaches are either to use the ForkParser, or the Tika Server. Both ensure that if there's a fatal problem with parsing (eg OOM) then the JVM with your main application in it doesn't die too
For cases where it does die, log it, and if possible report a bug with the file in question, so we can hopefully fix it for the next release!
Nick