RE: Very slow parsing of a few PDF files

2017-11-27 Thread Jim Idle
Thanks for the mockparser pointer - I saw that when looking at the tests for ForkParser. I am going to have to write my own application specific solution for this as ForkParser tries to serialize every class it things will be needed across the connection and a lot of third party classes are not

RE: Very slow parsing of a few PDF files

2017-11-27 Thread Allison, Timothy B.
The ForkParser does have the ability to kill and restart on permanent hangs. We don't have the RecursiveParserWrapper integrated into the ForkParser currently...patches are welcomed. At the Tika level, we generally don't check for a Thread.interrupted() because our dependencies don't do it.