mbiso created CONNECTORS-1778:
---------------------------------
Summary: Error: Repeated service interruptions - failure
processing document: The target server failed to respond
Key: CONNECTORS-1778
URL: https://issues.apache.org/jira/browse/CONNECTORS-1778
Project: ManifoldCF
Issue Type: Bug
Components: Tika extractor
Affects Versions: ManifoldCF 2.28
Reporter: mbiso
Assignee: Piergiorgio Lucidi
Hi.
I have a job ingesting a windows network share.
It use tika server (standalone)
There are many errors on Tika because some files cause error like:
{code:java}
ERROR [qtp131037934-61] 10:44:03,903
org.apache.poi.xssf.eventusermodel.XSSFSheetXMLHandler Failed to parse SST
index '1356 '
java.lang.NumberFormatException: For input string: "1356 " {code}
The errors cause a restart of a child tika process, and this is reported like
an interruption in the ManifoldCF job.
It ends with the message: "Error: Repeated service interruptions - failure
processing document: The target server failed to respond"
How could I get over this issue?
I have opened an [issue |[TIKA-4494]
org.apache.poi.xssf.eventusermodel.XSSFSheetXMLHandler Failed to parse SST
index - ASF JIRA] on Tika as well, but It could be a right behaviour on Tika:
many errors cause a restart child process, so this is a problem for me.
Any suggestion?
Thanks a lot.
Mario Bisonti
--
This message was sent by Atlassian Jira
(v8.20.10#820010)