[ 
https://issues.apache.org/jira/browse/CONNECTORS-1778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

mbiso updated CONNECTORS-1778:
------------------------------
    Description: 
Hi.

I have a job ingesting a windows network share.

It use tika server (standalone)

There are many errors on Tika because some files cause error like:

 
{code:java}
ERROR [qtp131037934-61] 10:44:03,903 
org.apache.poi.xssf.eventusermodel.XSSFSheetXMLHandler Failed to parse SST 
index '1356 '

java.lang.NumberFormatException: For input string: "1356 " {code}
The errors cause a restart of a child tika process, and this is reported like 
an interruption in the ManifoldCF job.
It ends with the message: "Error: Repeated service interruptions - failure 
processing document: The target server failed to respond"

 

How could I get over this issue?

I have opened an issue [TIKA-4494 ] on Tika as well,  but It could be a right 
behaviour on Tika: many errors cause a restart child process, so this is a 
problem for me.

 

Any suggestion?
Thanks a lot.

Mario Bisonti

  was:
Hi.

I have a job ingesting a windows network share.

It use tika server (standalone)

There are many errors on Tika because some files cause error like:

 
{code:java}
ERROR [qtp131037934-61] 10:44:03,903 
org.apache.poi.xssf.eventusermodel.XSSFSheetXMLHandler Failed to parse SST 
index '1356 '

java.lang.NumberFormatException: For input string: "1356 " {code}
The errors cause a restart of a child tika process, and this is reported like 
an interruption in the ManifoldCF job.
It ends with the message: "Error: Repeated service interruptions - failure 
processing document: The target server failed to respond"

 

How could I get over this issue?

I have opened an [issue |[TIKA-4494] 
org.apache.poi.xssf.eventusermodel.XSSFSheetXMLHandler Failed to parse SST 
index - ASF JIRA] on Tika as well,  but It could be a right behaviour on Tika: 
many errors cause a restart child process, so this is a problem for me.

 

Any suggestion?
Thanks a lot.

Mario Bisonti


> Error: Repeated service interruptions - failure processing document: The 
> target server failed to respond
> --------------------------------------------------------------------------------------------------------
>
>                 Key: CONNECTORS-1778
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1778
>             Project: ManifoldCF
>          Issue Type: Bug
>          Components: Tika extractor
>    Affects Versions: ManifoldCF 2.28
>            Reporter: mbiso
>            Assignee: Piergiorgio Lucidi
>            Priority: Major
>
> Hi.
> I have a job ingesting a windows network share.
> It use tika server (standalone)
> There are many errors on Tika because some files cause error like:
>  
> {code:java}
> ERROR [qtp131037934-61] 10:44:03,903 
> org.apache.poi.xssf.eventusermodel.XSSFSheetXMLHandler Failed to parse SST 
> index '1356 '
> java.lang.NumberFormatException: For input string: "1356 " {code}
> The errors cause a restart of a child tika process, and this is reported like 
> an interruption in the ManifoldCF job.
> It ends with the message: "Error: Repeated service interruptions - failure 
> processing document: The target server failed to respond"
>  
> How could I get over this issue?
> I have opened an issue [TIKA-4494 ] on Tika as well,  but It could be a right 
> behaviour on Tika: many errors cause a restart child process, so this is a 
> problem for me.
>  
> Any suggestion?
> Thanks a lot.
> Mario Bisonti



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to