Hi,

The problem goes away when I increase the socket timeout from the mfc tika
connector edit page. I think "document ingest (Solr)" should not be OK when
there is such a problem.

Regards,
Cihad Güzel


Cihad Guzel <cguz...@gmail.com>, 20 Eki 2022 Per, 02:28 tarihinde şunu
yazdı:

>  Hi Julien,
>
> I ran the tika 2x service using the official tika available on docker hub.
> I am using MFC version 2.3. I activated the tika-service-rmeta connector
> for MFC. I created a job on mfc for a folder with 5 files in it. But OCR
> was not performed on some of the files. When I look at Solr, the content of
> some files seems empty. I also got the error messages found in the
> attachment.
>
> In the second test I made, this time I created 5 separate jobs to include
> each of the 5 files one by one. When I ran these jobs, I did not encounter
> any problems.
>
> When I send these 5 files directly to the tika-service using curl it also
> works correctly.
>
> When I examine the Simple History Report, I see error messages for some
> files as in the attached picture.
>
> Could Tika connector have a bug that will cause an error while sending
> multiple files to tika? Could it have something to do with this issue?
> https://issues.apache.org/jira/browse/CONNECTORS-1733
> [image: Screen Shot 2022-10-20 at 02.08.11.png]
> Regards,
> Cihad Güzel
>

Reply via email to