Hi,

The problem goes away when I increase the socket timeout on the MCF Tika connector's edit page. However, I think the "document ingest (Solr)" activity should not be reported as OK when a problem like this occurs.
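
To check the timeout behaviour outside MCF, the curl test can be repeated with an explicit socket timeout. The following is only a minimal sketch under assumptions that are not from this thread: it assumes tika-server is reachable at its default address `http://localhost:9998` and that files are sent with PUT to `/rmeta`. The helper name `send_to_tika` and the `timeout_s` parameter are hypothetical, and this is not how the MCF connector itself is implemented.

```python
import socket
import urllib.error
import urllib.request
from pathlib import Path

# Assumption: tika-server on its default port, using the /rmeta endpoint.
TIKA_RMETA_URL = "http://localhost:9998/rmeta"


def send_to_tika(path, url=TIKA_RMETA_URL, timeout_s=60.0):
    """PUT one file to Tika.

    Returns (http_status, response_bytes) on success, or ("timeout", None)
    when the socket timeout expires before a response arrives.
    """
    data = Path(path).read_bytes()
    req = urllib.request.Request(
        url,
        data=data,
        method="PUT",
        headers={
            "Accept": "application/json",
            # Without this, urllib would label the body as form data.
            "Content-Type": "application/octet-stream",
        },
    )
    try:
        with urllib.request.urlopen(req, timeout=timeout_s) as resp:
            return resp.status, resp.read()
    except (socket.timeout, TimeoutError):
        # Timeout while waiting for or reading the response.
        return "timeout", None
    except urllib.error.URLError as exc:
        # Timeouts during connect/send arrive wrapped in URLError.
        if isinstance(exc.reason, (socket.timeout, TimeoutError)):
            return "timeout", None
        raise
```

Calling `send_to_tika` for each of the five files with a small `timeout_s`, then again with a larger one, should show whether slow documents (for example ones that need OCR) only fail when the timeout is short.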
Regards,
Cihad Güzel

On Thu, 20 Oct 2022 at 02:28, Cihad Guzel <cguz...@gmail.com> wrote:

> Hi Julien,
>
> I ran the Tika 2.x service using the official Tika image available on
> Docker Hub. I am using MCF version 2.3. I activated the tika-service-rmeta
> connector for MCF and created a job for a folder containing 5 files.
> However, OCR was not performed on some of the files: when I look at Solr,
> the content of some files is empty. I also got the error messages in the
> attachment.
>
> In a second test, I created 5 separate jobs, one for each of the 5 files.
> When I ran these jobs, I did not encounter any problems.
>
> When I send these 5 files directly to the tika-service using curl, it also
> works correctly.
>
> When I examine the Simple History Report, I see error messages for some
> files, as in the attached picture.
>
> Could the Tika connector have a bug that causes errors when multiple files
> are sent to Tika? Could it have something to do with this issue?
> https://issues.apache.org/jira/browse/CONNECTORS-1733
>
> [image: Screen Shot 2022-10-20 at 02.08.11.png]
>
> Regards,
> Cihad Güzel