Ritika – We have had some discussions regarding docker and etc.  The public one 
that is out there builds a single node and does not use an RDBM.  I would not 
recommend using that to index billions of documents.  You can turn on debugging 
in the connector and look at the logs to see if that traffic is actually going 
to Elastic search.

Karl – I believe Ritika said Elastic.


--
Michael Cizmar


From: ritika jain <ritikajain5...@gmail.com>
Sent: Thursday, December 31, 2020 7:33 AM
To: user@manifoldcf.apache.org
Subject: Re: Indexation Not OK

Elastic search output connector with some custom changes for some fields

On Thursday, December 31, 2020, Karl Wright 
<daddy...@gmail.com<mailto:daddy...@gmail.com>> wrote:
Hi,
Can you let us know what you are using for the output connector?
Thanks,
Karl


On Thu, Dec 31, 2020 at 8:24 AM ritika jain 
<ritikajain5...@gmail.com<mailto:ritikajain5...@gmail.com>> wrote:
Hi,

I am using Manifoldcf 2.14 and JCIFS connector, to ingest some billions of 
records into elastic search
I am facing an issue in which when Job is run some time, successful indexation 
happens but after sometime , manifoldcf loops the records and Indexation is not 
getting OK.

[cid:image003.png@01D6DF61.14FDAFC0]

and it keeps on retrying for those specific records, then to again start up, I 
need to restart the docker container everytime and after restart Indexation 
works fine for those records too.
And also checked JSON formation of elastic search connector is fine, which 
sures that the files are not having any problem.
Can anybody please guide me the reason for this

Thanks
Ritika


Reply via email to