Hi dev community,

 

I am trying to develop a connector for an API that exposes a hierarchical
tree of documents: each document can have child documents.

During the initial crawl, the connector registers the child documents with
MCF through the method
activities.addDocumentReference(childDocumentIdentifier,
parentDocumentIdentifier, parentDataNames, parentDataValues)

The API can report the modifications and deletions made since a given date
but, when a document that has children is deleted, the API returns only the
id of that document, not the ids of its children. On the MCF connector side,
I assumed that, because the children had been referenced, deleting the parent
document would delete all of its children as well, but that does not appear
to be the case.
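
To make the expectation concrete, here is a minimal sketch of the cascade
described above. It is plain Java, not MCF code: the class
CascadeDeleteSketch, the method withDescendants and the childrenByParent map
are hypothetical names used only for illustration, and the sketch assumes the
parent/child references are available as an in-memory map.

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

public class CascadeDeleteSketch {
    /**
     * Returns the deleted document id together with the ids of all of its
     * descendants, following the (hypothetical) parent -> children map.
     */
    public static Set<String> withDescendants(String deletedId,
                                              Map<String, List<String>> childrenByParent) {
        Set<String> toDelete = new LinkedHashSet<>();
        Deque<String> pending = new ArrayDeque<>();
        pending.push(deletedId);
        while (!pending.isEmpty()) {
            String id = pending.pop();
            // add() returns false for an id already collected, which also
            // protects against cycles in the reference data.
            if (toDelete.add(id)) {
                for (String child : childrenByParent.getOrDefault(id, List.of())) {
                    pending.push(child);
                }
            }
        }
        return toDelete;
    }
}
```

With such a traversal, a deletion reported for a parent id alone would expand
to the parent plus every child registered beneath it.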

So my question is: did I miss something? Is there another way to perform
delta deletions? Unfortunately, if I cannot find a way to solve this issue, I
will not be able to take advantage of the delta feature, and I will have to
use the "add_modify" connector type and test every id on each delta crawl to
figure out which ids are missing. This would be a huge performance loss.
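
For reference, the fallback mentioned above amounts to a set difference
between the ids known from the previous crawl and the ids the API still
reports. The sketch below only illustrates that comparison; MissingIdCheck and
findMissingIds are hypothetical names, not part of the MCF API, and it assumes
both id sets fit in memory.

```java
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

public class MissingIdCheck {
    /**
     * Returns the ids that were known from a previous crawl but are no
     * longer present in the repository, in the iteration order of knownIds.
     */
    public static Set<String> findMissingIds(Set<String> knownIds,
                                             Set<String> idsStillInRepository) {
        Set<String> missing = new LinkedHashSet<>(knownIds);
        missing.removeAll(idsStillInRepository);
        return missing;
    }

    public static void main(String[] args) {
        // Hypothetical ids: a parent and its two children have disappeared.
        Set<String> known =
            new LinkedHashSet<>(List.of("parent", "child-1", "child-2", "other"));
        Set<String> stillPresent = Set.of("other");
        System.out.println(findMissingIds(known, stillPresent));
    }
}
```

The cost concern is that building idsStillInRepository requires checking every
known id against the API on each delta crawl, instead of relying on the
deletions the API reports.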

 

Regards,

Julien Massiera
