Hello,

The size and complexity of a collection that I'm running on a SolrCloud
(v7.5) has recently grown to the point where it warranted splitting the
collection into two shards. I run the data import handler once a day to
index documents returned by a MSSQL stored proc. Previously, on the
single-shard collection, when the DIH encountered a document that was
missing a required field or otherwise couldn't be indexed, it would throw a
warning into the log and continue. Now, with a doubly-sharded collection, a
similar event causes the entire DIH full import to fail with a
DistributedUpdatesAsyncException when posting that document to another
node. I was wondering if this is a known issue with the DIH as of 7.5 and
if there's a way to have the DistributedUpdateProcessor sort of "warn and
continue" when this type of document is encountered.

Thanks in advance!
Jack

Reply via email to