Re: Batch replica transfer from master shard solr cloud

Shawn Heisey Fri, 17 Oct 2014 09:32:52 -0700

On 10/17/2014 8:50 AM, Cp Mishra wrote:

So, we changed the logic to:


-Read SolrInputDocument objects from stream in batches of 500.

-Add  documents to ConcurrentUpdateSolrServer instance

-Index documents in a loop

This has improved indexing speed significantly.

What are the caveats to this approach?

Thinking back to when we were having deadlock problems with heavyindexing on SolrCloud, I seem to recall that one of the experts saidthat SolrCloud already does batch the documents, only it was 10 at atime. I also seemed to remember that making the batch size configurablewas discussed, but I don't know how discussion ended. Am I rememberingincorrectly?


I'm not familiar with the actual code for this part of Solr at all.

Thanks,
Shawn


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Re: Batch replica transfer from master shard solr cloud

Reply via email to