On 6/4/2018 5:36 AM, Greenhorn Techie wrote:
1. In the SolrCloud, as a single host can have information about multiple
shards (either leader or replica), how does the backup API handle the
underlying data copy? I presume it will simply copy the data across ALL the
shards (both leader and replicas) for the specified collection.

The Collections API backup would indeed work this way.

I see this line of code in the patch for SOLR-5750:

log.debug("Sent backup requests to all shard leaders for snapshotName={}", backupName);

So it sounds like the leader replica will write the backup for each shard.

2. If I am invoking the backup command periodically to backup the data and
then invoke restore command later (possibly due to cluster shutdown and
create a fresh SolrCloud cluster), I presume I don't need to tinker with
the hash values as long as the default settings have been used in both
backup and restore situations?

The Collections API restore capability creates a new collection from the backup.  The backup includes information gathered from ZK.  The restored collection should have all the same hash ranges found in the original collection.

Thanks,
Shawn

Reply via email to