To understand shard & replica, let's first understand what is sharding and
why it is needed.
Sharding - Assume your index grows large that it doesn't fit into a single
machine (for e.g. your index size is 80GB and your machine is 64GB in which
case index won't fit into memory). Now to get better
A collection in SolrCloud is a logical entity that encapsulates documents
that confirm to a shared schema. As a distributed system, the data needs to
be split and so the collection is logically split into 'Shards'.
Shard(s):
* don't represent a physical index.
* are logical entities
Replica:
*
Hey,
I have have the same question on freenode channel , people answered me ,
but I believe that I still got doubts. Just because I never had approach
to such data store technologies before it makes me hardly understand
what is exactly is replica and shard in solr. I believe once I
understand