On Thu, 2013-10-24 at 13:27 +0200, yriveiro wrote:
> The motivation of this is simple, I want have replication but I don't want
> have n replicas actives with full resources allocated (cache and so on).
> This is usefull in enviroments where replication is needed but a high query
> throughput is not fundamental and the resources are limited.

Coincidentally we recently talked about the exact same setup.

We are looking at sharding a 20 TB index into 20 * 1 TB shards, each
located on their own dedicated physical SSD, which has more than enough
horsepower for our needs. For replication, we have a remote storage
system capable of serving requests for 2-4 shards with acceptable
latency.

Projected performance for the SSD setup is superior (5-10 times) to our
remote storage, so we would like to hit only the SSDs if possible.
Setting up a cloud to issue all requests to the SSD-shards unless a
catastrophic failure happened to on of them and in that case fallback to
the remote story replica for only that shard, would be perfect.

> I know that right now is not possible, but I think that it's a feature that
> can be implemented in a easy way creating a new status for shards.

shardIsLastResort=true? On paper it seems like a simple addition, but I
am not at familiar enough with the SolrCloud-code to guess if it is easy
to implement.

- Toke Eskildsen, State and University Library, Denmark


Reply via email to