On 5/2/2018 11:44 AM, Adam Blank wrote:
> Shawn - Sorry if I mixed up terminology, but by standalone I meant having a
> single Zk instead of a Zk ensemble.  So I would reconfigure the remaining
> Zk and Solr node to only use that single Zk.  To your point about the
> config data being stored in Zk, I'm thinking that should be alright in this
> case since the remaining Zk should already have that data stored?

If the goal is to avoid manual intervention in the face of failure, the
scenario you've described does not meet that goal.  You would have to
reconfigure and restart services.  Adding another machine for a third ZK
would eliminate the need to do that.

Andor's assertion that the machine should have similar horsepower to the
others would only apply if the only thing those machines are running is
ZK.  If you've got two beefy machines doing both Solr and ZK and a third
machine doing only ZK, the third machine does not need as much
horsepower.  It should certainly not be slow, but it's not going to need
the same level of storage and memory that the other two will.  A lot of
the CPU horsepower on the two beefy machines would be used for Solr, so
the third machine probably doesn't need as much CPU either.

For an install where you're trying to get by with two machines, it is
highly unlikely that SolrCloud would put much of a load on ZK.  Most of
what SolrCloud writes to ZK is cluster state changes -- machines going
up/down, collections created/deleted, etc.  When a SolrCloud cluster is
stable, there will be very little written to ZK.  Earlier I said that
when ZK loses quorum, SolrCloud will go read-only.  This is true, but it
does NOT mean that index updates go through ZK.  They do not.  SolrCloud
switches to read-only when ZK quorum goes away to protect itself from
split-brain problems.

Thanks,
Shawn

Reply via email to