Hi, I plan to set up an HBase installation on EC2. As recommended, I want to set up a ZooKeeper ensemble with 3 nodes, but I'm not sure what kind of storage I have to choose for the two ZooKeeper directories (dataDir and dataLogDir). Do these two directories need to be on persistent storage that survives a node crash? Or does an ephemeral storage device suffice, since a failed node that is restarted is synchronized with the other two nodes anyway? And what happens when I restart the whole ZooKeeper ensemble on ephemeral storage, which means no ZooKeeper data is available after booting up? Would that have any impact on the HBase cluster?
I've read through the documentation, but I was not able to answer these questions.

Thanks,
Yves
