Re: Ok to share ZK nodes with Hadoop nodes?

Patrick Hunt Mon, 08 Mar 2010 11:23:35 -0800

See the troubleshooting page, some apropos detail there (esp relative tovirtual env).


http://wiki.apache.org/hadoop/ZooKeeper/Troubleshooting

ZK servers are sensitive to IO (disk/network) latency. As long as youaren't very sensitive latency requirements it should be fine. If themachine were to swap for example, or the JVM were to go into long termGC (visualization in particular kills jvm gc) that would be bad.

Best practice for "on-line production serving" is 5 dedicated hosts with"shared nothing", physically distributed thoughout the data center (5hosts in a rack might not be the best idea for super reliability).There's alot of lee-way though, many ppl run with 3 and spof on switchfor example.


Patrick

David Rosenstrauch wrote:

I'm contemplating an upcoming zookeeper rollout and was wondering whatthe zookeeper brain trust here thought about a network deployment question:
Is it generally considered bad practice to just deploy zookeeper on ourexisting hdfs/MR nodes? Or is it better to run zookeeper instances ontheir own dedicated nodes?
On the one hand, we're not going to be making heavy-duty use ofzookeeper, so it might be sufficient for zookeeper nodes to share boxresources with HDFS & MR. On the other hand, though, I don't wantzookeeper to become unavailable if the nodes are running a resourceintensive job that's hogging CPU or network.
What's generally considered best practice for Zookeeper?

Thanks,

DR

Re: Ok to share ZK nodes with Hadoop nodes?

Reply via email to