Stack wrote:

I'm 0 on this.

-I would worry if the exclusion list were used by the NN to do its blacklisting; I'm glad to see this isn't happening. Yes, you could pick up datanode failure faster, but you would also be vulnerable to a user mounting a DoS against the cluster by reporting every DN as failing.
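To make the distinction concrete, here is a minimal sketch (not real HDFS code; all names are illustrative) of the HDFS-630 behaviour: the client keeps its own list of datanodes that failed *for it* and passes that list along when asking the NN for a block target. The NN honours the exclusion only for that one placement decision, so a malicious client reporting every DN as dead hurts nobody but itself:

```python
# Illustrative sketch of client-scoped datanode exclusion (HDFS-630 style).
# The exclusion list lives in the client and only influences placement
# requests made by that client -- the NN never blacklists nodes from it.

def choose_datanode(live_datanodes, excluded):
    """NN-side placement: skip nodes this particular client reported."""
    for dn in live_datanodes:
        if dn not in excluded:
            return dn
    return None  # no usable node for this client

class BlockWriter:
    """Hypothetical client that remembers its own failed datanodes."""

    def __init__(self, cluster_nodes):
        self.cluster_nodes = cluster_nodes
        self.excluded = set()  # private to this client, not cluster state

    def next_target(self):
        return choose_datanode(self.cluster_nodes, self.excluded)

    def report_failure(self, dn):
        self.excluded.add(dn)  # affects only this client's future requests

writer = BlockWriter(["dn1", "dn2", "dn3"])
assert writer.next_target() == "dn1"
writer.report_failure("dn1")      # dn1 failed for us; retry elsewhere
assert writer.next_target() == "dn2"
```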

-Russ Perry's work on high-speed Hadoop rendering [1] tweaked Hadoop so that clients get the entire list of nodes holding the data and make their own decision about where to read it from. This (1) pushes the policy of handling failure down to the clients, with less need to talk to the NN about it, and (2) lets you do something very fancy: deliberately pull data from different DNs, so that you can read data off the cluster at the full bandwidth of every disk.
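A rough sketch of the second point, under assumed names (this is not the real HDFS client API): given every replica location per block, the client can schedule its reads so they spread across distinct datanodes, letting a bulk read approach the aggregate disk bandwidth of the cluster:

```python
# Illustrative client-side replica scheduling in the spirit of Russ
# Perry's approach: the client sees all replica locations and balances
# its reads across datanodes instead of asking the NN to choose.

from collections import defaultdict

def schedule_reads(block_replicas):
    """block_replicas: {block_id: [datanodes holding a replica]}.
    Returns {block_id: chosen datanode}, greedily balancing per-node load."""
    load = defaultdict(int)
    plan = {}
    for block, replicas in block_replicas.items():
        # Pick the replica holder with the fewest reads scheduled so far,
        # so concurrent fetches hit as many distinct disks as possible.
        dn = min(replicas, key=lambda d: load[d])
        plan[block] = dn
        load[dn] += 1
    return plan

plan = schedule_reads({
    "blk_1": ["dnA", "dnB"],
    "blk_2": ["dnA", "dnB"],
    "blk_3": ["dnB", "dnC"],
})
# The three blocks land on three different datanodes, so all three
# reads can proceed at full disk speed in parallel.
assert sorted(plan.values()) == ["dnA", "dnB", "dnC"]
```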

Long term, I would like to see Russ's addition go in, so I wonder whether the HDFS-630 patch would be useful long term. Maybe it's a more fundamental issue: where does the decision making go, into the clients or into the NN?

-steve



[1] http://www.hpl.hp.com/techreports/2009/HPL-2009-345.html
