I figured out what caused the slow balancing. Starting the balancer with too small a threshold decreases its speed dramatically:

./start-balancer.sh -threshold 0.01
2011-05-05 17:17:04,132 INFO org.apache.hadoop.hdfs.server.balancer.Balancer: Will move 1.26 GBbytes in this iteration
2011-05-05 17:17:36,684 INFO org.apache.hadoop.hdfs.server.balancer.Balancer: Will move 1.26 GBbytes in this iteration
2011-05-05 17:18:09,737 INFO org.apache.hadoop.hdfs.server.balancer.Balancer: Will move 1.26 GBbytes in this iteration
2011-05-05 17:18:41,977 INFO org.apache.hadoop.hdfs.server.balancer.Balancer: Will move 1.26 GBbytes in this iteration

as opposed to:

./start-balancer.sh
2011-05-05 17:19:01,676 INFO org.apache.hadoop.hdfs.server.balancer.Balancer: Will move 40 GBbytes in this iteration
2011-05-05 17:21:36,800 INFO org.apache.hadoop.hdfs.server.balancer.Balancer: Will move 30 GBbytes in this iteration
2011-05-05 17:24:13,191 INFO org.apache.hadoop.hdfs.server.balancer.Balancer: Will move 30 GBbytes in this iteration

I'd expect the threshold to only affect the stopping condition (how balanced the cluster must end up), not the balancing speed itself. Perhaps a bug?
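For what it's worth, my reading of the threshold (not verified against the source) is that it is expressed in percentage points of capacity, so a back-of-the-envelope for our cluster looks like this (figures taken from my first mail):

# Sketch only: assumes the Balancer threshold is in percentage points of capacity.
# Cluster from below: 4 nodes at ~25% usage, 11 empty nodes.
avg=$(echo "scale=2; (4 * 25) / 15" | bc)    # average utilization, ~6.67%
echo "with -threshold 0.01: node balanced if |usage - $avg| <= 0.01"
echo "with default threshold: node balanced if |usage - $avg| <= 10"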

On 05/05/2011 03:43 PM, Ferdy Galema wrote:
The decommissioning was performed solely with refreshNodes, but that's somewhat beside the point, because the balancing tests were performed after I re-added the 11 empty nodes. (FYI, the drives were reformatted with another Unix filesystem.) Though I did notice that the decommissioning showed about the same metrics as the balancer test afterwards; that is, not very fast either.

On 05/05/2011 02:57 PM, Mathias Herberts wrote:
Did you explicitly start a balancer or did you decommission the nodes
using dfs.hosts.exclude and a dfsadmin -refreshNodes?
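For reference, the exclude-based procedure I mean is roughly the sketch below; the exclude file path and hostname are assumptions, and the file must be whatever dfs.hosts.exclude points at:

echo "node05.example.com" >> /etc/hadoop/conf/excludes  # hypothetical host and path
hadoop dfsadmin -refreshNodes                           # namenode re-reads the exclude list and starts decommissioning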

On Thu, May 5, 2011 at 14:30, Ferdy Galema<ferdy.gal...@kalooga.com> wrote:
Hi,

On our 15-node cluster (1Gb Ethernet and 4x1TB disks per node) I noticed that distcp does a much better job at rebalancing than the dedicated balancer does. We needed to decommission 11 nodes, so prior to rebalancing we had 4 used and 11 empty nodes. The 4 used nodes had about 25% usage each. Most of our files are of average size: we have about 500K files in 280K blocks, and 800K blocks in total (blocksize is 64MB).

So I changed dfs.balance.bandwidthPerSec to 800100100 and restarted the cluster. I started the balancer tool and noticed that it moved about 200GB in 1 hour. (I grepped the balancer log for "Need to move".)
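For completeness, the setting went inside the <configuration> element of hdfs-site.xml, roughly as below (the value is the one I used; the comment is my own reading of the unit):

<property>
  <name>dfs.balance.bandwidthPerSec</name>
  <!-- max bytes/sec each datanode may spend on balancing; ~763 MB/s here, effectively uncapped on 1Gb Ethernet -->
  <value>800100100</value>
</property>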

After stopping the balancer I started a distcp. This tool copied 900GB in just 45 minutes; with an average replication of 2, its total throughput was around 2.4 TB/hour. Fair enough, it is not purely rebalancing, because the 4 overused nodes also receive new blocks, but it still performs much better. Munin confirms the much higher disk/Ethernet throughput of the distcp.
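The distcp itself was a plain copy within the same cluster, roughly like this (the paths are placeholders):

hadoop distcp /data /data-copy   # hypothetical paths; replicas of the copied blocks land mostly on the empty nodes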

Are these characteristics to be expected? Either way, can the balancer be boosted even more? (Aside from the dfs.balance.bandwidthPerSec property.)

Ferdy.
