[ https://issues.apache.org/jira/browse/HDFS-2821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tsz Wo (Nicholas), SZE updated HDFS-2821:
-----------------------------------------

    Assignee: Devaraj K
    
> Improve the Balancer to move data from over utilized nodes to under utilized 
> nodes using balanced nodes
> -------------------------------------------------------------------------------------------------------
>
>                 Key: HDFS-2821
>                 URL: https://issues.apache.org/jira/browse/HDFS-2821
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>    Affects Versions: 0.20.205.0, 0.24.0, 0.23.1
>            Reporter: Devaraj K
>            Assignee: Devaraj K
>
> h5.Cluster State Before Balancer Run:
> ||Node||Last Contact||Admin State||Configured Capacity(TB)||Used(TB)||Non DFS Used(TB)||Remaining(TB)||Used(%)||Remaining(%)||Blocks||
> |xxx-x-xx-n1|0|In Service|4.25|1.76|0.84|1.65|41.34|38.86|8465|
> |xxx-x-xx-n2|1|In Service|6.03|1.76|0.94|3.33|29.1|55.24|8465|
> |xxx-x-xx-n3|2|In Service|6.93|1.76|0.99|4.18|25.35|60.31|8465|
> |xxx-x-xx-n4|2|In Service|10.5|0|0.54|9.97|0|94.9|0|
> \\
> \\
> h5.Cluster State After Balancer Run:
> ||Node||Last Contact||Admin State||Configured Capacity(TB)||Used(TB)||Non DFS Used(TB)||Remaining(TB)||Used(%)||Remaining(%)||Blocks||
> |xxx-x-xx-n1|2|In Service|4.25|0.95|0.84|2.46|22.36|57.84|4830|
> |xxx-x-xx-n2|1|In Service|6.03|1.2|0.94|3.88|19.95|64.4|5858|
> |xxx-x-xx-n3|0|In Service|6.93|1.38|0.99|4.56|19.9|65.76|6327|
> |xxx-x-xx-n4|2|In Service|10.5|1.74|0.54|8.23|16.53|78.37|8383|
> \\
> Currently the balancer moves data from over-utilized nodes to under-utilized 
> nodes, and this process continues until the cluster is balanced or there is 
> no more data that can be moved from source to destination. During this 
> process, once a node's usage reaches avgUtilization, that node no longer 
> participates in balancing.
> The tables above show the cluster usage before and after a balancer run 
> with a threshold of 1. After the balancer completes, n1 is still 
> over-utilized and n4 is still under-utilized. This may be because n4 already 
> contains all the blocks that are present on n1. I feel this can be improved 
> by moving data from over-utilized nodes to balanced nodes, and then from 
> balanced nodes to under-utilized nodes.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
