[ 
https://issues.apache.org/jira/browse/HDFS-17646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17890912#comment-17890912
 ] 

ASF GitHub Bot commented on HDFS-17646:
---------------------------------------

huangzhaobo99 commented on PR #7120:
URL: https://github.com/apache/hadoop/pull/7120#issuecomment-2422243631

   > Good idea. Only one concerns, if give one small limitTopNodes is it 
possible to impact DataNode service when trigger balancer. Such as one corner 
case, given set limitTopNodes=1 and there are thousands underUtilized nodes, 
then the top one datanode will involve transfer flood, right? So IMO we should 
give some caution information to end user at least. FYI. Thanks.
   
   Similar to using the '-source' param, top one datanode will involve transfer 
flood, this corner case will continue to migrate data from the node until the 
conditions for Balance are met.
   However, the blocks and speed of each migration are affected by other 
params, and the impact on the cluster is controllable.




> Add Option to limit Balancer prefer highly utilized nodes num in each 
> iteration
> -------------------------------------------------------------------------------
>
>                 Key: HDFS-17646
>                 URL: https://issues.apache.org/jira/browse/HDFS-17646
>             Project: Hadoop HDFS
>          Issue Type: New Feature
>            Reporter: Zhaobo Huang
>            Assignee: Zhaobo Huang
>            Priority: Major
>              Labels: pull-request-available
>
> Limit the number of topN nodes to avoid excessive nodes affecting cluster 
> stability.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org

Reply via email to