cndaimin opened a new pull request #4078: URL: https://github.com/apache/hadoop/pull/4078
Decommission always fails when we decommission multiple nodes at once on a cluster that does not have enough racks, for example a cluster with 6 racks running RS-6-3. We found that the decommissioning nodes cover at least one whole rack, so in effect we are decommissioning one or more racks. Rack decommission is not well supported currently, especially on a cluster that is already short of racks. This patch adds `numOfExcludedRacks` to indicate how many racks are being decommissioned (excluded) and fixes the calculation in `BlockPlacementStatusDefault#getAdditionalReplicasRequired`. In `ErasureCodingWork#addTaskToDatanode`, it also reorders the processing so that decommission is handled first, which matters most when racks are insufficient.
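To make the rack arithmetic concrete, here is a minimal Java sketch of how an excluded-rack count could change the "additional replicas required" result. It is not the code from this patch: the class `RackPlacementSketch`, the method signature, and the assumption that `currentRacks` counts only racks holding replicas on in-service nodes are all hypothetical, chosen for illustration. Only the name `numOfExcludedRacks` comes from the description above.

```java
// Hypothetical illustration only; not the actual BlockPlacementStatusDefault code.
final class RackPlacementSketch {

    private RackPlacementSketch() {
    }

    /**
     * Returns how many more racks the block should be spread over.
     *
     * Assumptions (not taken from the patch):
     *  - currentRacks counts racks holding replicas on in-service nodes only;
     *  - requiredRacks is the rack count the placement policy would like;
     *  - totalRacks is the number of racks in the cluster;
     *  - numOfExcludedRacks is the number of racks fully under decommission.
     */
    static int additionalReplicasRequired(int currentRacks, int requiredRacks,
                                          int totalRacks, int numOfExcludedRacks) {
        // Racks that can still accept new replicas.
        int usableRacks = Math.max(0, totalRacks - numOfExcludedRacks);
        // The policy cannot ask for more racks than are usable.
        int achievableRacks = Math.min(requiredRacks, usableRacks);
        return Math.max(0, achievableRacks - currentRacks);
    }

    public static void main(String[] args) {
        // 6-rack cluster, 6 racks wanted, one rack under decommission,
        // so in-service replicas span 5 racks.
        System.out.println("ignoring excluded racks: "
            + additionalReplicasRequired(5, 6, 6, 0));
        System.out.println("counting one excluded rack: "
            + additionalReplicasRequired(5, 6, 6, 1));
    }
}
```

Under these assumptions, a calculation that ignores the excluded rack keeps asking for a sixth rack that cannot be used, which would be consistent with the decommission never completing. Counting the excluded rack lowers the target to the five usable racks.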