[ https://issues.apache.org/jira/browse/HDFS-6133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14313513#comment-14313513 ]
Yongjun Zhang commented on HDFS-6133: ------------------------------------- Hi [~zhaoyunjiong], Nice work! Thanks Nicholas for the review, the patch looks good to me too. One nit: The {{DFSOutputStream#getPinnings}} has nested loop. The search cost is NumNodesInPipeline * NumFavoredNodes. From application point of view, I wonder how many favoredNodes could be for a big cluster (?). If needed, we could create a hash set when doing {{setFavoredNodes}} for quicker search. This may not be needed if the number is small. Anyways, I don't think we need to make this change in this jira, just something to watch out. Thanks. > Make Balancer support exclude specified path > -------------------------------------------- > > Key: HDFS-6133 > URL: https://issues.apache.org/jira/browse/HDFS-6133 > Project: Hadoop HDFS > Issue Type: Improvement > Components: balancer & mover, namenode > Reporter: zhaoyunjiong > Assignee: zhaoyunjiong > Attachments: HDFS-6133-1.patch, HDFS-6133-10.patch, > HDFS-6133-2.patch, HDFS-6133-3.patch, HDFS-6133-4.patch, HDFS-6133-5.patch, > HDFS-6133-6.patch, HDFS-6133-7.patch, HDFS-6133-8.patch, HDFS-6133-9.patch, > HDFS-6133.patch > > > Currently, run Balancer will destroying Regionserver's data locality. > If getBlocks could exclude blocks belongs to files which have specific path > prefix, like "/hbase", then we can run Balancer without destroying > Regionserver's data locality. -- This message was sent by Atlassian JIRA (v6.3.4#6332)