[ https://issues.apache.org/jira/browse/HDFS-17743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17934353#comment-17934353 ]
ASF GitHub Bot commented on HDFS-17743: --------------------------------------- weisong44 opened a new pull request, #7498: URL: https://github.com/apache/hadoop/pull/7498 Added support for random datanode ordering in getBlockLocations() and put it behind a configuration parameter with the default behavior remaining unchanged. (cherry picked from commit ef9043fcc001d3f2c9a6209d21011ac44207970c) <!-- Thanks for sending a pull request! 1. If this is your first time, please read our contributor guidelines: https://cwiki.apache.org/confluence/display/HADOOP/How+To+Contribute 2. Make sure your PR title starts with JIRA issue id, e.g., 'HADOOP-17799. Your PR title ...'. --> ### Description of PR ### How was this patch tested? ### For code changes: - [x] Does the title or this PR starts with the corresponding JIRA issue id (e.g. 'HADOOP-17799. Your PR title ...')? - [ ] Object storage: have the integration tests been executed and the endpoint declared according to the connector-specific documentation? N/A - [ ] If adding new dependencies to the code, are these dependencies licensed in a way that is compatible for inclusion under [ASF 2.0](http://www.apache.org/legal/resolved.html#category-a)? N/A - [ ] If applicable, have you updated the `LICENSE`, `LICENSE-binary`, `NOTICE-binary` files? N/A > Support random datanode ordering in getBlockLocations() > ------------------------------------------------------- > > Key: HDFS-17743 > URL: https://issues.apache.org/jira/browse/HDFS-17743 > Project: Hadoop HDFS > Issue Type: Improvement > Components: namenode > Affects Versions: 2.10.0, 3.4.1 > Reporter: Wei Song > Priority: Major > Labels: pull-request-available > > At LinkedIn, we aren't able to rely on data locality due to various reasons, > overall the current implementation of sorting by network topology didn't > improve overall performance, it also caused unnecessary concentration of load > on specific datanodes. We would like to request for a random policy that > randomly order datanodes, we expect this policy to improve load distribution. > > The new policy should be configurable, and it is disabled. When enabled, it > replaces the current network topology based datanode ordering. -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org