[ 
https://issues.apache.org/jira/browse/HDFS-17743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17934353#comment-17934353
 ] 

ASF GitHub Bot commented on HDFS-17743:
---------------------------------------

weisong44 opened a new pull request, #7498:
URL: https://github.com/apache/hadoop/pull/7498

   
   Added support for random datanode ordering in getBlockLocations() and put it 
behind a configuration parameter with the default behavior remaining unchanged.
   
   (cherry picked from commit ef9043fcc001d3f2c9a6209d21011ac44207970c)
   
   <!--
     Thanks for sending a pull request!
       1. If this is your first time, please read our contributor guidelines: 
https://cwiki.apache.org/confluence/display/HADOOP/How+To+Contribute
       2. Make sure your PR title starts with JIRA issue id, e.g., 
'HADOOP-17799. Your PR title ...'.
   -->
   
   ### Description of PR
   
   
   ### How was this patch tested?
   
   
   ### For code changes:
   
   - [x] Does the title or this PR starts with the corresponding JIRA issue id 
(e.g. 'HADOOP-17799. Your PR title ...')?
   - [ ] Object storage: have the integration tests been executed and the 
endpoint declared according to the connector-specific documentation? N/A
   - [ ] If adding new dependencies to the code, are these dependencies 
licensed in a way that is compatible for inclusion under [ASF 
2.0](http://www.apache.org/legal/resolved.html#category-a)? N/A
   - [ ] If applicable, have you updated the `LICENSE`, `LICENSE-binary`, 
`NOTICE-binary` files? N/A
   
   




> Support random datanode ordering in getBlockLocations()
> -------------------------------------------------------
>
>                 Key: HDFS-17743
>                 URL: https://issues.apache.org/jira/browse/HDFS-17743
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>          Components: namenode
>    Affects Versions: 2.10.0, 3.4.1
>            Reporter: Wei Song
>            Priority: Major
>              Labels: pull-request-available
>
> At LinkedIn, we aren't able to rely on data locality due to various reasons, 
> overall the current implementation of sorting by network topology didn't 
> improve overall performance, it also caused unnecessary concentration of load 
> on specific datanodes. We would like to request for a random policy that 
> randomly order datanodes, we expect this policy to improve load distribution. 
>  
> The new policy should be configurable, and it is disabled. When enabled, it 
> replaces the current network topology based datanode ordering.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org

Reply via email to