[ https://issues.apache.org/jira/browse/HDFS-7891?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14380482#comment-14380482 ]
Zhe Zhang commented on HDFS-7891:
---------------------------------

Thanks Nicholas and Walter for the discussion!

bq. I think when we use stripe layout, even when stripe layout is not used. All 6+3 replica(it's better called blocks belonging to same group) comes from client. As long as the replica is place on the remote rack, the traffic is the same.

Under the striping layout, in _initial file creation_, all raw and parity data comes directly from the client. So inter-rack traffic won't be affected by the placement policy (unless we consider the factor that the client is likely co-located with the 1st DN). Two other scenarios do incur inter-DN traffic:
# Background block reconstruction. Here a new DN is chosen as the reconstruction destination and pulls data from the other DNs in the group.
# EC+contiguous layout. Here all blocks are initially triplicated. Then, for each formed group, a new DN is selected as the encoder, which pulls data from the raw data blocks.

> A block placement policy with best fault tolerance
> --------------------------------------------------
>
> Key: HDFS-7891
> URL: https://issues.apache.org/jira/browse/HDFS-7891
> Project: Hadoop HDFS
> Issue Type: Sub-task
> Reporter: Walter Su
> Assignee: Walter Su
> Attachments: HDFS-7891.002.patch, HDFS-7891.003.patch, HDFS-7891.004.patch, HDFS-7891.patch, PlacementPolicyBenchmark.txt, testresult.txt
>
>
> a block placement policy tries its best to place replicas to most racks.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)