[ 
https://issues.apache.org/jira/browse/HDFS-7891?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14380482#comment-14380482
 ] 

Zhe Zhang commented on HDFS-7891:
---------------------------------

Thanks Nicholas and Walter for the discussion!

bq. I think when we use stripe layout, even when stripe layout is not used. All 
6+3 replica(it's better called blocks belonging to same group) comes from 
client. As long as the replica is place on the remote rack, the traffic is the 
same.
Under the striping layout, during _initial file creation_, all raw and parity 
data comes directly from the client. So inter-rack traffic won't be affected by 
the placement policy (unless we consider that the client is likely 
co-located with the 1st DN). 

Two other scenarios do incur inter-DN traffic:
# Background block reconstruction. Here a new DN is chosen as the reconstruction 
destination and pulls data from the other DNs in the group.
# EC+contiguous layout. Here all blocks are initially triplicated. Then for 
each formed group, a new DN is selected as the encoder, which pulls data from 
the raw data blocks.
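
To make the reconstruction case concrete, here is a rough sketch (not HDFS code) that counts how many source blocks have to cross racks to reach the destination DN. The function name, the rack labels, and the assumption that 6 surviving blocks of a 6+3 group are read are all mine, for illustration only.

```python
from typing import List


def inter_rack_reads(source_racks: List[str], dest_rack: str) -> int:
    """Count source blocks that sit on a different rack than the destination DN."""
    return sum(1 for rack in source_racks if rack != dest_rack)


# Hypothetical 6+3 group with one lost block. We assume 6 surviving blocks
# are read, and that their DNs are spread evenly over racks r1..r3.
sources = ["r1", "r1", "r2", "r2", "r3", "r3"]

# Destination DN on a rack that already holds two of the source blocks.
same_rack_dest = inter_rack_reads(sources, "r1")

# Destination DN on a rack that holds none of the source blocks.
new_rack_dest = inter_rack_reads(sources, "r4")

print(same_rack_dest, new_rack_dest)
```

In this toy model the destination on r1 pulls 4 blocks across racks, while a destination on the unused rack r4 pulls all 6. So in these two scenarios, unlike initial file creation, the choice of destination DN does change inter-rack traffic.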

> A block placement policy with best fault tolerance
> --------------------------------------------------
>
>                 Key: HDFS-7891
>                 URL: https://issues.apache.org/jira/browse/HDFS-7891
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>            Reporter: Walter Su
>            Assignee: Walter Su
>         Attachments: HDFS-7891.002.patch, HDFS-7891.003.patch, 
> HDFS-7891.004.patch, HDFS-7891.patch, PlacementPolicyBenchmark.txt, 
> testresult.txt
>
>
> A block placement policy that tries its best to place replicas on as many racks as possible.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
