[ 
https://issues.apache.org/jira/browse/HDFS-10604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Doris Gu updated HDFS-10604:
----------------------------
    Description: 
The biggest difference this feature will bring is *making blocks belong to the 
same file to save in the same region(DN group).*
So the process will be:
1.Config DN groups, for example
bq.Region1:dn1,dn2,dn3
bq.Region2:dn4,dn5,dn6
bq.Region3:dn7,dn8,dn9,dn10

2.Client uploads a file, first analyze whether this file has any existed blocks:
bq.i)Yes:assign new blocks to the DN group where the existed blocks belong to.
bq.ii)No:assign new blocks to a DN group which is chosen by some certain policy 
to avoid imbalance.

3.Other related processes,including append,balancer etc. also need to modify as 
well.   

The benefit we wish is when some DNs are down at the same time, the number of 
affected files(miss all replicas) is small.
But we are wondering if this is worth doing or not, or if there are problems we 
haven't noticed.

  was:
The biggest difference this feature will bring is *strong* making blocks belong 
to the same file to save in the same region(DN group).*strong*
So the process will be:
1.Config DN groups, for example
bq.Region1:dn1,dn2,dn3
bq.Region2:dn4,dn5,dn6
bq.Region3:dn7,dn8,dn9,dn10

2.Client uploads a file, first analyze whether this file has any existed blocks:
bq.i)Yes:assign new blocks to the DN group where the existed blocks belong to.
bq.ii)No:assign new blocks to a DN group which is chosen by some certain policy 
to avoid imbalance.

3.Other related processes,including append,balancer etc. also need to modify as 
well.   

The benefit we wish is when some DNs are down at the same time, the number of 
affected files(miss all replicas) is small.
But we are wondering if this is worth doing or not, or if there are problems we 
haven't noticed.


> What about this?Group DNs and add DN groups--named region to HDFS model , use 
> this region to instead of single DN when saving files.
> ------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: HDFS-10604
>                 URL: https://issues.apache.org/jira/browse/HDFS-10604
>             Project: Hadoop HDFS
>          Issue Type: Wish
>            Reporter: Doris Gu
>
> The biggest difference this feature will bring is *making blocks belong to 
> the same file to save in the same region(DN group).*
> So the process will be:
> 1.Config DN groups, for example
> bq.Region1:dn1,dn2,dn3
> bq.Region2:dn4,dn5,dn6
> bq.Region3:dn7,dn8,dn9,dn10
> 2.Client uploads a file, first analyze whether this file has any existed 
> blocks:
> bq.i)Yes:assign new blocks to the DN group where the existed blocks belong to.
> bq.ii)No:assign new blocks to a DN group which is chosen by some certain 
> policy to avoid imbalance.
> 3.Other related processes,including append,balancer etc. also need to modify 
> as well.   
> The benefit we wish is when some DNs are down at the same time, the number of 
> affected files(miss all replicas) is small.
> But we are wondering if this is worth doing or not, or if there are problems 
> we haven't noticed.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org

Reply via email to