[jira] [Commented] (HDFS-4645) Move from randomly generated block ID to sequentially generated block ID

Kihwal Lee (JIRA) Thu, 28 Mar 2013 08:10:43 -0700

    [ 
https://issues.apache.org/jira/browse/HDFS-4645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13616319#comment-13616319
 ]


Kihwal Lee commented on HDFS-4645:
----------------------------------

bq. This seems fine to me in the abstract, though I'm not sure what the 
original motivation of randomly generating the IDs was.

Bucketizing or distributing blocks based on block numbers. Sorting based on 
block numbers to get a ramdom block ordering. I think the repl queue did this. 
I don't know if it still does it.

bq. One challenge with implementing this will be how to do upgrades from 
existing clusters which already have block IDs randomly distributed across the 
valid range. Any thoughts on how to handle that?

I am not concerned about block id collision within namenode. Even for a large 
name space, the id space is very sparsely populated. Between datanode and 
namenode, generation number will resolve any conflict. I am more worried about 
components that depend on the randomness of block ids, which will make upgrade 
hard.

Will new block ids be allowed to be < 0?
                
> Move from randomly generated block ID to sequentially generated block ID
> ------------------------------------------------------------------------
>
>                 Key: HDFS-4645
>                 URL: https://issues.apache.org/jira/browse/HDFS-4645
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>          Components: namenode
>    Affects Versions: 3.0.0
>            Reporter: Suresh Srinivas
>            Assignee: Suresh Srinivas
>
> Currently block IDs are randomly generated. This means there is no pattern to 
> block ID generation and no guarantees such as uniqueness of block ID for the 
> life time of the system can be made. I propose using SequentialNumber for 
> block ID generation.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HDFS-4645) Move from randomly generated block ID to sequentially generated block ID

Reply via email to