[ 
https://issues.apache.org/jira/browse/HDFS-2832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13813539#comment-13813539
 ] 

Arpit Agarwal commented on HDFS-2832:
-------------------------------------

Konstantin,
{quote}
I thought that the storageID approach that each DN is able to generate a unique 
id independently of the others is a good feature to retain.
{quote}
Storage (UU)IDs are independently generated on the Datanode in 
{{DataStorage#format}}.

{quote}
UUID as you noted is not unique and needs to be coordinated through NameNode.
{quote}
Not true. {{UUID#randomUUID}} generates RFC 4122-compliant version 4 (random) 
UUIDs, which are unique for all practical purposes without NameNode 
coordination.
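
To illustrate, here is a minimal sketch of independent ID generation. It is not the actual {{DataStorage#format}} code; the class name, the helper name and the "DS-" prefix are assumptions made for the example. The only real API it relies on is {{java.util.UUID#randomUUID}}.

```java
import java.util.HashSet;
import java.util.Set;
import java.util.UUID;

public class StorageIdSketch {
    // Hypothetical helper: builds a storage ID from a random (version 4) UUID.
    // No NameNode or peer Datanode is consulted; the "DS-" prefix is an
    // assumption for illustration only.
    static String newStorageId() {
        return "DS-" + UUID.randomUUID();
    }

    public static void main(String[] args) {
        // Simulate many Datanodes, each generating an ID on its own.
        Set<String> ids = new HashSet<>();
        int generated = 100_000;
        for (int i = 0; i < generated; i++) {
            ids.add(newStorageId());
        }
        System.out.println("generated=" + generated + " distinct=" + ids.size());
    }
}
```

A version 4 UUID carries 122 random bits, so a collision among the IDs in a cluster is not a practical concern.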

{quote}
You can also add to storageID an attribute that characterizes the disk volume 
or the directory as a new component. Examples of the new attribute could be 
disk serial number, or the storage directory inode number. It seems that 
introduction of UUIDs was unnecessary, unless of course I missed some context.
{quote}
Part of the rationale is in HDFS-5115. Making them UUIDs simplifies the 
generation logic. Decoupling them from volume/directory characteristics also 
allows for future storage media that have neither a disk serial number nor an 
inode number.
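
A rough sketch of the decoupling argument follows. Everything in it is hypothetical: {{Medium}}, {{attributeBasedId}} and {{uuidBasedId}} are names invented for the example and are not HDFS classes or methods. It only shows that an ID derived from a volume attribute is undefined when the medium lacks that attribute, while a UUID-based ID never looks at the medium.

```java
import java.util.Optional;
import java.util.UUID;

public class DecoupledIdSketch {
    // Hypothetical description of a storage medium; not an HDFS class.
    static final class Medium {
        final String name;
        final Optional<String> diskSerial; // empty for media without a serial

        Medium(String name, Optional<String> diskSerial) {
            this.name = name;
            this.diskSerial = diskSerial;
        }
    }

    // Attribute-derived ID: only defined when the medium exposes a serial number.
    static Optional<String> attributeBasedId(Medium m) {
        return m.diskSerial.map(serial -> "DS-" + serial);
    }

    // UUID-based ID: independent of the medium, so it works for any of them.
    static String uuidBasedId() {
        return "DS-" + UUID.randomUUID();
    }

    public static void main(String[] args) {
        Medium disk = new Medium("disk", Optional.of("SERIAL-0001"));
        Medium memory = new Medium("memory", Optional.empty());

        for (Medium m : new Medium[] {disk, memory}) {
            System.out.println(m.name
                + " attributeBased=" + attributeBasedId(m).orElse("<none>")
                + " uuidBased=" + uuidBasedId());
        }
    }
}
```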

> Enable support for heterogeneous storages in HDFS
> -------------------------------------------------
>
>                 Key: HDFS-2832
>                 URL: https://issues.apache.org/jira/browse/HDFS-2832
>             Project: Hadoop HDFS
>          Issue Type: New Feature
>    Affects Versions: 0.24.0
>            Reporter: Suresh Srinivas
>            Assignee: Suresh Srinivas
>         Attachments: 20130813-HeterogeneousStorage.pdf, h2832_20131023.patch, 
> h2832_20131023b.patch, h2832_20131025.patch, h2832_20131028.patch, 
> h2832_20131028b.patch, h2832_20131029.patch, h2832_20131103.patch, 
> h2832_20131104.patch
>
>
> HDFS currently supports a configuration where storages are a list of 
> directories. Typically, each of these directories corresponds to a volume with 
> its own file system. All these directories are homogeneous and are therefore 
> identified as a single storage at the namenode. I propose changing the 
> current model, where a Datanode *is a* storage, to one where a Datanode 
> *is a collection* of storages. 



--
This message was sent by Atlassian JIRA
(v6.1#6144)
