[ https://issues.apache.org/jira/browse/HDFS-2832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13813495#comment-13813495 ]
Konstantin Shvachko commented on HDFS-2832:
-------------------------------------------

Hey guys, I was wondering if we really need to change storageID to UUID. I thought that the storageID approach, in which _each DN is able to generate a unique id independently of the others_, is a good feature to retain. UUID, as you noted, is not unique and needs to be coordinated through the NameNode.

I understand you have multiple storages on the same DN, and you need unique ids independently of the IP and port.
# They should be unique with the existing implementation of {{createNewStorageId()}}.
{code}storageid = <random, ip, port, currentTime>{code}
If you generate ids sequentially one after another, currentTime should be different. It can be replaced by nano-time if id generation is done in different threads.
# You can also add to storageID an attribute that characterizes the disk volume or the directory as a new component. Examples of the new attribute could be the disk serial number or the storage directory inode number.

It seems that the introduction of UUIDs was unnecessary, unless of course I missed some context.

> Enable support for heterogeneous storages in HDFS
> -------------------------------------------------
>
>                 Key: HDFS-2832
>                 URL: https://issues.apache.org/jira/browse/HDFS-2832
>             Project: Hadoop HDFS
>          Issue Type: New Feature
>    Affects Versions: 0.24.0
>            Reporter: Suresh Srinivas
>            Assignee: Suresh Srinivas
>         Attachments: 20130813-HeterogeneousStorage.pdf, h2832_20131023.patch,
> h2832_20131023b.patch, h2832_20131025.patch, h2832_20131028.patch,
> h2832_20131028b.patch, h2832_20131029.patch, h2832_20131103.patch
>
>
> HDFS currently supports configuration where storages are a list of
> directories. Typically each of these directories corresponds to a volume with
> its own file system. All these directories are homogeneous and therefore
> identified as a single storage at the namenode.
> I propose changing the current model, where a Datanode *is a* storage, to one
> where a Datanode *is a collection* of storages.



--
This message was sent by Atlassian JIRA
(v6.1#6144)
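
For reference, the id scheme suggested in the comment (random, ip, port, time, plus a per-volume attribute) could be sketched roughly as follows. This is only an illustration under stated assumptions, not the actual {{createNewStorageId()}} code: the class name {{StorageIdSketch}}, the {{DS-}} prefix, the field order, and the {{volumeAttr}} parameter are all hypothetical.

```java
import java.security.SecureRandom;

// Hypothetical sketch of the id layout discussed in the comment;
// NOT the real Hadoop createNewStorageId() implementation.
final class StorageIdSketch {
    private static final SecureRandom RANDOM = new SecureRandom();

    // volumeAttr is an assumed extra component, e.g. a disk serial number
    // or the inode number of the storage directory.
    static String newStorageId(String ip, int port, String volumeAttr) {
        int rand = RANDOM.nextInt(Integer.MAX_VALUE);
        // nano-time instead of currentTime, as suggested for the case
        // where ids are generated from different threads
        long now = System.nanoTime();
        return "DS-" + rand + "-" + ip + "-" + port + "-" + volumeAttr + "-" + now;
    }

    public static void main(String[] args) {
        // Two volumes on the same DN (same ip and port) get distinct ids
        // without any coordination through the NameNode.
        System.out.println(newStorageId("10.0.0.1", 50010, "serial-A"));
        System.out.println(newStorageId("10.0.0.1", 50010, "serial-B"));
    }
}
```

With a per-volume component in the id, two storages on the same DN differ even if the random and time components happened to coincide, which is the point of the second suggestion in the comment.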