[ https://issues.apache.org/jira/browse/HDFS-3370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13294597#comment-13294597 ]
Karthik Ranganathan commented on HDFS-3370:
-------------------------------------------

@Konstantin:
<< This can be modeled by symlinks on the application (HBase) level without making any changes in HDFS. >>

Modeling this on top of HBase would essentially mean implementing the hardlink feature at the HBase level for all of its files. It also means that every application that needs a similar feature would have to use symbolic links to implement hardlinks itself.

We have already implemented this at the underlying filesystem level for HBase backups. The drawback is that on disk or node failure, re-replication increases the total size of data in the cluster, which was getting hard to provision for. Hence the natural progression towards putting it in HDFS.

> HDFS hardlink
> -------------
>
>                 Key: HDFS-3370
>                 URL: https://issues.apache.org/jira/browse/HDFS-3370
>             Project: Hadoop HDFS
>          Issue Type: New Feature
>            Reporter: Hairong Kuang
>            Assignee: Liyin Tang
>         Attachments: HDFS-HardLink.pdf
>
>
> We'd like to add a new feature, hardlink, to HDFS that allows hardlinked files
> to share data without copying. Currently we will support hardlinking only
> closed files, but it could be extended to unclosed files as well.
> Among many potential use cases of the feature, the following two are
> primarily used at Facebook:
> 1. This provides a lightweight way for applications like HBase to create a
> snapshot;
> 2. This also allows an application like Hive to move a table to a different
> directory without breaking currently running Hive queries.