[ https://issues.apache.org/jira/browse/HDFS-3370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13271453#comment-13271453 ]
Daryn Sharp commented on HDFS-3370: ----------------------------------- I fully agree that posix and/or linux conventions should ideally be followed. I did a little testing and didn't realize that a hard link retains the same attrs (owner, group, perms, xattrs, etc) as the original file. Changing one implicit changes the others, so that negates some issues such as differing replication factor concerns. Perhaps hard link creation can be restricted to only the file owner and superuser. The quota concerns are still a bit more complex. Unixy systems like linux and bsd only have fs level quotas for users, so quota handling is trivial compared to directory level quotas in hdfs. Since all hard links implicitly have the same owner, quotas are as simple as incrementing the user's ds quota is at file creation, and decrement when all links are removed. This why a DOS is possible against a user. I'm sorry if I'm missing a detail, but I remain unclear on how you are proposing to handle the directory level quotas. I don't fully grok how finding a common ancestor with a quota is sufficient because quotas can be added or removed at any time. Maybe part of the issue too is I have nested directories with individual quotas in mind, whereas maybe you are assuming one and only one quota from the root? I look forward to your thoughts. > HDFS hardlink > ------------- > > Key: HDFS-3370 > URL: https://issues.apache.org/jira/browse/HDFS-3370 > Project: Hadoop HDFS > Issue Type: New Feature > Reporter: Hairong Kuang > Assignee: Liyin Tang > Attachments: HDFS-HardLink.pdf > > > We'd like to add a new feature hardlink to HDFS that allows harlinked files > to share data without copying. Currently we will support hardlinking only > closed files, but it could be extended to unclosed files as well. > Among many potential use cases of the feature, the following two are > primarily used in facebook: > 1. This provides a lightweight way for applications like hbase to create a > snapshot; > 2. This also allows an application like Hive to move a table to a different > directory without breaking current running hive queries. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira