[ https://issues.apache.org/jira/browse/HDFS-2802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13271497#comment-13271497 ]
Eli Collins commented on HDFS-2802: ----------------------------------- Hey Hari, Could you elaborate some on scenario #3? The motivation is test, so I assume use cases are things like (1) giving an experimental Hive/Pig user safe access /datasets/xyz or (2) running a test/dev HBase instance from an existing /hbase directory. Use case #1 seems like it could be accomplished in practice with external tables (or, in general, analysis programs that can separate their input data from directories they create/mutate). Use case #2 seems unlikely to be used in practice. Could be useful if /hbase is production (ie big) but in that case you wouldn't want to run a dev/test HBase build against a production HDFS cluster (as the production HBase instance runs there as well). Ie the usual case here is that you have a separate test/dev HBase/HDFS instance and you need/want to copy the datasets. What's your plan for handling append and truncate? The design document does not discuss whether you're punting on these, or if not the semantics (eg for a snapshot taken while a block is being written) or the implementation. There's relevant design and implementation discussion in HDFS-233 and [Snapshots in Hadoop Distributed File System|http://www.cs.berkeley.edu/~sameerag/hdfs-snapshots.pdf] that's worth referencing. Looking forward to checking out your prototype patch. Thanks, Eli > Support for RW/RO snapshots in HDFS > ----------------------------------- > > Key: HDFS-2802 > URL: https://issues.apache.org/jira/browse/HDFS-2802 > Project: Hadoop HDFS > Issue Type: New Feature > Components: data-node, name-node > Affects Versions: 0.24.0 > Reporter: Hari Mankude > Assignee: Hari Mankude > Attachments: snapshot-one-pager.pdf > > > Snapshots are point in time images of parts of the filesystem or the entire > filesystem. Snapshots can be a read-only or a read-write point in time copy > of the filesystem. There are several use cases for snapshots in HDFS. I will > post a detailed write-up soon with with more information. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira