[ https://issues.apache.org/jira/browse/HADOOP-9700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13704093#comment-13704093 ]
Binglin Chang commented on HADOOP-9700: --------------------------------------- Currently, SnapshotDiffReport(even SnapshotDiffInfo) lacks information to support minimal diff transfer, I think there are 3 aspects at least: # simple file/dir renaming, I think it can be handled without the help of InodeID # changing dir hierarchical structure, I'm not sure current diff report format can express this kind of change, looks like diff of changing complex dir hierarchical structure can only be archived by fully comparing whole InodeID sets of the two snapshots. # file append, like Luke mentioned. SnapshotDiffInfo is private, we can either change SnapshotDiffInfo to public or add more information to diff report. Any suggestions for which direction to go? > Snapshot support for distcp > --------------------------- > > Key: HADOOP-9700 > URL: https://issues.apache.org/jira/browse/HADOOP-9700 > Project: Hadoop Common > Issue Type: New Feature > Components: tools/distcp > Reporter: Binglin Chang > Assignee: Binglin Chang > Attachments: HADOOP-9700-demo.patch > > > Add snapshot incremental copy ability to distcp, so we can do iterative > consistent backup between hadoop clusters. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira