[ https://issues.apache.org/jira/browse/HDFS-12544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Manoj Govindassamy updated HDFS-12544: -------------------------------------- Attachment: HDFS-12544.01.patch Attached v01 patch to address the following 1. {{DirectorySnapshottableFeature#computeDiff}} now accepts a scope directory to limit the computation of the diff under a snapshot root directory 2. Config to enable/disable this feature 3. Snapshot DiffReport tests to verify the new behavior [~yzhangal]/others, can you please take a look at the patch? > SnapshotDiff - support diff generation on any snapshot root descendant > directory > -------------------------------------------------------------------------------- > > Key: HDFS-12544 > URL: https://issues.apache.org/jira/browse/HDFS-12544 > Project: Hadoop HDFS > Issue Type: Improvement > Components: hdfs > Affects Versions: 3.0.0-beta1 > Reporter: Manoj Govindassamy > Assignee: Manoj Govindassamy > Attachments: HDFS-12544.01.patch > > > {noformat} > # hdfs snapshotDiff <snapshot_root_path> <from_snapshot_name> > <to_snapshot_name> > {noformat} > Using snapshot diff command, we can generate a diff report between any two > given snapshots under a snapshot root directory. The command today only > accepts the path that is a snapshot root. There are many deployments where > the snapshot root is configured at the higher level directory but the diff > report needed is only for a specific directory under the snapshot root. In > these cases, the diff report can be filtered for changes pertaining to the > directory we are interested in. But when the snapshot root directory is very > huge, the snapshot diff report generation can take minutes even if we are > interested to know the changes only in a small directory. So, it would be > highly performant if the diff report calculation can be limited to the > snapshot directory only instead of the whole snapshot root. -- This message was sent by Atlassian JIRA (v6.4.14#64029) --------------------------------------------------------------------- To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org