[ 
https://issues.apache.org/jira/browse/HDFS-12544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Manoj Govindassamy updated HDFS-12544:
--------------------------------------
    Attachment: HDFS-12544.01.patch

Attached v01 patch to address the following
1. {{DirectorySnapshottableFeature#computeDiff}} now accepts a scope directory 
to limit the computation of the diff under a snapshot root directory
2. Config to enable/disable this feature 
3. Snapshot DiffReport tests to verify the new behavior
[~yzhangal]/others, can you please take a look at the patch?

> SnapshotDiff - support diff generation on any snapshot root descendant 
> directory
> --------------------------------------------------------------------------------
>
>                 Key: HDFS-12544
>                 URL: https://issues.apache.org/jira/browse/HDFS-12544
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>          Components: hdfs
>    Affects Versions: 3.0.0-beta1
>            Reporter: Manoj Govindassamy
>            Assignee: Manoj Govindassamy
>         Attachments: HDFS-12544.01.patch
>
>
> {noformat}
> # hdfs snapshotDiff <snapshot_root_path> <from_snapshot_name> 
> <to_snapshot_name>
> {noformat}
> Using snapshot diff command, we can generate a diff report between any two 
> given snapshots under a snapshot root directory. The command today only 
> accepts the path that is a snapshot root. There are many deployments where 
> the snapshot root is configured at the higher level directory but the diff 
> report needed is only for a specific directory under the snapshot root. In 
> these cases, the diff report can be filtered for changes pertaining to the 
> directory we are interested in. But when the snapshot root directory is very 
> huge, the snapshot diff report generation can take minutes even if we are 
> interested to know the changes only in a small directory. So, it would be 
> highly performant if the diff report calculation can be limited to the 
> snapshot directory only instead of the whole snapshot root.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org

Reply via email to