[ https://issues.apache.org/jira/browse/HDFS-12544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Yongjun Zhang updated HDFS-12544:
---------------------------------
    Description: 
{noformat}
# hdfs snapshotDiff <snapshot_root_path> <from_snapshot_name> <to_snapshot_name>
{noformat}

Using the snapshot diff command, we can generate a diff report between any two 
given snapshots under a snapshot root directory. Today the command accepts only 
a path that is a snapshot root. In many deployments the snapshot root is 
configured at a higher-level directory, but the diff report is needed only for 
a specific directory under the snapshot root. In these cases, the diff report 
can be filtered for changes pertaining to the directory we are interested in. 
But when the snapshot root directory is very large, generating the diff report 
can take minutes even if we only want to know the changes in a small directory. 
So it would be much faster if the diff report calculation could be limited to 
only the sub-directory of interest rather than the whole snapshot root.
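
As a rough illustration of the filtering workaround described above, the 
sketch below keeps only the report entries that fall under one sub-directory. 
It is not part of the attached patches. The function name 
filter_diff_entries is hypothetical, and the entry format (a change type 
of +, -, M or R, whitespace, then a path starting with "./" relative to the 
snapshot root, with renames written as "./old -> ./new") is an assumption 
about the command's text output. Note that the full report still has to be 
generated first, which is the cost this issue aims to avoid.

```python
CHANGE_TYPES = {"+", "-", "M", "R"}  # assumed change-type markers


def filter_diff_entries(report_lines, subdir):
    """Keep entries whose path lies under `subdir` (relative to the snapshot root).

    Assumes each entry looks like "M\t./some/path"; this format is an
    assumption made for illustration, not taken from the issue.
    """
    prefix = "./" + subdir.strip("/")
    kept = []
    for line in report_lines:
        parts = line.split(None, 1)
        if len(parts) != 2 or parts[0] not in CHANGE_TYPES:
            continue  # skip blank lines, headers and anything unrecognised
        change_type, rest = parts
        # A rename is assumed to list both paths: "./old -> ./new".
        paths = [p.strip() for p in rest.split(" -> ")]
        if any(p == prefix or p.startswith(prefix + "/") for p in paths):
            kept.append((change_type, rest.strip()))
    return kept
```

A rename is kept if either its old or its new path is under the 
sub-directory, since both sides represent a change visible there.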

  was:
{noformat}
# hdfs snapshotDiff <snapshot_root_path> <from_snapshot_name> <to_snapshot_name>
{noformat}

Using the snapshot diff command, we can generate a diff report between any two 
given snapshots under a snapshot root directory. Today the command accepts only 
a path that is a snapshot root. In many deployments the snapshot root is 
configured at a higher-level directory, but the diff report is needed only for 
a specific directory under the snapshot root. In these cases, the diff report 
can be filtered for changes pertaining to the directory we are interested in. 
But when the snapshot root directory is very large, generating the diff report 
can take minutes even if we only want to know the changes in a small directory. 
So it would be much faster if the diff report calculation could be limited to 
the snapshot directory only, rather than the whole snapshot root.


> SnapshotDiff - support diff generation on any snapshot root descendant 
> directory
> --------------------------------------------------------------------------------
>
>                 Key: HDFS-12544
>                 URL: https://issues.apache.org/jira/browse/HDFS-12544
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>          Components: hdfs
>    Affects Versions: 3.0.0-beta1
>            Reporter: Manoj Govindassamy
>            Assignee: Manoj Govindassamy
>         Attachments: HDFS-12544.01.patch, HDFS-12544.02.patch
>
>
> {noformat}
> # hdfs snapshotDiff <snapshot_root_path> <from_snapshot_name> <to_snapshot_name>
> {noformat}
> Using the snapshot diff command, we can generate a diff report between any 
> two given snapshots under a snapshot root directory. Today the command 
> accepts only a path that is a snapshot root. In many deployments the snapshot 
> root is configured at a higher-level directory, but the diff report is needed 
> only for a specific directory under the snapshot root. In these cases, the 
> diff report can be filtered for changes pertaining to the directory we are 
> interested in. But when the snapshot root directory is very large, generating 
> the diff report can take minutes even if we only want to know the changes in 
> a small directory. So it would be much faster if the diff report calculation 
> could be limited to only the sub-directory of interest rather than the whole 
> snapshot root.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
