[ https://issues.apache.org/jira/browse/MAPREDUCE-5899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tsz Wo Nicholas Sze moved HADOOP-10608 to MAPREDUCE-5899:
---------------------------------------------------------

        Key: MAPREDUCE-5899  (was: HADOOP-10608)
    Project: Hadoop Map/Reduce  (was: Hadoop Common)

> Support incremental data copy in DistCp
> ---------------------------------------
>
>                 Key: MAPREDUCE-5899
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5899
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>          Components: distcp
>            Reporter: Jing Zhao
>            Assignee: Jing Zhao
>         Attachments: HADOOP-10608.000.patch, HADOOP-10608.001.patch
>
>
> Currently, when running distcp with the -update option, if two files have the
> same name but differ in length or checksum, we overwrite the whole file. It
> would be good to detect the case where (sourceFile = targetFile +
> appended_data) and transfer only the appended data segment to the target.
> This would be very useful for incremental distcp.

--
This message was sent by Atlassian JIRA
(v6.2#6252)
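The check proposed in the issue (sourceFile = targetFile + appended_data) can be illustrated with a small local-filesystem sketch. This is not the code in the attached patches and does not use any Hadoop or DistCp API: the function names `prefix_digest` and `incremental_copy` are hypothetical, SHA-256 stands in for whatever checksum the file system provides, and plain local files stand in for HDFS paths.

```python
import hashlib
import os

CHUNK = 1 << 20  # bytes read per iteration (arbitrary choice for this sketch)


def prefix_digest(path, length, chunk=CHUNK):
    """Hash the first `length` bytes of `path` (hypothetical helper)."""
    digest = hashlib.sha256()
    remaining = length
    with open(path, "rb") as f:
        while remaining > 0:
            block = f.read(min(chunk, remaining))
            if not block:  # file is shorter than `length`
                break
            digest.update(block)
            remaining -= len(block)
    return digest.hexdigest()


def incremental_copy(source, target, chunk=CHUNK):
    """Bring `target` up to date with `source`; return the bytes transferred.

    If the target's content is a prefix of the source, only the appended
    tail is copied. Otherwise the whole file is overwritten.
    """
    src_len = os.path.getsize(source)
    tgt_len = os.path.getsize(target) if os.path.exists(target) else None

    # Append case: target exists, is no longer than the source, and matches
    # the checksum of the source's first tgt_len bytes.
    can_append = (
        tgt_len is not None
        and tgt_len <= src_len
        and prefix_digest(source, tgt_len, chunk)
        == prefix_digest(target, tgt_len, chunk)
    )

    copied = 0
    with open(source, "rb") as src, open(target, "ab" if can_append else "wb") as dst:
        if can_append:
            src.seek(tgt_len)  # skip the part the target already has
        while True:
            block = src.read(chunk)
            if not block:
                break
            dst.write(block)
            copied += len(block)
    return copied
```

In this sketch the return value makes the saving visible: when the target is a strict prefix of the source, only the difference in length is transferred, and a target that diverges from the source falls back to a full overwrite, which matches the current -update behaviour described in the issue.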