Distcp is very slow ------------------- Key: MAPREDUCE-1231 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1231 Project: Hadoop Map/Reduce Issue Type: Bug Components: distcp Affects Versions: 0.20.1 Reporter: Jothi Padmanabhan Assignee: Jothi Padmanabhan Fix For: 0.20.2
Currently distcp does a checksums check in addition to file length check to decide if a remote file has to be copied. If the number of files is high (thousands), this checksum check is proving to be fairly costly leading to a long time before the copy is started. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.