[ 
https://issues.apache.org/jira/browse/HADOOP-8233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16386728#comment-16386728
 ] 

Steve Loughran commented on HADOOP-8233:
----------------------------------------

* 0-byte files can be skipped
* blocksize is trouble: for filesystems other than HDFS, there's no guarantee
that a different blocksize implies a different checksum.

> Turn CRC checking off for 0 byte size and differing blocksizes
> --------------------------------------------------------------
>
>                 Key: HADOOP-8233
>                 URL: https://issues.apache.org/jira/browse/HADOOP-8233
>             Project: Hadoop Common
>          Issue Type: Bug
>    Affects Versions: 0.23.3
>            Reporter: Dave Thompson
>            Assignee: Dave Thompson
>            Priority: Major
>         Attachments: HADOOP-8233-branch-0.23.2.patch
>
>
> DistcpV2 (hadoop-tools/hadoop-distcp/..) can fail with a checksum mismatch, 
> sometimes when copying a 0-byte file. The root cause may be inconsistent 
> HDFS behaviour when creating 0-byte files; however, distcp can avoid the 
> issue by not checking the CRC when the size is zero.
> Further, distcp fails the checksum comparison when copying between two 
> clusters that use different blocksizes. In this case it does not make sense 
> to check the CRC, as it is a guaranteed failure.
> We need to turn CRC checking off for the above two cases.
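
The two skip rules in the description above can be sketched as a small predicate. This is only an illustration, not the attached patch: the class name ChecksumPolicy and the method shouldCompareChecksums are hypothetical, and the lengths and blocksizes are passed in as plain long values so the example runs without Hadoop on the classpath. In a real copy they would presumably come from FileStatus.getLen() and FileStatus.getBlockSize(). Note that the second rule assumes a different blocksize always yields a different checksum, which, as the comment above points out, is not guaranteed for filesystems other than HDFS.

```java
/**
 * Hypothetical helper illustrating the two rules from the issue description.
 * It is not part of the DistCp code base or of the attached patch.
 */
public final class ChecksumPolicy {

    private ChecksumPolicy() {
        // static utility, no instances
    }

    /**
     * Decides whether comparing source and target CRCs is worthwhile.
     *
     * @param sourceLength    length of the source file in bytes
     * @param sourceBlockSize blocksize reported for the source file
     * @param targetBlockSize blocksize reported for the target file
     * @return false when the comparison should be skipped
     */
    public static boolean shouldCompareChecksums(long sourceLength,
                                                 long sourceBlockSize,
                                                 long targetBlockSize) {
        // Rule 1: a 0-byte file has no data to verify.
        if (sourceLength == 0L) {
            return false;
        }
        // Rule 2 (assumption taken from the description): differing
        // blocksizes make the checksums differ, so comparing them would fail.
        if (sourceBlockSize != targetBlockSize) {
            return false;
        }
        return true;
    }

    public static void main(String[] args) {
        long mb = 1024L * 1024L;
        System.out.println(shouldCompareChecksums(0L, 128 * mb, 128 * mb));   // false
        System.out.println(shouldCompareChecksums(4096L, 64 * mb, 128 * mb)); // false
        System.out.println(shouldCompareChecksums(4096L, 128 * mb, 128 * mb)); // true
    }
}
```

Keeping the decision in one place would make it easy to drop or narrow the blocksize rule later, for example to apply it only when both ends are HDFS.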



