Adam Antal created HDFS-13960:
---------------------------------

             Summary: hdfs dfs -checksum command should optionally show block 
size in output
                 Key: HDFS-13960
                 URL: https://issues.apache.org/jira/browse/HDFS-13960
             Project: Hadoop HDFS
          Issue Type: Improvement
          Components: hdfs
            Reporter: Adam Antal


The hdfs checksum command computes the checksum in a distributed manner, which 
would take into account the block size. In other words, the block size 
determines how the file will be broken up.

Therefore itĀ can happen that the checksum command produces different outputs 
for the exact same file only differing in the block size: checksum(fileABlock1) 
+ checksum(fileABlock2) != checksum(fileABlock1 + fileABlock2)

I suggest to add an option to the hdfs dfs -checksum command which would 
displays the block sizeĀ along with the output, and that could also be helpful 
in some other cases where this piece of information is needed.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-dev-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-dev-h...@hadoop.apache.org

Reply via email to