[ 
https://issues.apache.org/jira/browse/HDFS-3859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13443483#comment-13443483
 ] 

Todd Lipcon commented on HDFS-3859:
-----------------------------------

Sure, it's overkill, but it's not that expensive and we already have an 
implementation of it sitting around. It's also handy because "md5sum" is 
commonly available on the command line, and we use it for FSImages already as 
well. Performance-wise, my laptop can md5sum at about 500MB/sec, so given that 
log segments under recovery are likely to be much smaller than 500M, I don't 
think we should be concerned about that.
                
> QJM: implement md5sum verification
> ----------------------------------
>
>                 Key: HDFS-3859
>                 URL: https://issues.apache.org/jira/browse/HDFS-3859
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>    Affects Versions: QuorumJournalManager (HDFS-3077)
>            Reporter: Todd Lipcon
>            Assignee: Todd Lipcon
>
> When the QJM passes journal segments between nodes, it should use an md5sum 
> field to make sure the data doesn't get corrupted during transit. This also 
> serves as an extra safe-guard to make sure that the data is consistent across 
> all nodes when finalizing a segment.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to