[ https://issues.apache.org/jira/browse/HDFS-9719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15186493#comment-15186493 ]
Uma Maheswara Rao G commented on HDFS-9719:
-------------------------------------------

[~drankye], thanks Kai for working on this JIRA.
Overall the changes look good. I have the following comments/questions though.
# I still see some variable names like “toRecoverLen”. Can we take the chance in this patch to rename them, e.g. to toReconstructLen?
# doReadMinimum: this method name looks misleading. It actually reads from the minimum required set of source datanodes, but the name suggests it reads a minimum data length or something similar.
# I think the current refactored names do not represent what the classes actually do. For example, StripedReaders looks like a holder class, but it does more than that. Also, I assume StripedReader itself should handle multiple chunk/cell readers. So how about renaming the classes: StripedReaders -> StripedReader and StripedReader -> StripedChunkReader? The same comment applies to StripedWriter*.

Please check whether this naming makes sense to you. If you have more suggestions for better naming, that would be great.

> Refactoring ErasureCodingWorker into smaller reusable constructs
> ----------------------------------------------------------------
>
>                 Key: HDFS-9719
>                 URL: https://issues.apache.org/jira/browse/HDFS-9719
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>            Reporter: Kai Zheng
>            Assignee: Kai Zheng
>         Attachments: HDFS-9719-v1.patch, HDFS-9719-v2.patch,
> HDFS-9719-v3.patch, HDFS-9719-v4.patch, HDFS-9719-v5.patch, HDFS-9719-v6.patch
>
>
> This proposes refactoring {{ErasureCodingWorker}} into smaller
> constructs that can be reused in other places, such as block group checksum
> computation on the datanode side. As discussed in HDFS-8430 and implemented in
> the HDFS-9694 patch, checksum computation for striped block groups would be
> distributed to the datanodes in the group, where block data must be
> reconstructible when missing or corrupted so that the block checksum can be
> recomputed.
> Most of the needed code is in the current ErasureCodingWorker and could be
> reused in order to avoid duplication. Fortunately, we have very good and
> complete tests, which would make the refactoring much easier. The refactoring
> will also help a lot with subsequent tasks in phase II for non-striping
> erasure coded files and blocks.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)