[ 
https://issues.apache.org/jira/browse/HDFS-9719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15186493#comment-15186493
 ] 

Uma Maheswara Rao G commented on HDFS-9719:
-------------------------------------------

[~drankye], thanks Kai for working on this JIRA. Overall the changes look good; I 
have the following comments/questions, though.

# I am still seeing some variable names like “toRecoverLen”. Can we take the 
chance in this patch to rename them, e.g. to toReconstructLen?
# doReadMinimum: this method name looks wrong. It is actually reading from the 
minimum required number of source datanodes, but the name reads as if it is 
reading some minimum data length.
# I think the current refactored names do not represent what the classes are 
actually doing. For example, StripedReaders looks like a holder class, but 
it is doing more than that. Also, I assume StripedReader itself should handle 
multiple chunk/cell readers. So, how about renaming the classes: 
StripedReaders -> StripedReader and StripedReader -> StripedChunkReader? The 
same comment applies to StripedWriter*.
Please check whether this naming makes sense to you. If you have more 
suggestions for better naming, that would be great.
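
To illustrate the proposed naming, a minimal Java sketch follows. Only the class 
names StripedReader and StripedChunkReader come from the suggestion above; every 
field, constructor, and method (including readFromMinimumSources) is hypothetical 
and is not taken from the patch.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch of the proposed naming; NOT the actual HDFS classes.

/** Reads chunks/cells from one source datanode (proposed name for today's StripedReader). */
class StripedChunkReader {
    private final int sourceIndex; // assumed: index of the source within the block group

    StripedChunkReader(int sourceIndex) {
        this.sourceIndex = sourceIndex;
    }

    int getSourceIndex() {
        return sourceIndex;
    }
}

/** Coordinates several chunk readers (proposed name for today's StripedReaders). */
class StripedReader {
    private final List<StripedChunkReader> chunkReaders = new ArrayList<>();
    private final int minRequiredSources; // assumed: number of sources needed to reconstruct

    StripedReader(int numSources, int minRequiredSources) {
        if (minRequiredSources < 0 || minRequiredSources > numSources) {
            throw new IllegalArgumentException("minRequiredSources out of range");
        }
        this.minRequiredSources = minRequiredSources;
        for (int i = 0; i < numSources; i++) {
            chunkReaders.add(new StripedChunkReader(i));
        }
    }

    /**
     * Hypothetical replacement for a name like doReadMinimum: the name states that
     * "minimum" refers to the number of sources, not to a data length.
     * This sketch only selects the readers; it performs no I/O.
     */
    List<StripedChunkReader> readFromMinimumSources() {
        return new ArrayList<>(chunkReaders.subList(0, minRequiredSources));
    }
}
```

With names along these lines, the coordinating class reads as the single entry 
point, and the per-source class name states that it handles one chunk/cell stream.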

> Refactoring ErasureCodingWorker into smaller reusable constructs
> ----------------------------------------------------------------
>
>                 Key: HDFS-9719
>                 URL: https://issues.apache.org/jira/browse/HDFS-9719
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>            Reporter: Kai Zheng
>            Assignee: Kai Zheng
>         Attachments: HDFS-9719-v1.patch, HDFS-9719-v2.patch, 
> HDFS-9719-v3.patch, HDFS-9719-v4.patch, HDFS-9719-v5.patch, HDFS-9719-v6.patch
>
>
> This proposes refactoring {{ErasureCodingWorker}} into smaller constructs 
> that can be reused in other places, such as block group checksum computing 
> on the datanode side. As discussed in HDFS-8430 and implemented in the 
> HDFS-9694 patch, checksum computing for striped block groups would be 
> distributed to the datanodes in the group, where block data should be 
> reconstructible when missing/corrupted in order to recompute the block 
> checksum. Most of the needed code is in the current ErasureCodingWorker and 
> could be reused to avoid duplication. Fortunately, we have very good and 
> complete tests, which would make the refactoring much easier. The 
> refactoring will also help a lot with subsequent tasks in phase II for 
> non-striping erasure coded files and blocks. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
