[jira] [Commented] (HDFS-17365) EC: Add extra redunency configuration in checkStreamerFailures to prevent data loss.

ASF GitHub Bot (Jira) Wed, 31 Jan 2024 19:49:03 -0800


    [ 
https://issues.apache.org/jira/browse/HDFS-17365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17813029#comment-17813029
 ]


ASF GitHub Bot commented on HDFS-17365:
---------------------------------------

hfutatzhanghb opened a new pull request, #6517:
URL: https://github.com/apache/hadoop/pull/6517

   ### Description of PR
   Refer to HDFS-17365.
   
   Currently, when we write ec files. It can tolerate with at most 
`numAllBlocks - numDataBlocks` failed streamers.
   However, this can lead to block missing exception in some case.
   
   For example, When we use RS-6-3-1024K ec policy,  we have written 6 blocks 
successfully. 
   this block is going to be reconstructed, but before that, one of the 6 
blocks lost, we may lose the data forerver.
   Or if we restart datanode at that time, client will get block missing 
exception.
   
   So, we should better add some redundancy here.
   




> EC: Add extra redunency configuration in checkStreamerFailures to prevent 
> data loss.
> ------------------------------------------------------------------------------------
>
>                 Key: HDFS-17365
>                 URL: https://issues.apache.org/jira/browse/HDFS-17365
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>          Components: ec
>            Reporter: farmmamba
>            Assignee: farmmamba
>            Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org

[jira] [Commented] (HDFS-17365) EC: Add extra redunency configuration in checkStreamerFailures to prevent data loss.

Reply via email to