[jira] [Commented] (HDFS-16999) Fix wrong use of processFirstBlockReport()

ASF GitHub Bot (Jira) Fri, 05 May 2023 08:07:46 -0700


    [ 
https://issues.apache.org/jira/browse/HDFS-16999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17719905#comment-17719905
 ]


ASF GitHub Bot commented on HDFS-16999:
---------------------------------------

zhangshuyan0 commented on PR #5622:
URL: https://github.com/apache/hadoop/pull/5622#issuecomment-1536398861

   @Hexiaoqiao Thanks for your review.  I think missing blocks is a more 
serious problem than performance loss. If the first block report is processed 
using `processFirstBlockReport` when datanode restart, the incorrect metadata 
in Namenode memory will not be fixed until the next block report. In my 
opinion, the processing of the first block report after datanode restart should 
be the same as the processing of subsequent block reports.
   Here are the answers to your questions.
   a. `processFirstBlockReport` is to speed up the startup of NameNode. If 
additional logic is added to it, it will prolong the time that NameNode is in 
safemode.
   b.  A new datanode has no replicas (or very few replicas), and its 
`reports`(param of `blockReport` RPC)  and `DatanodeStorageInfo#BlockIterator` 
almost have no content. So this PR has no effect on adding new datanodes.
   
   




> Fix wrong use of processFirstBlockReport()
> ------------------------------------------
>
>                 Key: HDFS-16999
>                 URL: https://issues.apache.org/jira/browse/HDFS-16999
>             Project: Hadoop HDFS
>          Issue Type: Bug
>            Reporter: Shuyan Zhang
>            Assignee: Shuyan Zhang
>            Priority: Major
>              Labels: pull-request-available
>
> `processFirstBlockReport()` is used to process first block report from 
> datanode. It does not calculating `toRemove` list because it believes that 
> there is no metadata about the datanode in the namenode. However, If a 
> datanode is re registered after restarting, its `blockReportCount` will be 
> updated to 0. That is to say, the first block report after a datanode 
> restarts will be processed by `processFirstBlockReport()`.  This is 
> unreasonable because the metadata of the datanode already exists in namenode 
> at this time, and if redundant replica metadata is not removed in time, the 
> blocks with insufficient replicas cannot be reconstruct in time, which 
> increases the risk of missing block. In summary, `processFirstBlockReport()` 
> should only be used when the namenode restarts, not when the datanode 
> restarts. 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (HDFS-16999) Fix wrong use of processFirstBlockReport()

Reply via email to