[jira] [Work logged] (HDFS-15562) StandbyCheckpointer will do checkpoint repeatedly while connecting observer/active namenode failed

ASF GitHub Bot (Jira) Tue, 03 Nov 2020 14:59:57 -0800


     [ 
https://issues.apache.org/jira/browse/HDFS-15562?focusedWorklogId=507370&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-507370
 ]


ASF GitHub Bot logged work on HDFS-15562:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 03/Nov/20 22:58
            Start Date: 03/Nov/20 22:58
    Worklog Time Spent: 10m 
      Work Description: aihuaxu opened a new pull request #2430:
URL: https://github.com/apache/hadoop/pull/2430


   Standby namenode does the checkpoint and uploads the image to the other 
active/observer namenodes. If other namenodes, e.g., observers are down for 
maintenance, currently uploading will fail and retrigger the checkpoint and 
image uploading immediately. That is causing unnecessary network traffic. This 
patch logs a message when unloading fails and continue with the regular 
checkpoint schedule. 
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
-------------------

            Worklog Id:     (was: 507370)
    Remaining Estimate: 0h
            Time Spent: 10m

> StandbyCheckpointer will do checkpoint repeatedly while connecting 
> observer/active namenode failed
> --------------------------------------------------------------------------------------------------
>
>                 Key: HDFS-15562
>                 URL: https://issues.apache.org/jira/browse/HDFS-15562
>             Project: Hadoop HDFS
>          Issue Type: Bug
>            Reporter: SunHao
>            Assignee: Aihua Xu
>            Priority: Major
>         Attachments: HDFS-15562.patch
>
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> We find the standby namenode will do checkpoint over and over while 
> connecting observer/active namenode failed.
> StandbyCheckpointer won't update “lastCheckpointTime” when upload new fsimage 
> to the other namenode failed, so that the standby namenode will keep doing 
> checkpoint repeatedly.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org

[jira] [Work logged] (HDFS-15562) StandbyCheckpointer will do checkpoint repeatedly while connecting observer/active namenode failed

Reply via email to