[jira] [Commented] (HDFS-2753) Standby namenode stuck in safemode during a failover

Todd Lipcon (Commented) (JIRA) Tue, 10 Jan 2012 13:59:02 -0800

    [ 
https://issues.apache.org/jira/browse/HDFS-2753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13183626#comment-13183626
 ]


Todd Lipcon commented on HDFS-2753:
-----------------------------------

The test adds blocks while the SBN is down, so they queue up in the 
received-block list of that BPServiceActor.
When the SBN restarts, the DN calls register(), then 
reportReceivedDeletedBlocks(), then blockReport(). So the queued received 
blocks always reach the SBN before the full block report does.
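
To make the ordering concrete, here is a rough toy model in Java. It is not 
the Hadoop code: FakeNameNode, FakeActor, and simulate() are hypothetical 
stand-ins invented for illustration, and only the three RPC names come from 
the comment above. It just records the order in which a restarted NN would 
see the calls.

```java
import java.util.ArrayList;
import java.util.List;

// Toy model only: every class here is a hypothetical stand-in, not Hadoop source.
class ReconnectOrderSketch {

    /** Records the order in which a pretend NameNode sees calls. */
    static class FakeNameNode {
        final List<String> calls = new ArrayList<>();

        void register() {
            calls.add("register");
        }

        void reportReceivedDeletedBlocks(List<Long> blockIds) {
            calls.add("reportReceivedDeletedBlocks(" + blockIds.size() + ")");
        }

        void blockReport(List<Long> allBlockIds) {
            calls.add("blockReport(" + allBlockIds.size() + ")");
        }
    }

    /** Pretend per-NameNode actor on the DataNode side. */
    static class FakeActor {
        private final List<Long> pendingReceived = new ArrayList<>();
        private final List<Long> allBlocks = new ArrayList<>();

        // Blocks written while the NameNode is down are stored and queued.
        void blockReceivedWhileNameNodeDown(long blockId) {
            allBlocks.add(blockId);
            pendingReceived.add(blockId);
        }

        // On reconnect: register, flush the queued received blocks,
        // then send the full block report.
        void reconnect(FakeNameNode nn) {
            nn.register();
            nn.reportReceivedDeletedBlocks(new ArrayList<>(pendingReceived));
            pendingReceived.clear();
            nn.blockReport(new ArrayList<>(allBlocks));
        }
    }

    /** Queues three blocks while the NN is "down", then reconnects. */
    static List<String> simulate() {
        FakeActor actor = new FakeActor();
        actor.blockReceivedWhileNameNodeDown(1L);
        actor.blockReceivedWhileNameNodeDown(2L);
        actor.blockReceivedWhileNameNodeDown(3L);

        FakeNameNode restartedStandby = new FakeNameNode();
        actor.reconnect(restartedStandby);
        return restartedStandby.calls;
    }

    public static void main(String[] args) {
        for (String call : simulate()) {
            System.out.println(call);
        }
    }
}
```

Running main prints register, reportReceivedDeletedBlocks(3), and 
blockReport(3), in that order, which is the sequence the test relies on.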

If you comment out the fix, the test case reliably fails with the error you 
described (stuck in safemode).
                
> Standby namenode stuck in safemode during a failover
> ----------------------------------------------------
>
>                 Key: HDFS-2753
>                 URL: https://issues.apache.org/jira/browse/HDFS-2753
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>          Components: ha, name-node
>    Affects Versions: HA branch (HDFS-1623)
>            Reporter: Hari Mankude
>            Assignee: Hari Mankude
>         Attachments: HDFS-2753.patch, hdfs-2753.txt
>
>
> Write traffic is initiated from the client. Manual failover is done by 
> killing the NN and converting a different standby to active. The NN is 
> restarted as a standby. The restarted standby stays in safemode forever. 
> More information in the description.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
