[ 
https://issues.apache.org/jira/browse/YARN-5685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15530642#comment-15530642
 ] 

Daniel Templeton commented on YARN-5685:
----------------------------------------

The issue is more than that change from YARN-4559.  I'm still digging, but even 
with that issue resolved the RMs are still all stuck in standby because the 
state store isn't started until the RM transitions to active, but it doesn't 
transition to active unless the state store is started.

> Non-embedded HA failover is broken
> ----------------------------------
>
>                 Key: YARN-5685
>                 URL: https://issues.apache.org/jira/browse/YARN-5685
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: resourcemanager
>    Affects Versions: 2.9.0, 3.0.0-alpha1
>            Reporter: Daniel Templeton
>            Assignee: Daniel Templeton
>            Priority: Critical
>
> YARN-4559 broke RM HA when embedded automatic failover is disabled.  The 
> {{ZKRMStateStore}} will now only start its monitoring thread when automatic 
> failover not enabled (which is patently useless).  I presume the intended 
> change was to have the monitoring thread started when automatic failover is 
> not *embedded*.
> If HA is enabled with automatic failover enabled and embedded failover 
> disabled, all RMs all come up in standby state.  To make one of them active, 
> the {{--forcemanual}} flag must be used when manually triggering the state 
> change.  Should the active go down, the standby will not become active and 
> must be manually transitioned with the {{--forcemanual}} flag.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org

Reply via email to