[ https://issues.apache.org/jira/browse/YARN-4041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14681217#comment-14681217 ]
Rohith Sharma K S commented on YARN-4041:
-----------------------------------------

Recently I faced a similar issue in a test cluster where around 60 apps were running. On RM switch, each application took around 8 minutes to renew its delegation token, which comes to 8 min * 60 apps = 480 minutes (8 hours) for recovery. YARN-3639 was raised for the same issue.

> Slow delegation token renewal can severely prolong RM recovery
> --------------------------------------------------------------
>
>                 Key: YARN-4041
>                 URL: https://issues.apache.org/jira/browse/YARN-4041
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: resourcemanager
>    Affects Versions: 2.6.0
>            Reporter: Jason Lowe
>            Assignee: Sunil G
>
> When the RM does a work-preserving restart it synchronously tries to renew
> delegation tokens for every active application. If a token server happens to
> be down or is running slow and a lot of the active apps were using tokens
> from that server then it can have a huge impact on the time it takes the RM
> to process the restart.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
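
To make the arithmetic in the comment concrete, here is a minimal back-of-the-envelope sketch. It assumes every renewal takes the same fixed time and that renewals run on a fixed number of worker threads; one thread models the synchronous behaviour described in the issue. The class name, method name, and the 10-thread figure are hypothetical and are not taken from the YARN code base or from any patch on this issue.

```java
// Hypothetical estimator, not part of Hadoop YARN.
class RenewalEstimate {

    /**
     * Wall-clock minutes needed to renew tokens for `apps` applications when
     * each renewal takes `minutesPerApp` and `threads` renewals run at once.
     * threads == 1 models strictly serial (synchronous) renewal.
     */
    static long totalMinutes(int apps, long minutesPerApp, int threads) {
        if (apps < 0 || minutesPerApp < 0 || threads < 1) {
            throw new IllegalArgumentException("apps and minutesPerApp must be >= 0, threads >= 1");
        }
        // Number of "rounds" of renewals: ceil(apps / threads).
        long rounds = ((long) apps + threads - 1) / threads;
        return rounds * minutesPerApp;
    }

    public static void main(String[] args) {
        // Figures from the comment: 60 apps, about 8 minutes each, renewed one by one.
        System.out.println(totalMinutes(60, 8, 1));   // prints 480
        // Assumed alternative: the same load spread over 10 workers.
        System.out.println(totalMinutes(60, 8, 10));  // prints 48
    }
}
```

The serial case reproduces the 480 minutes reported above. The parallel figure only illustrates how strongly recovery time depends on renewals being serialized; it says nothing about how the issue was or should be fixed.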