[ https://issues.apache.org/jira/browse/YARN-3979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14646510#comment-14646510 ]
Rohith Sharma K S commented on YARN-3979:
-----------------------------------------

Oops, 50 lakh (5 million) events! I checked the attached logs, but since you attached only ERROR logs, I was not able to trace the problem. One observation is that there are many InvalidStateTransitions on CLEAN_UP events in RMNodeImpl.
# Could you provide the RM logs? If you are not able to attach them to JIRA, could you send them to me by mail?
# Could you give more information, such as: What is the cluster size? How many apps are running? How many have completed? What is the state of the NodeManagers, i.e. are they running or in some other state? Which version of Hadoop are you using?

> Am in ResourceLocalizationService hang 10 min cause RM kill AM
> ---------------------------------------------------------------
>
>                 Key: YARN-3979
>                 URL: https://issues.apache.org/jira/browse/YARN-3979
>             Project: Hadoop YARN
>          Issue Type: Bug
>   Affects Versions: 2.2.0
>         Environment: CentOS 6.5 Hadoop-2.2.0
>            Reporter: zhangyubiao
>         Attachments: ERROR103.log
>
>
> 2015-07-27 02:46:17,348 INFO org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: Created localizer for container_1437735375558_104282_01_000001
> 2015-07-27 02:56:18,510 INFO SecurityLogger.org.apache.hadoop.ipc.Server: Auth successful for appattempt_1437735375558_104282_000001 (auth:SIMPLE)
> 2015-07-27 02:56:18,510 INFO SecurityLogger.org.apache.hadoop.security.authorize.ServiceAuthorizationManager: Authorization successful for appattempt_1437735375558_104282_000001 (auth:TOKEN) for protocol=interface org.apache.hadoop.yarn.api.ContainerManagementProtocolPB

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)