lujie created YARN-9223: --------------------------- Summary: NPE happens in NM while loading recovery fails Key: YARN-9223 URL: https://issues.apache.org/jira/browse/YARN-9223 Project: Hadoop YARN Issue Type: Bug Reporter: lujie Assignee: lujie
In org.apache.hadoop.yarn.server.nodemanager.NodeManager.serviceInit: {code:java} try { initAndStartRecoveryStore(conf); } catch (IOException e) { String recoveryDirName = conf.get(YarnConfiguration.NM_RECOVERY_DIR); throw new YarnRuntimeException("Unable to initialize recovery directory at " + recoveryDirName, e); } ............ this.context = createNMContext(containerTokenSecretManager, nmTokenSecretManager, nmStore, isDistSchedulingEnabled, conf); {code} while Recovery fails, the context is null, and YarnRuntimeException will cause serviceStop to run(due to Shutdown Hook): {code:java} // Cleanup ResourcePluginManager ResourcePluginManager rpm = context.getResourcePluginManager(); {code} hence NPE happens: {code:java} java.lang.NullPointerException at org.apache.hadoop.yarn.server.nodemanager.NodeManager.serviceStop(NodeManager.java:530) at org.apache.hadoop.service.AbstractService.stop(AbstractService.java:220) at org.apache.hadoop.service.ServiceOperations.stop(ServiceOperations.java:54) at org.apache.hadoop.service.ServiceOperations.stopQuietly(ServiceOperations.java:102) at org.apache.hadoop.service.AbstractService.init(AbstractService.java:172) at org.apache.hadoop.yarn.server.nodemanager.NodeManager.initAndStartNodeManager(NodeManager.java:984) at org.apache.hadoop.yarn.server.nodemanager.NodeManager.main(NodeManager.java:1064) {code} -- This message was sent by Atlassian JIRA (v7.6.3#76005) --------------------------------------------------------------------- To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org