Haibo Chen created MAPREDUCE-6657:
-------------------------------------

             Summary: job history server can fail on startup when NameNode is in start phase
                 Key: MAPREDUCE-6657
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6657
             Project: Hadoop Map/Reduce
          Issue Type: Bug
          Components: jobhistoryserver
            Reporter: Haibo Chen
            Assignee: Haibo Chen

On startup, the job history server tries to create a history directory in HDFS. If the NameNode is in safe mode, the server keeps retrying for a configurable time period. However, it should also keep retrying if the NameNode is still starting up, because safe mode does not begin until the NN is out of its startup phase. While the NN is in its internal service startup phase, it throws a RetriableException with the text "NameNode still not started". We should add a check for this specific exception in isBecauseSafeMode() to account for that.
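
A minimal sketch of the proposed check, written as a standalone class with no Hadoop dependency. The class name, the helper names other than isBecauseSafeMode(), and the "SafeModeException" marker string are assumptions made for illustration; only the "NameNode still not started" text comes from this issue. The actual patch may fold the new condition directly into isBecauseSafeMode() rather than splitting it out as done here.

```java
import java.io.IOException;

// Hypothetical stand-in for the retry decision in the job history server.
final class StartupRetryCheck {
    // Assumed marker for the existing safe-mode check (not taken from this issue).
    static final String SAFE_MODE_MARKER = "SafeModeException";
    // Message text quoted in this issue for a NameNode that is still starting.
    static final String NN_NOT_STARTED_MARKER = "NameNode still not started";

    // True if the exception text indicates the NameNode is in safe mode.
    static boolean isBecauseSafeMode(Throwable ex) {
        return ex.toString().contains(SAFE_MODE_MARKER);
    }

    // True if the exception text indicates the NameNode has not finished starting.
    static boolean isBecauseNameNodeStarting(Throwable ex) {
        return ex.toString().contains(NN_NOT_STARTED_MARKER);
    }

    // Walks the cause chain and reports whether directory creation should be retried.
    static boolean shouldRetry(Throwable ex) {
        for (Throwable t = ex; t != null; t = t.getCause()) {
            if (isBecauseSafeMode(t) || isBecauseNameNodeStarting(t)) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        Throwable starting = new IOException("NameNode still not started");
        Throwable other = new IOException("Permission denied");
        System.out.println(shouldRetry(starting));
        System.out.println(shouldRetry(other));
    }
}
```

With this shape, the startup loop would keep retrying while shouldRetry() returns true and the configured time period has not elapsed, and would fail immediately on any other exception.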