I have a cluster running Cloudera's 0.20.1+152-1 version of Hadoop. All was
well, but there was an unfortunate power outage that affected just the
namenode. Everything seemed largely normal upon resumption (I did have to
recreate the local version of hadoop.tmp.dir to get the namenode to start),
but now I find that none of the status webpages is working: Jetty is
returning 404s for everything. The actual JobTracker appears fine: I am
able to submit jobs and get results. Here's what I see:

$ telnet localhost 50030
Trying 127.0.0.1...
Connected to localhost.
Escape character is '^]'.
GET /jobtracker.jsp HTTP/1.1
Host: localhost

HTTP/1.1 404 /jobtracker.jsp
Content-Type: text/html; charset=iso-8859-1
Cache-Control: must-revalidate,no-cache,no-store
Content-Length: 1412
Server: Jetty(6.1.14)

<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1"/>
<title>Error 404 /jobtracker.jsp</title>
</head>
<body><h2>HTTP ERROR: 404</h2><pre>/jobtracker.jsp</pre>
<p>RequestURI=/jobtracker.jsp</p><p><i><small><a
href="http://jetty.mortbay.org/">Powered by
Jetty://</a></small></i></p><br/>
<br/>
<br/>
<br/>
<br/>
<br/>
<br/>
<br/>
<br/>
<br/>
<br/>
<br/>
<br/>
<br/>
<br/>
<br/>
<br/>
<br/>
<br/>
<br/>
</body>
</html>
^]
telnet> quit

In contrast, another cluster running the slightly more up-to-date
0.20.1+169.68.1 returns what you'd expect, e.g.

$ telnet localhost 50030
Trying 127.0.0.1...
Connected to localhost.
Escape character is '^]'.
GET /jobtracker.jsp HTTP/1.1
Host: localhost

HTTP/1.1 200 OK
Content-Type: text/html; charset=utf-8
Expires: Thu, 01 Jan 1970 00:00:00 GMT
Set-Cookie: JSESSIONID=12c1udmu09jok;Path=/
Content-Length: 2851
Server: Jetty(6.1.14)

<html>
<head>
<title>hdp-nn-pri Hadoop Map/Reduce Administration</title>
<link rel="stylesheet" type="text/css" href="/static/hadoop.css">
<link rel="icon" type="image/vnd.microsoft.icon"
href="/static/images/favicon.ico" />
<script type="text/javascript" src="/static/jobtracker.js"></script>
</head>
<body>
<h1>hdp-nn-pri Hadoop Map/Reduce Administration</h1>
.
.
.
etc.
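
For anyone who wants to repeat the check without typing into telnet, here is a rough Python sketch of the same probe. It assumes the JobTracker web UI is on localhost:50030, as in the sessions above. The function names parse_status_line and probe are made up for illustration; they are not part of Hadoop or Jetty.

```python
import http.client


def parse_status_line(line):
    """Split an HTTP status line into (version, code, reason)."""
    parts = line.strip().split(" ", 2)
    reason = parts[2] if len(parts) > 2 else ""
    return parts[0], int(parts[1]), reason


def probe(host, port, path, timeout=5.0):
    """Send a GET request and return (status code, Server header)."""
    conn = http.client.HTTPConnection(host, port, timeout=timeout)
    try:
        conn.request("GET", path)
        resp = conn.getresponse()
        resp.read()  # drain the body so the connection closes cleanly
        return resp.status, resp.getheader("Server")
    finally:
        conn.close()


if __name__ == "__main__":
    # Both paths appear in the transcripts above; host and port are
    # assumptions taken from the telnet command line.
    for path in ("/jobtracker.jsp", "/static/hadoop.css"):
        try:
            print(path, *probe("localhost", 50030, path))
        except (OSError, http.client.HTTPException) as exc:
            print(path, "request failed:", exc)
```

Run against the broken cluster, this should report 404 for /jobtracker.jsp. Whether the static files also 404 would show if the whole web context is missing or only the JSPs.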
I assume these pages are served out of the "webapp" directory, and that
appears identical between the two clusters: I did a recursive diff. I've also
looked at a bunch of other things and don't see any problems, so I'm kind of
at my wits' end at the moment.
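
The recursive comparison mentioned above could also be reproduced along these lines. This is only a sketch: it assumes both directory trees have been copied onto one machine, and diff_trees is a hypothetical helper name, not anything shipped with Hadoop.

```python
import filecmp
import os


def diff_trees(left, right):
    """Return sorted (kind, relative path) pairs for entries that differ."""
    findings = []

    def walk(cmp, prefix):
        for name in cmp.left_only:
            findings.append(("only in left", os.path.join(prefix, name)))
        for name in cmp.right_only:
            findings.append(("only in right", os.path.join(prefix, name)))
        # Note: dircmp compares os.stat() signatures first, so two files with
        # the same size and mtime are treated as equal without reading them.
        for name in cmp.diff_files:
            findings.append(("differs", os.path.join(prefix, name)))
        for name, sub in cmp.subdirs.items():
            walk(sub, os.path.join(prefix, name))

    walk(filecmp.dircmp(left, right), "")
    return sorted(findings)
```

An empty result from diff_trees only says the two copies match; it says nothing about whatever Jetty has unpacked or cached elsewhere on the node.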
Any suggestions would be most appreciated.
--
Robert Crocombe