----- Original Message -----
From: Jay Vyas <jayunit...@gmail.com>
Date: Saturday, October 29, 2011 8:27 pm
Subject: can't format namenode....
To: common-user@hadoop.apache.org

> Hi guys : In order to fix some issues im having (recently posted), 
> I'vedecided to try to make sure my name node is formatted.... But 
> the formatting
> fails (see 1 below) .....
> 
> So... To trace the failure,  I figured I would grep through all log 
> filesfor exceptions.
> I've curated the results here ... does this look familiar to anyone ?
> 
> Clearly, something is very wrong with my CDH hadoop installation.
> 
> 1) To attempt to solve this, I figured I would format my namenode.  
> Oddly,when I run "hadoop -namenode format" I get the following 
> stack trace :
> 
> 11/10/29 14:39:37 INFO namenode.NameNode: STARTUP_MSG:
> /************************************************************
> STARTUP_MSG: Starting NameNode
> STARTUP_MSG:   host = localhost.localdomain/127.0.0.1
> STARTUP_MSG:   args = [-format]
> STARTUP_MSG:   version = 0.20.2-cdh3u1
> STARTUP_MSG:   build = file:///tmp/topdir/BUILD/hadoop-0.20.2-
> cdh3u1 -r
> bdafb1dbffd0d5f2fbc6ee022e1c8df6500fd638; compiled by 'root' on Mon 
> Jul 18
> 09:40:22 PDT 2011
> ************************************************************/
> Re-format filesystem in /var/lib/hadoop-0.20/cache/hadoop/dfs/name 
> ? (Y or
> N) Y
> 11/10/29 14:39:40 INFO util.GSet: VM type       = 64-bit
> 11/10/29 14:39:40 INFO util.GSet: 2% max memory = 19.33375 MB
> 11/10/29 14:39:40 INFO util.GSet: capacity      = 2^21 = 2097152 
> entries11/10/29 14:39:40 INFO util.GSet: recommended=2097152, 
> actual=209715211/10/29 14:39:40 INFO namenode.FSNamesystem: 
> fsOwner=cloudera11/10/29 14:39:40 INFO namenode.FSNamesystem: 
> supergroup=supergroup11/10/29 14:39:40 INFO namenode.FSNamesystem: 
> isPermissionEnabled=false11/10/29 14:39:40 INFO namenode.FSNamesystem:
> dfs.block.invalidate.limit=1000
> 11/10/29 14:39:40 INFO namenode.FSNamesystem: 
> isAccessTokenEnabled=falseaccessKeyUpdateInterval=0 min(s), 
> accessTokenLifetime=0 min(s)
> 11/10/29 14:39:41 ERROR namenode.NameNode: java.io.IOException: Cannot
> remove current directory: /var/lib/hadoop-
> 0.20/cache/hadoop/dfs/name/current    at
> org.apache.hadoop.hdfs.server.common.Storage$StorageDirectory.clearDirectory(Storage.java:303)
>    at
> org.apache.hadoop.hdfs.server.namenode.FSImage.format(FSImage.java:1244)
>    at
> org.apache.hadoop.hdfs.server.namenode.FSImage.format(FSImage.java:1263)
>    at
> org.apache.hadoop.hdfs.server.namenode.NameNode.format(NameNode.java:1100)
>    at
> org.apache.hadoop.hdfs.server.namenode.NameNode.createNameNode(NameNode.java:1217)
>    at
> org.apache.hadoop.hdfs.server.namenode.NameNode.main(NameNode.java:1233)
> 
> 11/10/29 14:39:41 INFO namenode.NameNode: SHUTDOWN_MSG:
> /************************************************************
> SHUTDOWN_MSG: Shutting down NameNode at 
> localhost.localdomain/127.0.0.1************************************************************/
> 

Are you able to remove this directory explicitely?
  /var/lib/hadoop-0.20/cache/hadoop/dfs/name/current
> 
> 2) Here are the exceptions (abridged , i removed repetetive parts 
> regarding"replicated to 0 nodes instead of 1"
This is because, the file is not replicated to minimum replication (1).
> 
> 2011-10-28 22:36:52,669 INFO org.apache.hadoop.ipc.Server: IPC Server
> handler 9 on 8020, call
> addBlock(/var/lib/hadoop-
> 0.20/cache/mapred/mapred/system/jobtracker.info,DFSClient_-
> 134960056, null) from 127.0.0.1:35163: error:
> java.io.IOException: File /var
> lib/hadoop-0.20/cache/mapred/mapred/system/jobtracker.info could 
> only be
> replicated to 0 nodes, instead of 1
> java.io.IOException: File /var/lib/hadoop-
> 0.20/cache/mapred/mapred/system/jobtracker.info could only be 
> replicated to 0 nodes, instead of 1
> STARTUP_MSG:   host = java.net.UnknownHostException: gcrc15.uchc.net:
> gcrc15.uchc.net
> java.net.UnknownHostException: gcrc15.uchc.net: gcrc15.uchc.net
> java.net.UnknownHostException: gcrc15.uchc.net: gcrc15.uchc.net
> java.net.UnknownHostException: gcrc15.uchc.net: gcrc15.uchc.net
> java.net.UnknownHostException: gcrc15.uchc.net: gcrc15.uchc.net
> java.net.UnknownHostException: gcrc15.uchc.net: gcrc15.uchc.net
> 2011-10-28 22:30:03,413 WARN org.apache.hadoop.hdfs.DFSClient: 
> DataStreamerException: org.apache.hadoop.ipc.RemoteException: 
> java.io.IOException: File
> /var/lib/hadoop-0.20/cache/mapred/mapred/system/jobtracker.info 
> could only
> be replicated to 0 nodes, in
> tead of 1
> .......................... REPEATED SEVERAL TIMES 
> ......................2011-10-28 22:36:52,716 WARN 
> org.apache.hadoop.hdfs.DFSClient: DataStreamer
> Exception: org.apache.hadoop.ipc.RemoteException: 
> java.io.IOException: File
> /var/lib/hadoop-0.20/cache/mapred/mapred/system/jobtracker.info 
> could only
> be replicated to 0 nodes, in
> tead of 1
> org.apache.hadoop.ipc.RemoteException: java.io.IOException: File
> /var/lib/hadoop-0.20/cache/mapred/mapred/system/jobtracker.info 
> could only
> be replicated to 0 nodes, instead of 1
> STARTUP_MSG:   host = java.net.UnknownHostException: gcrc15.uchc.net:
> gcrc15.uchc.net
> java.net.UnknownHostException: gcrc15.uchc.net: gcrc15.uchc.net
> java.net.UnknownHostException: gcrc15.uchc.net: gcrc15.uchc.net
> STARTUP_MSG:   host = java.net.UnknownHostException: gcrc15.uchc.net:
> gcrc15.uchc.net
> java.net.UnknownHostException: gcrc15.uchc.net: gcrc15.uchc.net
> 2011-10-28 22:29:48,131 ERROR
> org.apache.hadoop.hdfs.server.datanode.DataNode:
> java.net.UnknownHostException: gcrc15.uchc.net: gcrc15.uchc.net
> SHUTDOWN_MSG: Shutting down DataNode at java.net.UnknownHostException:
> gcrc15.uchc.net: gcrc15.uchc.net
> STARTUP_MSG:   host = java.net.UnknownHostException: gcrc15.uchc.net:
> gcrc15.uchc.net
> java.net.UnknownHostException: gcrc15.uchc.net: gcrc15.uchc.net
> 2011-10-28 22:30:03,406 INFO org.apache.hadoop.ipc.Server: IPC Server
> handler 3 on 8020, call
> addBlock(/var/lib/hadoop-
> 0.20/cache/mapred/mapred/system/jobtracker.info,DFSClient_157537399, null) 
> from 127.0.0.1:33519: error: java.io.IOException:
> File /var/
> ib/hadoop-0.20/cache/mapred/mapred/system/jobtracker.info could 
> only be
> replicated to 0 nodes, instead of 1
> java.io.IOException: File /var/lib/hadoop-
> 0.20/cache/mapred/mapred/system/jobtracker.info could only be 
> replicated to 0 nodes, instead of 1
> ...............................(repeated several times)
> .................................
> STARTUP_MSG:   host = java.net.UnknownHostException: gcrc15.uchc.net:
> gcrc15.uchc.net
> java.net.UnknownHostException: gcrc15.uchc.net: gcrc15.uchc.net
> java.net.UnknownHostException: gcrc15.uchc.net: gcrc15.uchc.net
> 2011-10-28 22:30:05,066 ERROR org.apache.hadoop.mapred.TaskTracker: 
> Can not
> start task tracker because java.net.UnknownHostException: 
> gcrc15.uchc.net:gcrc15.uchc.net
> SHUTDOWN_MSG: Shutting down TaskTracker at 
> java.net.UnknownHostException:gcrc15.uchc.net: gcrc15.uchc.net
> 
Did you configure the host mappings correctly in /etc/hosts file?
Also can you please check, whether the DNs are sending heartbeats to NN.

Regards,
Uma

Reply via email to