Hello George,

Have you looked at your DFS health page (http://NN:50070/)? I believe
you have missing or fallen DataNode instances.

I'd start them back up, after checking their (DataNode's) logs to
figure out why they died.

On Wed, Sep 21, 2011 at 7:28 PM, George Kousiouris
<gkous...@mail.ntua.gr> wrote:
>
> Hi all,
>
> We are trying to run a mahout job in a hadoop cluster, but we keep getting
> the same status. The job passes the initial mahout stages and when it comes
> to be executed as a MR job, it seems to be stuck at 0% progress. Through the
> UI we see that it is submitted but not running. After a while it gets
> killed. In the logs the error shown is this one:
>
> 2011-09-21 07:47:50,507 INFO org.apache.hadoop.mapred.JobTracker: problem
> cleaning system directory:
> hdfs://master/var/lib/hadoop-0.20/cache/hdfs/mapred/system
> org.apache.hadoop.ipc.RemoteException:
> org.apache.hadoop.hdfs.server.namenode.SafeModeException: Cannot create
> directory /var/lib/hadoop-0.20/cache/hdfs/mapred/system. Name nod$
> The reported blocks 0 needs additional 12 blocks to reach the threshold
> 0.9990 of total blocks 13. Safe mode will be turned off automatically.
>        at
> org.apache.hadoop.hdfs.server.namenode.FSNamesystem.mkdirsInternal(FSNamesystem.java:1966)
>        at
> org.apache.hadoop.hdfs.server.namenode.FSNamesystem.mkdirs(FSNamesystem.java:1940)
>        at
> org.apache.hadoop.hdfs.server.namenode.NameNode.mkdirs(NameNode.java:770)
>        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>        at
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>        at
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>        at java.lang.reflect.Method.invoke(Method.java:597)
>
>
> Some staging files seem to have been created however.
>
> I was thinking of sending this to the mahout mailing list but it seems a
> more core hadoop issue.
>
> We are using the following command to launch the mahout example:
> ./mahout org.apache.mahout.clustering.syntheticcontrol.kmeans.Job --input
> hdfs://master/user/hdfs/testdata/synthetic_control.data --output
> hdfs://master/user/hdfs/testdata/output --t1 0.5 --t2 1 --maxIter 50
>
> Any clues?
> George
>
> --
>
> ---------------------------
>
> George Kousiouris
> Electrical and Computer Engineer
> Division of Communications,
> Electronics and Information Engineering
> School of Electrical and Computer Engineering
> Tel: +30 210 772 2546
> Mobile: +30 6939354121
> Fax: +30 210 772 2569
> Email: gkous...@mail.ntua.gr
> Site: http://users.ntua.gr/gkousiou/
>
> National Technical University of Athens
> 9 Heroon Polytechniou str., 157 73 Zografou, Athens, Greece
>
>



-- 
Harsh J

Reply via email to