JobTracker and TaskTrackers fail with a misleading error if one of the 
mapreduce.cluster.dir has unusable permissions / is unavailable.
---------------------------------------------------------------------------------------------------------------------------------------

                 Key: MAPREDUCE-2390
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2390
             Project: Hadoop Map/Reduce
          Issue Type: Bug
          Components: jobtracker, tasktracker
    Affects Versions: 0.20.2
         Environment: CDH3 and Apache 0.20 || Linux
            Reporter: Harsh J Chouraria
            Assignee: Harsh J Chouraria


To reproduce, have a mapred.local.dir property set to a few directories. Before 
starting up the JT, set one of these directories' permission as 'd---------', 
and then start the JT/TT. The JT, although it tries to ignore this directory, 
fails with an odd and misleading message claiming that its configured address 
in use.

Fixing the permission clears this issue!

This was also reported in the mailing lists by Ted Yu, quite a few months ago. 
But I had forgotten about filing a bug for it here. Still seems to happen. A 
log is attached below.

{code}
2011-03-17 00:40:32,321 WARN org.apache.hadoop.mapred.JobTracker: Error 
starting tracker: java.io.IOException: Cannot create toBeDeleted in 
/home/hack/.tmplocalz/2
        at 
org.apache.hadoop.util.MRAsyncDiskService.<init>(MRAsyncDiskService.java:86)
        at org.apache.hadoop.mapred.JobTracker.<init>(JobTracker.java:2189)
        at org.apache.hadoop.mapred.JobTracker.<init>(JobTracker.java:2022)
        at org.apache.hadoop.mapred.JobTracker.startTracker(JobTracker.java:276)
        at org.apache.hadoop.mapred.JobTracker.startTracker(JobTracker.java:268)
        at org.apache.hadoop.mapred.JobTracker.main(JobTracker.java:4712)

2011-03-17 00:40:33,322 INFO 
org.apache.hadoop.security.token.delegation.AbstractDelegationTokenSecretManager:
 Updating the current master key for generating delegation tokens
2011-03-17 00:40:33,322 INFO 
org.apache.hadoop.security.token.delegation.AbstractDelegationTokenSecretManager:
 Starting expired delegation token remover thread, tokenRemoverScanInterval=60 
min(s)
2011-03-17 00:40:33,322 INFO 
org.apache.hadoop.security.token.delegation.AbstractDelegationTokenSecretManager:
 Updating the current master key for generating delegation tokens
2011-03-17 00:40:33,322 INFO org.apache.hadoop.mapred.JobTracker: Scheduler 
configured with (memSizeForMapSlotOnJT, memSizeForReduceSlotOnJT, 
limitMaxMemForMapTasks, limitMaxMemForReduceTasks) (-1, -1, -1, -1)
2011-03-17 00:40:33,322 INFO org.apache.hadoop.util.HostsFileReader: Refreshing 
hosts (include/exclude) list
2011-03-17 00:40:33,350 INFO org.apache.hadoop.mapred.JobTracker: Starting 
jobtracker with owner as hack
2011-03-17 00:40:33,351 FATAL org.apache.hadoop.mapred.JobTracker: 
java.net.BindException: Problem binding to localhost/127.0.0.1:8021 : Address 
already in use
        at org.apache.hadoop.ipc.Server.bind(Server.java:227)
        at org.apache.hadoop.ipc.Server$Listener.<init>(Server.java:314)
        at org.apache.hadoop.ipc.Server.<init>(Server.java:1411)
        at org.apache.hadoop.ipc.RPC$Server.<init>(RPC.java:510)
        at org.apache.hadoop.ipc.RPC.getServer(RPC.java:471)
        at org.apache.hadoop.mapred.JobTracker.<init>(JobTracker.java:2112)
        at org.apache.hadoop.mapred.JobTracker.<init>(JobTracker.java:2022)
        at org.apache.hadoop.mapred.JobTracker.startTracker(JobTracker.java:276)
        at org.apache.hadoop.mapred.JobTracker.startTracker(JobTracker.java:268)
        at org.apache.hadoop.mapred.JobTracker.main(JobTracker.java:4712)
Caused by: java.net.BindException: Address already in use
        at sun.nio.ch.Net.bind(Native Method)
        at 
sun.nio.ch.ServerSocketChannelImpl.bind(ServerSocketChannelImpl.java:126)
        at sun.nio.ch.ServerSocketAdaptor.bind(ServerSocketAdaptor.java:59)
        at org.apache.hadoop.ipc.Server.bind(Server.java:225)
        ... 9 more

2011-03-17 00:40:33,352 INFO org.apache.hadoop.mapred.JobTracker: SHUTDOWN_MSG: 
/************************************************************
SHUTDOWN_MSG: Shutting down JobTracker at QDuo/127.0.0.1
************************************************************/
{code}

The list conversation in context, at {{search-hadoop.com}}:
http://search-hadoop.com/m/FzN7iqreL/problem+starting+cdh3b2+jobtracker&subj=problem+starting+cdh3b2+jobtracker

I'll try to investigate and post the exact problem / solution soon.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to