[ https://issues.apache.org/jira/browse/HADOOP-4744?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12651960#action_12651960 ]
Amareshwari Sriramadasu commented on HADOOP-4744:
-------------------------------------------------
The log for a reducer waiting for a map output from that machine:
2008-12-01 10:00:19,501 INFO org.apache.hadoop.mapred.ReduceTask: Task
attempt_200812010806_0337_r_000000_1: Failed fetch #3 from
attempt_200812010806_0337_m_002787_0
2008-12-01 10:00:19,501 WARN org.apache.hadoop.mapred.ReduceTask:
attempt_200812010806_0337_r_000000_1 adding host null to penalty box, next
contact in 16 seconds
2008-12-01 10:00:20,503 INFO org.apache.hadoop.mapred.ReduceTask:
attempt_200812010806_0337_r_000000_1: Got 1 map-outputs from previous failures
2008-12-01 10:00:35,510 INFO org.apache.hadoop.mapred.ReduceTask:
attempt_200812010806_0337_r_000000_1 Need another 2 map output(s) where 0 is
already in progress
2008-12-01 10:00:35,511 INFO org.apache.hadoop.mapred.ReduceTask:
attempt_200812010806_0337_r_000000_1: Got 0 new map-outputs
2008-12-01 10:00:35,511 INFO org.apache.hadoop.mapred.ReduceTask:
attempt_200812010806_0337_r_000000_1 Scheduled 1 outputs (0 slow hosts and0 dup
hosts)
2008-12-01 10:00:35,516 WARN org.apache.hadoop.mapred.ReduceTask:
attempt_200812010806_0337_r_000000_1 copy failed:
attempt_200812010806_0337_m_002787_0 from null
2008-12-01 10:00:35,518 WARN org.apache.hadoop.mapred.ReduceTask:
java.net.ConnectException: Connection refused
at java.net.PlainSocketImpl.socketConnect(Native Method)
at java.net.PlainSocketImpl.doConnect(PlainSocketImpl.java:333)
at java.net.PlainSocketImpl.connectToAddress(PlainSocketImpl.java:195)
at java.net.PlainSocketImpl.connect(PlainSocketImpl.java:182)
at java.net.Socket.connect(Socket.java:519)
at sun.net.NetworkClient.doConnect(NetworkClient.java:152)
at sun.net.www.http.HttpClient.openServer(HttpClient.java:394)
at sun.net.www.http.HttpClient.openServer(HttpClient.java:529)
at sun.net.www.http.HttpClient.<init>(HttpClient.java:233)
at sun.net.www.http.HttpClient.New(HttpClient.java:306)
at sun.net.www.http.HttpClient.New(HttpClient.java:323)
at sun.net.www.protocol.http.HttpURLConnection.getNewHttpClient(HttpURLConnection.java:788)
at sun.net.www.protocol.http.HttpURLConnection.plainConnect(HttpURLConnection.java:729)
at sun.net.www.protocol.http.HttpURLConnection.connect(HttpURLConnection.java:654)
at org.apache.hadoop.mapred.ReduceTask$ReduceCopier$MapOutputCopier.getInputStream(ReduceTask.java:1312)
at org.apache.hadoop.mapred.ReduceTask$ReduceCopier$MapOutputCopier.getMapOutput(ReduceTask.java:1250)
at org.apache.hadoop.mapred.ReduceTask$ReduceCopier$MapOutputCopier.copyOutput(ReduceTask.java:1163)
at org.apache.hadoop.mapred.ReduceTask$ReduceCopier$MapOutputCopier.run(ReduceTask.java:1100)
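
One way a null host like the one in the log above can show up in Java is that
java.net.URI returns null from getHost() when the authority part cannot be
parsed as host[:port]. The sketch below only illustrates that behaviour; it is
an assumption, not the confirmed cause of this issue. The class name, the
helper method, and both example addresses (including the -1 port) are
hypothetical and do not come from the report.

```java
import java.net.URI;

// Hypothetical sketch, not Hadoop code: shows what java.net.URI reports as
// the host for a well-formed and a malformed tracker HTTP address.
final class NullHostSketch {

    // Returns the host component of the address, or null when
    // java.net.URI cannot parse the authority as a server (host[:port]).
    static String hostOf(String httpAddress) {
        return URI.create(httpAddress).getHost();
    }

    public static void main(String[] args) {
        // Assumed example addresses; the real values are not in the report.
        String good = "http://hostname:50060";
        String bad = "http://hostname:-1"; // "-1" is not a valid port number

        System.out.println(good + " -> host " + hostOf(good));
        System.out.println(bad + " -> host " + hostOf(bad));
    }
}
```

If an address were malformed like the second one, a caller reading the host
from it would see null, which would be consistent with the "adding host null
to penalty box" line, but that remains a guess until the actual address is
checked.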
> Wrong resolution of hostname and port
> --------------------------------------
>
> Key: HADOOP-4744
> URL: https://issues.apache.org/jira/browse/HADOOP-4744
> Project: Hadoop Core
> Issue Type: Bug
> Components: mapred
> Affects Versions: 0.20.0
> Reporter: Amareshwari Sriramadasu
> Priority: Critical
> Fix For: 0.20.0
>
>
> I noticed the following for one of the hosts in a cluster:
> 1. The machines.jsp page resolves the http address as just "http://hostname"
> (which doesn't work). It doesn't include the port number for the host. Even if
> I add the port number manually in the URI, the task tracker page does not come
> up.
> 2. All the tasks (both maps and reduces) that ran on the machine ran
> successfully. But the task logs cannot be viewed, because the port number is
> not resolved (same problem as in (1)).
> 3. The reducers waiting for maps that ran on that machine fail with connection
> failed errors saying the hostname is 'null'.