Hang JobTracker, running out of memory
--------------------------------------
Key: HADOOP-2128
URL: https://issues.apache.org/jira/browse/HADOOP-2128
Project: Hadoop
Issue Type: Bug
Components: mapred
Affects Versions: 0.14.3
Reporter: Koji Noguchi
This may be expected behavior.
The JobTracker hung with a 1 GB heap size; top showed 99% CPU.
I ran about 80 jobs, each with 2500 mappers and 200 reducers (about 200k tasks
in total). They finish quite fast, 3-4 minutes per job on average.
How much memory does the JobTracker use for 'completed' (but not expired) jobs?
jmap -heap showed
{noformat}
...
PS Old Generation
capacity = 932118528 (888.9375MB)
used = 932118528 (888.9375MB)
...
{noformat}
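As a rough back-of-the-envelope check on the question above, the sketch below divides the old-generation usage reported by jmap by the approximate number of tasks. It assumes that essentially all of the old generation is retained per-task state from completed jobs, which has not been verified; the variable names are illustrative only.

```python
# Rough estimate of retained heap per task; a sketch, not a measurement.
jobs = 80                  # "about 80 jobs" from the report
maps_per_job = 2500
reduces_per_job = 200
old_gen_used = 932118528   # bytes, "used" from the jmap -heap output

total_tasks = jobs * (maps_per_job + reduces_per_job)

# Assumption: the whole old generation is held by per-task state.
bytes_per_task = old_gen_used / total_tasks

print(total_tasks)            # 216000
print(round(bytes_per_task))  # 4315
```

Under that assumption, each task would account for roughly 4 KB of old-generation heap.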
jmap -histo showed
{noformat}
num   #instances      #bytes  class name
--------------------------------------
  1:     3974182   355869992  [C
  2:     5216606   125198544  java.lang.String
  3:     2238560   107450880  java.util.TreeMap
  4:      463206   101673488  [B
  5:     1979995    63359840  java.util.TreeMap$Entry
  6:      248400    35769600  org.apache.hadoop.mapred.TaskInProgress
  7:      308803    30898112  [Ljava.lang.Object;
  8:      978240    23477760  org.apache.hadoop.mapred.Counters$CounterRec
  9:      249876    19990080  org.apache.hadoop.mapred.TaskStatus
 10:      248836    19906880  java.net.URI
 11:      230337    16584264  org.apache.hadoop.mapred.MapTask
...
{noformat}
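To make the histogram easier to reason about, here is a minimal sketch that parses the rows shown above and prints bytes per instance and each class's share of the listed bytes. The helper name `parse_histo` and the regular expression are hypothetical, written only for this report's row layout; the shares are relative to the listed rows, not to the whole heap.

```python
import re

# Top entries copied from the jmap -histo output in this report.
HISTO = """\
1: 3974182 355869992 [C
2: 5216606 125198544 java.lang.String
3: 2238560 107450880 java.util.TreeMap
4: 463206 101673488 [B
5: 1979995 63359840 java.util.TreeMap$Entry
6: 248400 35769600 org.apache.hadoop.mapred.TaskInProgress
7: 308803 30898112 [Ljava.lang.Object;
8: 978240 23477760 org.apache.hadoop.mapred.Counters$CounterRec
9: 249876 19990080 org.apache.hadoop.mapred.TaskStatus
10: 248836 19906880 java.net.URI
11: 230337 16584264 org.apache.hadoop.mapred.MapTask
"""

# rank: instances bytes class-name
ROW = re.compile(r"^\s*(\d+):\s+(\d+)\s+(\d+)\s+(\S+)\s*$")

def parse_histo(text):
    """Return a list of (class_name, instances, total_bytes) tuples."""
    rows = []
    for line in text.splitlines():
        m = ROW.match(line)
        if m:  # skips headers, separators and "..." lines
            rows.append((m.group(4), int(m.group(2)), int(m.group(3))))
    return rows

rows = parse_histo(HISTO)
listed_bytes = sum(size for _, _, size in rows)

for name, count, size in rows:
    share = 100 * size / listed_bytes
    print(f"{name:46s} {size / count:8.1f} B/instance {share:5.1f}% of listed")
```

On the rows shown, [C and java.lang.String together come to 481068536 of the 900179440 listed bytes, so string data dominates this part of the histogram.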
The log shows many 'heartbeat discarded' messages
{noformat}
2007-10-30 22:55:46,912 WARN org.apache.hadoop.ipc.Server: IPC Server handler 6 on 58567, call heartbeat([EMAIL PROTECTED], false, true, 3942) from 99.99.99.99:9999 discarded for being too old (2578616)
{noformat}
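To count or summarize these warnings, a small parser along the following lines may help. This is a sketch: the regular expression and the function name `parse_discarded` are hypothetical, and the conversion to minutes assumes the number in parentheses is the call's age in milliseconds.

```python
import re

# The warning from this report, joined onto one line.
LINE = (
    "2007-10-30 22:55:46,912 WARN org.apache.hadoop.ipc.Server: "
    "IPC Server handler 6 on 58567, call heartbeat([EMAIL PROTECTED], "
    "false, true, 3942) from 99.99.99.99:9999 "
    "discarded for being too old (2578616)"
)

DISCARDED = re.compile(
    r"call (\w+)\(.*\) from (\S+) discarded for being too old \((\d+)\)"
)

def parse_discarded(line):
    """Return (method, client, age) for a discarded-call warning, else None."""
    m = DISCARDED.search(line)
    if m is None:
        return None
    return m.group(1), m.group(2), int(m.group(3))

method, client, age = parse_discarded(LINE)

# Assumption: the age is reported in milliseconds.
age_minutes = round(age / 60000, 1)
print(method, client, age, age_minutes)
```

If the age is indeed in milliseconds, this heartbeat waited about 43 minutes before being discarded, which fits a JobTracker that has stopped making progress.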
Is the solution either to increase the JobTracker heap size or to set a shorter
'mapred.userlog.retain.hours'?