[ https://issues.apache.org/jira/browse/HADOOP-1650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12554868 ]
Mukund Madhugiri commented on HADOOP-1650:
------------------------------------------
I ran the sort benchmark on 500 nodes and here is the data:
1. Sort 500 using trunk:
- Random writer: 19 mins
- Sort: 79 mins
- Sort validation: 19 mins
2. Sort 500 using trunk + patch:
- Random writer: 24 mins
- Sort: 111 mins
- Sort validation: 17 mins
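To put the timings above in relative terms, here is a small Python sketch that computes the percentage change per phase. The dictionary and function names are illustrative only; the numbers are the ones reported above.

```python
# Timings in minutes, copied from the benchmark results above.
trunk = {"Random writer": 19, "Sort": 79, "Sort validation": 19}
patched = {"Random writer": 24, "Sort": 111, "Sort validation": 17}


def percent_change(before, after):
    """Relative change from `before` to `after`, in percent."""
    return (after - before) / before * 100.0


for phase in trunk:
    change = percent_change(trunk[phase], patched[phase])
    print(f"{phase}: {trunk[phase]} -> {patched[phase]} mins ({change:+.1f}%)")
```

By this calculation the sort phase is roughly 40% slower with the patch, the random writer is about 26% slower, and sort validation is about 10% faster.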
I see some errors/exceptions in the JT logs:
1. See a lot of these during the random writer job:
2007-12-28 22:31:02,456 INFO org.apache.hadoop.mapred.TaskInProgress: Error from task_200712282210_0001_m_004256_0: org.apache.hadoop.ipc.RemoteException: org.apache.hadoop.dfs.NotReplicatedYetException: Not replicated yet:/user/hadoopqa/sortBenchmark500/input/_task_200712282210_0001_m_004256_0/part-04256
2007-12-28 22:31:04,381 INFO org.apache.hadoop.mapred.TaskInProgress: Error from task_200712282210_0001_m_004279_0: org.apache.hadoop.ipc.RemoteException: org.apache.hadoop.dfs.NotReplicatedYetException: Not replicated yet:/user/hadoopqa/sortBenchmark500/input/_task_200712282210_0001_m_004279_0/part-04279
2. See a lot of these during the random writer job:
2007-12-28 22:31:57,715 INFO org.apache.hadoop.mapred.TaskInProgress: Error from task_200712282210_0001_m_004485_0: java.net.SocketTimeoutException: timed out waiting for rpc response
2007-12-28 22:31:58,368 INFO org.apache.hadoop.mapred.TaskInProgress: Error from task_200712282210_0001_m_004272_0: java.net.SocketTimeoutException: timed out waiting for rpc response
3. See a couple of these during the sort job:
2007-12-28 22:37:43,783 INFO org.apache.hadoop.mapred.TaskInProgress: Error from task_200712282210_0002_m_002442_0: java.lang.OutOfMemoryError: Java heap space
    at java.util.Arrays.copyOf(Arrays.java:2786)
    at java.io.ByteArrayOutputStream.write(ByteArrayOutputStream.java:94)
    at java.io.DataOutputStream.write(DataOutputStream.java:90)
    at org.apache.hadoop.io.BytesWritable.write(BytesWritable.java:137)
    at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.collect(MapTask.java:373)
    at org.apache.hadoop.mapred.lib.IdentityMapper.map(IdentityMapper.java:40)
    at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:50)
    at org.apache.hadoop.mapred.MapTask.run(MapTask.java:208)
    at org.apache.hadoop.mapred.TaskTracker$Child.main(TaskTracker.java:2043)
> Upgrade Jetty to 6.x
> --------------------
>
> Key: HADOOP-1650
> URL: https://issues.apache.org/jira/browse/HADOOP-1650
> Project: Hadoop
> Issue Type: Improvement
> Components: mapred
> Reporter: Devaraj Das
> Assignee: Devaraj Das
> Attachments: hadoop-1650-jetty6.1.5.patch,
> hadoop-jetty6.1.4-lib.tar.gz, hadoop-jetty6.1.6-lib.tar.gz,
> jetty-hadoop-6.1.6.patch, jetty-hbase.patch, jetty6.1.4.patch,
> jetty6.1.6.patch
>
>
> This is the third attempt at moving to jetty6. Apparently, jetty-6.1.4
> has fixed some of the issues we discovered in jetty during HADOOP-736 and
> HADOOP-1273. I'd like to keep this issue open for some time so that we have
> enough time to test things out.