[ 
https://issues.apache.org/jira/browse/HADOOP-1650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12554868
 ] 

Mukund Madhugiri commented on HADOOP-1650:
------------------------------------------

I ran the sort benchmark on 500 nodes and here is the data:

1. Sort 500 using trunk:
- Random writer: 19 mins
- Sort: 79 mins
- Sort validation: 19 mins

2. Sort 500 using trunk + patch
- Random Writer: 24 mins
- Sort: 111 mins
- Sort validation: 17 mins

I see some errors/exceptions in the JT logs:

1. See a lot of these during the random writer job:
2007-12-28 22:31:02,456 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
from task_200712282210_0001_m_004256_0: org.apache.hadoop.ipc.RemoteException: 
org.apache.hadoop.dfs
.NotReplicatedYetException: Not replicated 
yet:/user/hadoopqa/sortBenchmark500/input/_task_200712282210_0001_m_004256_0/part-04256
2007-12-28 22:31:04,381 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
from task_200712282210_0001_m_004279_0: org.apache.hadoop.ipc.RemoteException: 
org.apache.hadoop.dfs
.NotReplicatedYetException: Not replicated 
yet:/user/hadoopqa/sortBenchmark500/input/_task_200712282210_0001_m_004279_0/part-04279

2. See a log of these during the random writer job:
2007-12-28 22:31:57,715 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
from task_200712282210_0001_m_004485_0: java.net.SocketTimeoutException: timed 
out waiting for rpc r
esponse
2007-12-28 22:31:58,368 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
from task_200712282210_0001_m_004272_0: java.net.SocketTimeoutException: timed 
out waiting for rpc r
esponse

3. See a couple of these during the sort job:
2007-12-28 22:37:43,783 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
from task_200712282210_0002_m_002442_0: java.lang.OutOfMemoryError: Java heap 
space
        at java.util.Arrays.copyOf(Arrays.java:2786)
        at java.io.ByteArrayOutputStream.write(ByteArrayOutputStream.java:94)
        at java.io.DataOutputStream.write(DataOutputStream.java:90)
        at org.apache.hadoop.io.BytesWritable.write(BytesWritable.java:137)
        at 
org.apache.hadoop.mapred.MapTask$MapOutputBuffer.collect(MapTask.java:373)
        at 
org.apache.hadoop.mapred.lib.IdentityMapper.map(IdentityMapper.java:40)
        at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:50)
        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:208)
        at 
org.apache.hadoop.mapred.TaskTracker$Child.main(TaskTracker.java:2043)


> Upgrade Jetty to 6.x
> --------------------
>
>                 Key: HADOOP-1650
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1650
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Devaraj Das
>            Assignee: Devaraj Das
>         Attachments: hadoop-1650-jetty6.1.5.patch, 
> hadoop-jetty6.1.4-lib.tar.gz, hadoop-jetty6.1.6-lib.tar.gz, 
> jetty-hadoop-6.1.6.patch, jetty-hbase.patch, jetty6.1.4.patch, 
> jetty6.1.6.patch
>
>
> This is the third attempt at moving to jetty6. Apparently, the jetty-6.1.4 
> has fixed some of the issues we discovered in jetty during HADOOP-736 and 
> HADOOP-1273. I'd like to keep this issue open for sometime so that we have 
> enough time to test out things.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to