[ http://issues.apache.org/jira/browse/HADOOP-816?page=comments#action_12458018 ]

Devaraj Das commented on HADOOP-816:
------------------------------------

For some reason I was under the impression that the default dfs block size was
128 MB, which is why I was concerned about the sort benchmark performance.
As Owen suggested, I will just log a message whenever a spill happens on the Map
side.
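
To illustrate the kind of logging meant here, a minimal sketch follows. It is
not the code in 816.patch and not Hadoop's actual map output buffer: the class
SpillLoggingBuffer, its methods, and the byte-counting model are all
hypothetical, and a real spill would sort and write the buffered records to
disk rather than just reset a counter.

```java
import java.util.logging.Logger;

/** Hypothetical sketch only; not Hadoop's real map-side buffer. */
final class SpillLoggingBuffer {
    private static final Logger LOG =
        Logger.getLogger(SpillLoggingBuffer.class.getName());

    private final long capacityBytes; // assumed fixed buffer size
    private long usedBytes = 0;
    private int spillCount = 0;

    SpillLoggingBuffer(long capacityBytes) {
        if (capacityBytes <= 0) {
            throw new IllegalArgumentException("capacityBytes must be positive");
        }
        this.capacityBytes = capacityBytes;
    }

    /** Accounts for one record; spills first if the record would not fit. */
    void collect(long recordBytes) {
        if (usedBytes > 0 && usedBytes + recordBytes > capacityBytes) {
            spill();
        }
        usedBytes += recordBytes;
    }

    /** Logs the spill and empties the buffer. */
    private void spill() {
        spillCount++;
        LOG.info("Map-side spill #" + spillCount + ": " + usedBytes
            + " of " + capacityBytes + " buffer bytes in use");
        // A real implementation would sort and write records to disk here.
        usedBytes = 0;
    }

    int getSpillCount() {
        return spillCount;
    }
}
```

With such a message in the task logs, anyone running the sort benchmark can see
whether the configured buffer was too small for the map output.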

> Allow the sort benchmark to set a buffersize for the map buffer
> ---------------------------------------------------------------
>
>                 Key: HADOOP-816
>                 URL: http://issues.apache.org/jira/browse/HADOOP-816
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Devaraj Das
>         Assigned To: Devaraj Das
>         Attachments: 816.patch
>
>
> Discovered that framework merges are the hotspots where most time is spent in 
> the sort benchmark. With HADOOP-331, the Map phase could potentially do a 
> merge of the spills (this merge was not done pre-HADOOP-331), and then there 
> is one compulsory merge on each reduce. It may be good to avoid the merge in 
> the Map phase, if possible.
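
The reasoning in the description can be sketched with some simple arithmetic.
This is a simplified model, not the framework's actual accounting: it assumes
every write-out of the buffer (including the final flush) produces one spill
file, and that a map-side merge is needed only when there is more than one
such file. The class SpillEstimate and its methods are hypothetical.

```java
/** Hypothetical estimate of map-side spills under a simplified model. */
final class SpillEstimate {
    private SpillEstimate() {}

    /** Number of spill files, counting the final flush: ceil(output / buffer). */
    static long estimatedSpills(long mapOutputBytes, long bufferBytes) {
        if (bufferBytes <= 0) {
            throw new IllegalArgumentException("bufferBytes must be positive");
        }
        if (mapOutputBytes <= 0) {
            return 0;
        }
        return (mapOutputBytes + bufferBytes - 1) / bufferBytes;
    }

    /** Under this model a merge is needed only if there are several spills. */
    static boolean needsMapSideMerge(long mapOutputBytes, long bufferBytes) {
        return estimatedSpills(mapOutputBytes, bufferBytes) > 1;
    }
}
```

For example, with made-up sizes, a map producing 128 MB of output through a
100 MB buffer would spill twice and need a merge, while a buffer at least as
large as the map output would spill once and skip the map-side merge.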

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
