[ http://issues.apache.org/jira/browse/HADOOP-816?page=comments#action_12458018 ]

Devaraj Das commented on HADOOP-816:
------------------------------------

Well, for some reason, I had the impression that the default dfs block size is 128 MB, and that is why I was bothered about the sort benchmark performance. As Owen suggested, I will just put a log message if a spill happens on the Map side.

> Allow the sort benchmark to set a buffersize for the map buffer
> ---------------------------------------------------------------
>
> Key: HADOOP-816
> URL: http://issues.apache.org/jira/browse/HADOOP-816
> Project: Hadoop
> Issue Type: Improvement
> Components: mapred
> Reporter: Devaraj Das
> Assigned To: Devaraj Das
> Attachments: 816.patch
>
>
> Discovered that framework merges are the hotspots where most time is spent in
> the sort benchmark. With HADOOP-331, the Map phase could potentially do a
> merge of the spills (this merge was not done pre-HADOOP-331), and then there
> is one compulsory merge on each reduce. It may be good to avoid the merge in
> the Map phase, if possible.
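
The relationship between block size, map buffer size, and the map-side merge can be illustrated with a rough sketch. This is not the code in 816.patch or Hadoop's MapTask. The class `SpillSketch` and its methods are hypothetical, and the model assumes that the map output is about as large as the input block and that the buffer spills each time it fills, ignoring any per-record overhead.

```java
import java.util.logging.Logger;

/**
 * Hypothetical sketch of the spill reasoning; not Hadoop's actual implementation.
 * Assumption: map output bytes ~= input block bytes, and the map buffer
 * is spilled to disk every time it fills up (plus one final flush).
 */
public class SpillSketch {
    private static final Logger LOG = Logger.getLogger(SpillSketch.class.getName());

    /** Estimated number of spills: ceiling of mapOutputBytes / bufferBytes. */
    public static long estimateSpills(long mapOutputBytes, long bufferBytes) {
        if (bufferBytes <= 0) {
            throw new IllegalArgumentException("bufferBytes must be positive");
        }
        if (mapOutputBytes <= 0) {
            return 0;
        }
        return (mapOutputBytes + bufferBytes - 1) / bufferBytes;
    }

    /**
     * A map-side merge is only needed when there is more than one spill.
     * Logs a message in that case, in the spirit of the comment above.
     */
    public static boolean needsMapSideMerge(long mapOutputBytes, long bufferBytes) {
        long spills = estimateSpills(mapOutputBytes, bufferBytes);
        if (spills > 1) {
            LOG.info("Map side spilled " + spills + " times; a merge of the spills is needed");
        }
        return spills > 1;
    }

    public static void main(String[] args) {
        long mb = 1024L * 1024L;
        // A 128 MB block with a 100 MB buffer (illustrative sizes) spills twice.
        System.out.println(needsMapSideMerge(128 * mb, 100 * mb)); // prints "true"
        // A 64 MB block fits in the same buffer, so there is a single spill and no merge.
        System.out.println(needsMapSideMerge(64 * mb, 100 * mb));  // prints "false"
    }
}
```

Under these assumptions, a buffer at least as large as the map output would avoid the map-side merge entirely, which is the case the benchmark setting is meant to allow.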
