[ https://issues.apache.org/jira/browse/MAPREDUCE-4301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Thomas Graves updated MAPREDUCE-4301: ------------------------------------- Resolution: Fixed Fix Version/s: 0.23.3 Status: Resolved (was: Patch Available) +1 Thanks Bobby. I committed this to trunk, branch-2, and branch-23 > Dedupe some strings in MRAM for memory savings > ---------------------------------------------- > > Key: MAPREDUCE-4301 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-4301 > Project: Hadoop Map/Reduce > Issue Type: Improvement > Components: applicationmaster > Affects Versions: 0.23.3, 2.0.0-alpha > Reporter: Robert Joseph Evans > Assignee: Robert Joseph Evans > Fix For: 0.23.3 > > Attachments: MR-4301.txt > > > Recently an OutOfMemoryError caused one of our jobs to become a zombie > (MAPREDUCE-4300). It was a rather large job with 78000+ map tasks and only > 750MB of heap configured. I took a heap dump to see if there were any > obvious memory leaks, and I could not find any, but yourkit and some digging > found some potential memory optimizations that we could do. > In this particular case we could save about 20MB if > SplitMetaInfoReader.readSplitMetaInfo only computed the JobSplitFile once > instead of for each split. (a 2 line change) > I will look into some others and see if there are more savings I can come up > with. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira