[ https://issues.apache.org/jira/browse/MAPREDUCE-7261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ahmed Hussein updated MAPREDUCE-7261: ------------------------------------- Attachment: MAPREDUCE-7261.003.patch > Memory efficiency in speculator > -------------------------------- > > Key: MAPREDUCE-7261 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-7261 > Project: Hadoop Map/Reduce > Issue Type: Improvement > Reporter: Ahmed Hussein > Assignee: Ahmed Hussein > Priority: Minor > Attachments: MAPREDUCE-7261.001.patch, MAPREDUCE-7261.002.patch, > MAPREDUCE-7261.003.patch > > > The data structures in speculator and runtime-estimator are bloating. Data > elements such as (taskID, TA-ID, task stats, tasks speculated, tasks > finished..etc) are added to the concurrent maps but never removed. > For long running jobs, there are couple of issues: > # memory leakage: the speculator memory usage increases over time. > # performance: keeping large structures in the heap affects the performance > due to locality and cache misses. > *Suggested Fixes:* > - When a TA transitions to {{MoveContainerToSucceededFinishingTransition}}, > the TA notifies the speculator. The latter handles the event by cleaning the > internal structure accordingly. > - When a task transitions is failed/killed, the speculator is notified to > clean the internal data structure. > -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org