[jira] [Updated] (MAPREDUCE-3490) RMContainerAllocator counts failed maps towards Reduce ramp up

2011-12-28 Thread Sharad Agarwal (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sharad Agarwal updated MAPREDUCE-3490:
--

Attachment: MR-3490-alternate1.patch

 RMContainerAllocator counts failed maps towards Reduce ramp up
 --

 Key: MAPREDUCE-3490
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3490
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mr-am, mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Arun C Murthy
Priority: Blocker
 Attachments: MAPREDUCE-3490.patch, MAPREDUCE-3490.patch, 
 MAPREDUCE-3490.patch, MAPREDUCE-3490.patch, MR-3490-alternate.patch, 
 MR-3490-alternate1.patch


 The RMContainerAllocator does not differentiate between failed and successful 
 maps while calculating whether reduce tasks are ready to launch. Failed tasks 
 are also counted towards total completed tasks. 
 Example. 4 failed maps, 10 total maps. Map%complete = 4/14 * 100 instead of 
 being 0.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-3490) RMContainerAllocator counts failed maps towards Reduce ramp up

2011-12-22 Thread Sharad Agarwal (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sharad Agarwal updated MAPREDUCE-3490:
--

Attachment: MR-3490-alternate.patch

Here is the quick patch for illustration.

 RMContainerAllocator counts failed maps towards Reduce ramp up
 --

 Key: MAPREDUCE-3490
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3490
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mr-am, mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Arun C Murthy
Priority: Blocker
 Attachments: MAPREDUCE-3490.patch, MAPREDUCE-3490.patch, 
 MAPREDUCE-3490.patch, MAPREDUCE-3490.patch, MR-3490-alternate.patch


 The RMContainerAllocator does not differentiate between failed and successful 
 maps while calculating whether reduce tasks are ready to launch. Failed tasks 
 are also counted towards total completed tasks. 
 Example. 4 failed maps, 10 total maps. Map%complete = 4/14 * 100 instead of 
 being 0.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2708) [MR-279] Design and implement MR Application Master recovery

2011-10-18 Thread Sharad Agarwal (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2708?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sharad Agarwal updated MAPREDUCE-2708:
--

Status: Patch Available  (was: Open)

 [MR-279] Design and implement MR Application Master recovery
 

 Key: MAPREDUCE-2708
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2708
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: applicationmaster, mrv2
Affects Versions: 0.23.0
Reporter: Sharad Agarwal
Assignee: Sharad Agarwal
Priority: Blocker
 Fix For: 0.23.0

 Attachments: mr2708_v1.patch, mr2708_v2.patch


 Design recovery of MR AM from crashes/node failures. The running job should 
 recover from the state it left off.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2708) [MR-279] Design and implement MR Application Master recovery

2011-10-17 Thread Sharad Agarwal (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2708?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sharad Agarwal updated MAPREDUCE-2708:
--

Attachment: mr2708_v2.patch

Rebased to latest 23 branch. All hadoop-mapreduce-client passing.

 [MR-279] Design and implement MR Application Master recovery
 

 Key: MAPREDUCE-2708
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2708
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: applicationmaster, mrv2
Affects Versions: 0.23.0
Reporter: Sharad Agarwal
Assignee: Sharad Agarwal
Priority: Blocker
 Fix For: 0.23.0

 Attachments: mr2708_v1.patch, mr2708_v2.patch


 Design recovery of MR AM from crashes/node failures. The running job should 
 recover from the state it left off.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira