[jira] [Commented] (MAPREDUCE-3315) Master-Worker Application on YARN

2012-04-04 Thread Sharad Agarwal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAPREDUCE-3315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13246166#comment-13246166 ] Sharad Agarwal commented on MAPREDUCE-3315: --- Thanks Nikhil for the patch.

[jira] [Commented] (MAPREDUCE-3315) Master-Worker Application on YARN

2012-04-03 Thread Sharad Agarwal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAPREDUCE-3315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13245108#comment-13245108 ] Sharad Agarwal commented on MAPREDUCE-3315: --- can have it in

[jira] [Commented] (MAPREDUCE-3315) Master-Worker Application on YARN

2012-03-22 Thread Sharad Agarwal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAPREDUCE-3315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13235534#comment-13235534 ] Sharad Agarwal commented on MAPREDUCE-3315: --- bq. Should I use Hadoop IPC or

[jira] [Commented] (MAPREDUCE-3846) Restarted+Recovered AM hangs in some corner cases

2012-02-13 Thread Sharad Agarwal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAPREDUCE-3846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13206860#comment-13206860 ] Sharad Agarwal commented on MAPREDUCE-3846: --- looks good. we should add a

[jira] [Commented] (MAPREDUCE-3858) Task attempt failure during commit results in task never completing

2012-02-13 Thread Sharad Agarwal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAPREDUCE-3858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13207527#comment-13207527 ] Sharad Agarwal commented on MAPREDUCE-3858: --- +1 looks good. Thanks Tom for

[jira] [Commented] (MAPREDUCE-3846) Restarted+Recovered AM hangs in some corner cases

2012-02-10 Thread Sharad Agarwal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAPREDUCE-3846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13205325#comment-13205325 ] Sharad Agarwal commented on MAPREDUCE-3846: --- should this be marked as

[jira] [Commented] (MAPREDUCE-3802) If an MR AM dies twice it looks like the process freezes

2012-02-07 Thread Sharad Agarwal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAPREDUCE-3802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13203325#comment-13203325 ] Sharad Agarwal commented on MAPREDUCE-3802: --- bq. need to understand a little

[jira] [Commented] (MAPREDUCE-3802) If an MR AM dies twice it looks like the process freezes

2012-02-07 Thread Sharad Agarwal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAPREDUCE-3802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13203332#comment-13203332 ] Sharad Agarwal commented on MAPREDUCE-3802: --- The bug is in TaskImpl over

[jira] [Commented] (MAPREDUCE-3711) AppMaster recovery for Medium to large jobs take long time

2012-01-30 Thread Sharad Agarwal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAPREDUCE-3711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13196747#comment-13196747 ] Sharad Agarwal commented on MAPREDUCE-3711: --- bq. That is the real bug. We

[jira] [Commented] (MAPREDUCE-3634) All daemons should crash instead of hanging around when their EventHandlers get exceptions

2012-01-29 Thread Sharad Agarwal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAPREDUCE-3634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13195964#comment-13195964 ] Sharad Agarwal commented on MAPREDUCE-3634: --- Can we set

[jira] [Commented] (MAPREDUCE-3489) EventDispatcher should have a call-back on errors for aiding tests

2012-01-27 Thread Sharad Agarwal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAPREDUCE-3489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13194633#comment-13194633 ] Sharad Agarwal commented on MAPREDUCE-3489: --- Currently in the

[jira] [Commented] (MAPREDUCE-3711) AppMaster recovery for Medium to large jobs take long time

2012-01-26 Thread Sharad Agarwal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAPREDUCE-3711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13194486#comment-13194486 ] Sharad Agarwal commented on MAPREDUCE-3711: --- Karam, can you upload the AM

[jira] [Commented] (MAPREDUCE-3490) RMContainerAllocator counts failed maps towards Reduce ramp up

2011-12-28 Thread Sharad Agarwal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAPREDUCE-3490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13176621#comment-13176621 ] Sharad Agarwal commented on MAPREDUCE-3490: --- currently all bookkeeping and

[jira] [Commented] (MAPREDUCE-3490) RMContainerAllocator counts failed maps towards Reduce ramp up

2011-12-28 Thread Sharad Agarwal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAPREDUCE-3490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13176623#comment-13176623 ] Sharad Agarwal commented on MAPREDUCE-3490: --- Note: I am on vacation rest of

[jira] [Commented] (MAPREDUCE-3490) RMContainerAllocator counts failed maps towards Reduce ramp up

2011-12-22 Thread Sharad Agarwal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAPREDUCE-3490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13174797#comment-13174797 ] Sharad Agarwal commented on MAPREDUCE-3490: --- Hi Arun - just had a brief look

[jira] [Commented] (MAPREDUCE-3490) RMContainerAllocator counts failed maps towards Reduce ramp up

2011-12-22 Thread Sharad Agarwal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAPREDUCE-3490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13175319#comment-13175319 ] Sharad Agarwal commented on MAPREDUCE-3490: --- bq. I think we need to stop

[jira] [Commented] (MAPREDUCE-3473) A single task tracker failure shouldn't result in Job failure

2011-12-21 Thread Sharad Agarwal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAPREDUCE-3473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13174636#comment-13174636 ] Sharad Agarwal commented on MAPREDUCE-3473: --- a single machine failure

[jira] [Commented] (MAPREDUCE-3489) Unit tests failing silently

2011-12-01 Thread Sharad Agarwal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAPREDUCE-3489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13160716#comment-13160716 ] Sharad Agarwal commented on MAPREDUCE-3489: --- Definitely System.exit is not

[jira] [Commented] (MAPREDUCE-3473) Task failures shouldn't result in Job failures

2011-11-29 Thread Sharad Agarwal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAPREDUCE-3473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13159158#comment-13159158 ] Sharad Agarwal commented on MAPREDUCE-3473: --- note this is *task* failure NOT

[jira] [Commented] (MAPREDUCE-3402) AMScalability test of Sleep job with 100K 1-sec maps regressed into running very slowly

2011-11-15 Thread Sharad Agarwal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAPREDUCE-3402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13151029#comment-13151029 ] Sharad Agarwal commented on MAPREDUCE-3402: --- just fyi

[jira] [Commented] (MAPREDUCE-3315) Master-Worker Application on YARN

2011-10-31 Thread Sharad Agarwal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAPREDUCE-3315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13140059#comment-13140059 ] Sharad Agarwal commented on MAPREDUCE-3315: --- some thoughts: - AM decides the

[jira] [Commented] (MAPREDUCE-3274) Race condition in MR App Master Preemtion can cause a dead lock

2011-10-27 Thread Sharad Agarwal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAPREDUCE-3274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13136881#comment-13136881 ] Sharad Agarwal commented on MAPREDUCE-3274: --- JVM with ID:

[jira] [Commented] (MAPREDUCE-2708) [MR-279] Design and implement MR Application Master recovery

2011-10-23 Thread Sharad Agarwal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAPREDUCE-2708?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13133597#comment-13133597 ] Sharad Agarwal commented on MAPREDUCE-2708: --- bq. and the AM restarted with

[jira] [Commented] (MAPREDUCE-2708) [MR-279] Design and implement MR Application Master recovery

2011-10-21 Thread Sharad Agarwal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAPREDUCE-2708?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13132806#comment-13132806 ] Sharad Agarwal commented on MAPREDUCE-2708: --- bq. You write extremely

[jira] [Commented] (MAPREDUCE-2708) [MR-279] Design and implement MR Application Master recovery

2011-10-17 Thread Sharad Agarwal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAPREDUCE-2708?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13128689#comment-13128689 ] Sharad Agarwal commented on MAPREDUCE-2708: --- bq. All hadoop-mapreduce-client

[jira] [Commented] (MAPREDUCE-2708) [MR-279] Design and implement MR Application Master recovery

2011-10-14 Thread Sharad Agarwal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAPREDUCE-2708?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13127307#comment-13127307 ] Sharad Agarwal commented on MAPREDUCE-2708: --- Lot of conflict while merging.

[jira] [Commented] (MAPREDUCE-2702) [MR-279] OutputCommitter changes for MR Application Master recovery

2011-10-05 Thread Sharad Agarwal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAPREDUCE-2702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13120708#comment-13120708 ] Sharad Agarwal commented on MAPREDUCE-2702: --- bq. uniting isRecoverySupported

[jira] [Commented] (MAPREDUCE-2702) [MR-279] OutputCommitter changes for MR Application Master recovery

2011-09-28 Thread Sharad Agarwal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAPREDUCE-2702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13116228#comment-13116228 ] Sharad Agarwal commented on MAPREDUCE-2702: --- can be done as separate jira

[jira] [Commented] (MAPREDUCE-2693) NPE in AM causes it to lose containers which are never returned back to RM

2011-09-28 Thread Sharad Agarwal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAPREDUCE-2693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13117006#comment-13117006 ] Sharad Agarwal commented on MAPREDUCE-2693: --- Yes this bug is valid but only