Re: Query about Speculative Execution

2014-12-05 Thread Srinivas Chamarthi
Hi, may I know where I can find the sources in which speculative scheduling happens? Also, how do we discard the output coming from an already completed mapper? I am actually trying to do something similar: spawn map tasks redundantly, not for speculative reasons but for each mappe

Re: Query about Speculative Execution

2012-12-06 Thread Ajay Srivastava
Thanks Mahesh & Harsh. On 07-Dec-2012, at 7:42 AM, Mahesh Balija wrote: Simply put, if you turn off speculative execution, the system will never bother about slow-running jobs unless they fail to report status within the specified time (10 minutes). If you have set speculative execution to true the

Re: Query about Speculative Execution

2012-12-06 Thread Mahesh Balija
Simply put, if you turn off speculative execution, the system will never bother about slow-running jobs unless they fail to report status within the specified time (10 minutes). If you have set speculative execution to true then the system may spawn another instance of mapper and consider the output of t
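
For reference, the two settings discussed above could be expressed as the following configuration fragment. This is a sketch that assumes the Hadoop 1.x (JobTracker-era) property names; later releases renamed these properties, so check the names against the version in use.

```xml
<!-- mapred-site.xml fragment; assumes Hadoop 1.x property names -->
<property>
  <name>mapred.map.tasks.speculative.execution</name>
  <value>false</value> <!-- do not launch speculative map attempts -->
</property>
<property>
  <name>mapred.reduce.tasks.speculative.execution</name>
  <value>false</value> <!-- do not launch speculative reduce attempts -->
</property>
<property>
  <name>mapred.task.timeout</name>
  <value>600000</value> <!-- milliseconds; the 10-minute default mentioned above -->
</property>
```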

Re: Query about Speculative Execution

2012-12-06 Thread Harsh J
Given that Speculative Execution *is* the answer to such scenarios, I'd say the answer to your question without it, is *nothing*. If a task does not report status for over 10 minutes (default), it is killed and retried. If it does report status changes (such as counters, task status, etc.) but is
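
The behavior described in this reply can be summarized as a toy decision function. This is only an illustrative model of the rules stated in the thread, not Hadoop code: the names `TaskAttempt`, `tracker_action`, and the returned labels are hypothetical, and the real scheduler weighs more factors than this.

```python
from dataclasses import dataclass

# 10 minutes, the default timeout quoted in the thread.
TASK_TIMEOUT_SECS = 600


@dataclass
class TaskAttempt:
    """Hypothetical view of one running task attempt."""
    secs_since_last_status: float  # time since the attempt last reported status
    is_slow: bool                  # still reporting, but far behind its peers


def tracker_action(attempt: TaskAttempt, speculative: bool) -> str:
    """Return a label for what the framework does with this attempt."""
    # A task that stays silent past the timeout is killed and retried,
    # regardless of the speculative execution setting.
    if attempt.secs_since_last_status > TASK_TIMEOUT_SECS:
        return "kill_and_retry"
    # A slow task that keeps reporting is only helped when speculation is on.
    if attempt.is_slow and speculative:
        return "may_launch_speculative_attempt"
    # Otherwise nothing happens: the framework simply waits.
    return "wait"
```

With speculative execution off, a slow task that keeps reporting status falls through to `"wait"`, which matches the "*nothing*" answer given above.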

Query about Speculative Execution

2012-12-06 Thread Ajay Srivastava
Hi, What is the behavior of the JobTracker if speculative execution is off and a task on a data node is running extremely slowly? Will the JobTracker simply wait until the slow-running task finishes, or will it try to heal the situation? Assuming that heartbeats from the node running the slow task are regular