[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-22 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/spark/pull/9154


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-150049165
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/44093/
Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-150049161
  
Merged build finished. Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-21 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-150048853
  
**[Test build #44093 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/44093/consoleFull)**
 for PR 9154 at commit 
[`4394f00`](https://github.com/apache/spark/commit/4394f003f8950900eec1e27972083a2cf03f838b).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-21 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-150025058
  
**[Test build #44093 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/44093/consoleFull)**
 for PR 9154 at commit 
[`4394f00`](https://github.com/apache/spark/commit/4394f003f8950900eec1e27972083a2cf03f838b).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-150024356
  
 Merged build triggered.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-150024378
  
Merged build started.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-21 Thread kayousterhout
Github user kayousterhout commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-150024132
  
Jenkins, retest this please


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-150009626
  
Test FAILed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/44079/
Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-150009623
  
Merged build finished. Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-21 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-150009550
  
**[Test build #44079 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/44079/consoleFull)**
 for PR 9154 at commit 
[`4394f00`](https://github.com/apache/spark/commit/4394f003f8950900eec1e27972083a2cf03f838b).
 * This patch **fails Spark unit tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-149984317
  
Merged build started.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-21 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-149985382
  
**[Test build #44079 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/44079/consoleFull)**
 for PR 9154 at commit 
[`4394f00`](https://github.com/apache/spark/commit/4394f003f8950900eec1e27972083a2cf03f838b).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-149984278
  
 Merged build triggered.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-20 Thread markhamstra
Github user markhamstra commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-149669055
  
This LGTM.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-20 Thread markhamstra
Github user markhamstra commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-149667908
  
> what's the point of keeping a pending list for a dead executor?

I haven't done enough code spelunking to know, but I'm wondering whether 
the pending list for a dead executor may become useful for "I'm not dead" 
executors.  [An extended Monty Python quote is very tempting, and surprisingly 
relevant, at this point.]  It's entirely possible for executors to miss 
heartbeats (usually because they have ingested some bad user code that is now 
consuming most of their CPU resources) and appear dead to the master, only to 
arise from the dead a short time later (and potentially promptly miss another 
heartbeat.)  If the before-you-were-dead pending task lists are or should be 
reattached to the Lazurus executors, then there potentially is a point to 
keeping them around.

On the other hand, the "Ah, thank you very much" approach of clubbing the 
"not dead" is often what ends up needing to be done manually for these 
executors that refuse to die, so trying to complete their resurrection and make 
them useful again may be a fool's errand regardless.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-20 Thread markhamstra
Github user markhamstra commented on a diff in the pull request:

https://github.com/apache/spark/pull/9154#discussion_r42534219
  
--- Diff: 
core/src/main/scala/org/apache/spark/scheduler/TaskSetManager.scala ---
@@ -783,18 +783,6 @@ private[spark] class TaskSetManager(
 
   /** Called by TaskScheduler when an executor is lost so we can 
re-enqueue our tasks */
   override def executorLost(execId: String, host: String, reason: 
ExecutorLossReason) {
-logInfo("Re-queueing tasks for " + execId + " from TaskSet " + 
taskSet.id)
-
-// Re-enqueue pending tasks for this host based on the status of the 
cluster. Note
-// that it's okay if we add a task to the same queue twice (if it had 
multiple preferred
-// locations), because dequeueTaskFromList will skip already-running 
tasks.
-for (index <- getPendingTasksForExecutor(execId)) {
-  addPendingTask(index, readding = true)
-}
-for (index <- getPendingTasksForHost(host)) {
-  addPendingTask(index, readding = true)
-}
--- End diff --

Ah, now I see that @CodingCat had the same suggestion.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-20 Thread markhamstra
Github user markhamstra commented on a diff in the pull request:

https://github.com/apache/spark/pull/9154#discussion_r42530729
  
--- Diff: 
core/src/main/scala/org/apache/spark/scheduler/TaskSetManager.scala ---
@@ -783,18 +783,6 @@ private[spark] class TaskSetManager(
 
   /** Called by TaskScheduler when an executor is lost so we can 
re-enqueue our tasks */
   override def executorLost(execId: String, host: String, reason: 
ExecutorLossReason) {
-logInfo("Re-queueing tasks for " + execId + " from TaskSet " + 
taskSet.id)
-
-// Re-enqueue pending tasks for this host based on the status of the 
cluster. Note
-// that it's okay if we add a task to the same queue twice (if it had 
multiple preferred
-// locations), because dequeueTaskFromList will skip already-running 
tasks.
-for (index <- getPendingTasksForExecutor(execId)) {
-  addPendingTask(index, readding = true)
-}
-for (index <- getPendingTasksForHost(host)) {
-  addPendingTask(index, readding = true)
-}
--- End diff --

After this diff, nothing is using the misspelled `readding` parameter 
anymore, so we may as well drop that from `addPendingTask`.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-19 Thread kayousterhout
Github user kayousterhout commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-149293450
  
Yeah you're right that there doesn't seem to be any point in keeping 
entries for dead executors.  I'm guessing the assumption is that all this state 
is cleaned up after the stage finishes anyway, so it's not a big deal to have 
some extra state hanging around.  But in any case, that's a separate bug.

@markhamstra if you have time to look at this, it would be helpful to have 
one more set of eyes!


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-19 Thread vanzin
Github user vanzin commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-149292119
  
Ok, I see. I think I was under the impression that the code was actually 
removing tasks from old pending lists, but that doesn't seem to be the case. 
That also looks like a bug in itself - what's the point of keeping a pending 
list for a dead executor? I can see keeping a pending list for empty hosts 
because of dynamic allocation, but even that sounds fishy (the per-host pending 
list can be re-created if an executor is ever started again on that host).

So you're right, after reading the code again a few more times it does seem 
addPendingTask is redundant.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-19 Thread kayousterhout
Github user kayousterhout commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-149289335
  
To give a specific example, suppose task t1 has preferred locations on 
executor e1 (on host h1), e2 (also on host h1) and e3 (on host h2).

The data structures will look like:

pendingTasksForExecutor: {e1: t1, e2: t1, e3: t1}
pendingTasksForHost: {h1: t1, h2: t1}

We've agreed that the "addPendingTask" call is irrelevant for tasks that 
are currently running (because addPendingTask is called for any running tasks 
in handleFailedTask), so let's say t1 hasn't been run yet.

Now suppose executor e2 dies.  We never remove any entries from 
pendingTasksForExecutor or pendingTasksForHost (not in addPendingTask, nor 
anyplace else, as far as I can tell; we still won't schedule things on the died 
executor, because the TaskSetManager will never get a resource offer for it).  
addPendingTask will "readd" entries for each of t1's preferred locations 
(including the lost executor -- we don't check whether the executor is alive 
when updating the map entries).  However, all of these locations were already 
added above, so this call has no effect.

Which part of this reasoning do you think is incorrect?



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-19 Thread kayousterhout
Github user kayousterhout commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-149287156
  
What do you mean by "fixed"?  As I said in my earlier comment, I don't see 
how addPendingTasks fixes that, since addPendingTask doesn't remove anything 
from any mappings.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-19 Thread vanzin
Github user vanzin commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-149284732
  
> What do you mean about tasks being in the wrong list?

Let's say you have a task whose locality preference includes "executor 1". 
Now "executor 1" dies. That task would be in the `pendingTasksForExecutor` list 
for that executor, and now that's wrong and it should be fixed. My 
understanding is that `addPendingTask` is achieving that here.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-18 Thread kayousterhout
Github user kayousterhout commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-149106963
  
@vanzin my understanding is that addPendingTask looks through each 
preferred location for that task, and adds the task to the lists for (1) the 
list of executors corresponding to that location, (2) the list of hosts 
corresponding to that location and (3) the list of racks corresponding to that 
location.  For tasks that are not yet running, it seems like this calling this 
in executorLost should have no effect: the only difference from the previous 
time that addPendingTask was called (before the executor was lost) is that, 
before the executor was lost, the task would have been added to one additional 
list for the lost executor.

What do you mean about tasks being in the wrong list?  I see that there 
will be an entry (with a list of pending tasks) in pendingTaskForExecutor 
corresponding to an executor that is dead, but calling "addPendingTasks" never 
removes anything from any mappings, so that call doesn't fix that problem.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-16 Thread CodingCat
Github user CodingCat commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-148881593
  
the code block starts from 
https://github.com/apache/spark/pull/9154/files#diff-bad3987c83bd22d46416d3dd9d208e76R790
 seems fishy to me

are we updating 'copiesRunning' for twice?



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-16 Thread CodingCat
Github user CodingCat commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-148881300
  
errrare we distinguish running and pending in TaskInfo?I really 
need to update my knowledge about scheduler code

if no, the `handleFailedTask` is actually called within 
https://github.com/apache/spark/blob/8ac71d62d976bbfd0159cac6816dd8fa580ae1cb/core/src/main/scala/org/apache/spark/scheduler/TaskSetManager.scala#L816

what it does is we `re-add` all tasks whose executorId equals to the ID of 
this just failed executor


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-16 Thread vanzin
Github user vanzin commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-148881135
  
@CodingCat but `handleFailedTask` only calls `addPendingTask` for, as the 
name suggests, failed tasks. This code is re-scheduling *pending* tasks that 
haven't even begun execution yet.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-16 Thread CodingCat
Github user CodingCat commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-148880182
  
@vanzin I think the key point is at 


https://github.com/apache/spark/blob/8ac71d62d976bbfd0159cac6816dd8fa580ae1cb/core/src/main/scala/org/apache/spark/scheduler/TaskSetManager.scala#L821

and in handleFailedTask, we have called addPendingTask


https://github.com/apache/spark/blob/8ac71d62d976bbfd0159cac6816dd8fa580ae1cb/core/src/main/scala/org/apache/spark/scheduler/TaskSetManager.scala#L652

My suggestion is that maybe we shall remove `readding` parameter in 
addToList, as the removed lines in this patch are the only places where we are 
not using the default values...




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-16 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-148878074
  
Merged build finished. Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-16 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-148878028
  
  [Test build #43868 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/43868/console)
 for   PR 9154 at commit 
[`26aa899`](https://github.com/apache/spark/commit/26aa8999a62eba632c670ba3dc57065f0d5ea4df).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-16 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-148878075
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/43868/
Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-16 Thread vanzin
Github user vanzin commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-148870844
  
>they're still in all of the correct pending lists, so calling 
addPendingTask has no effect

That's not really accurate, is it? `TaskSetManager.executorLost` is called 
by `TaskSchedulerImpl.executorLost` after it has updated the list of available 
executors and which hosts they live on; so if there are pending tasks for the 
executor and / or the host, they're currently in the wrong list, and the 
`addPendingTask` calls fix that.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-16 Thread kayousterhout
Github user kayousterhout commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-148867654
  
@CodingCat @cmccabe you both also modified this code recently...does this 
change look reasonable to you?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-16 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-148866726
  
  [Test build #43868 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/43868/consoleFull)
 for   PR 9154 at commit 
[`26aa899`](https://github.com/apache/spark/commit/26aa8999a62eba632c670ba3dc57065f0d5ea4df).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-16 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-148865777
  
Merged build started.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-16 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/9154#issuecomment-148865770
  
 Merged build triggered.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-11163] Remove unnecessary addPendingTas...

2015-10-16 Thread kayousterhout
GitHub user kayousterhout opened a pull request:

https://github.com/apache/spark/pull/9154

[SPARK-11163] Remove unnecessary addPendingTask calls.

This commit removes unnecessary calls to addPendingTask in
TaskSetManager.executorLost. These calls are unnecessary: for
tasks that are still pending and haven't been launched, they're
still in all of the correct pending lists, so calling addPendingTask
has no effect. For tasks that are currently running (which may still be
in the pending lists, depending on how they were scheduled), we call
addPendingTask in handleFailedTask, so the calls at the beginning
of executorLost are redundant.

I think these calls are left over from when we re-computed the locality
levels in addPendingTask; now that we call recomputeLocality separately,
I don't think these are necessary.

@markhamstra can you take a look at this?

cc @vanzin 

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/kayousterhout/spark-1 SPARK-11163

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/9154.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #9154


commit 26aa8999a62eba632c670ba3dc57065f0d5ea4df
Author: Kay Ousterhout 
Date:   2015-10-16T23:32:00Z

[SPARK-11163] Remove unnecessary addPendingTask calls.

This commit removes unnecessary calls to addPendingTask in
TaskSetManager.executorLost. These calls are unnecessary: for
tasks that are still pending and haven't been launched, they're
still in all of the correct pending lists, so calling addPendingTask
has no effect. For tasks that are currently running (which may still be
in the pending lists, depending on how they were scheduled), we call
addPendingTask in handleFailedTask, so the calls at the beginning
of executorLost are redundant.

I think these calls are left over from when we re-computed the locality
levels in addPendingTask; now that we call recomputeLocality separately,
I don't think these are necessary.




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org