GitHub user kayousterhout opened a pull request: https://github.com/apache/spark/pull/13646
[SPARK-15927] Eliminate redundant DAGScheduler code. ## What changes were proposed in this pull request? To try to eliminate redundant code to traverse the RDD dependency graph, this PR creates a new function getShuffleDependencies that returns shuffle dependencies that are immediate parents of a given RDD. This new function is used by getParentStages and getAncestorShuffleDependencies. cc @squito @markhamstra FYI @rxin You can merge this pull request into a Git repository by running: $ git pull https://github.com/kayousterhout/spark-1 SPARK-15927 Alternatively you can review and apply these changes as the patch at: https://github.com/apache/spark/pull/13646.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #13646 ---- commit 5078bb2037201cbcd51223d2acf971a7a7fcc0cb Author: Kay Ousterhout <kayousterh...@gmail.com> Date: 2016-06-10T23:12:02Z [SPARK-15927] Eliminate redundant DAGScheduler code. commit 42a8d16ed0b7e8175a58d1d6fa21685cc36c85c2 Author: Kay Ousterhout <kayousterh...@gmail.com> Date: 2016-06-13T19:43:32Z Improved method comment ---- --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org