[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-11-29 Thread rezasafi
Github user rezasafi commented on the issue: https://github.com/apache/spark/pull/19388 The new PR is here: https://github.com/apache/spark/pull/19848 --- - To unsubscribe, e-mail:

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-11-29 Thread rezasafi
Github user rezasafi commented on the issue: https://github.com/apache/spark/pull/19388 I am closing this PR in favor of another one with a simpler approach that I will open shortly. Thank you everyone for the comments, here. ---

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-11-27 Thread rezasafi
Github user rezasafi commented on the issue: https://github.com/apache/spark/pull/19388 @mridulm thank you very much for the comment. I was thinking about something similar, but more like sending both stageID and a monotonically increasing jobId to the commit method and then let try

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-11-27 Thread mridulm
Github user mridulm commented on the issue: https://github.com/apache/spark/pull/19388 Internally, spark can continue to use stageId for its commit protocol - which is independent of what we expose as jobId to mapred/mapreduce committer. As an implementation detail, we were making

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-11-27 Thread rezasafi
Github user rezasafi commented on the issue: https://github.com/apache/spark/pull/19388 Thank you very much, @mridulm for the comment. Yeah, I agree that the provided solution is complex. The complexity came form the fact that in this solution both executors and the driver use a

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-11-27 Thread mridulm
Github user mridulm commented on the issue: https://github.com/apache/spark/pull/19388 I agree with @vanzin - this looks very complicated for a enforcing a fairly simple constraint. It would be easier to depend on a AtomicInteger in driver for the id - and propagate that to

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-11-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19388 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/84033/ Test PASSed. ---

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-11-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19388 Merged build finished. Test PASSed. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-11-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19388 **[Test build #84033 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/84033/testReport)** for PR 19388 at commit

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-11-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19388 **[Test build #84033 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/84033/testReport)** for PR 19388 at commit

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-11-20 Thread squito
Github user squito commented on the issue: https://github.com/apache/spark/pull/19388 Jenkins, retest this please --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail:

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-11-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19388 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/84006/ Test FAILed. ---

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-11-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19388 Merged build finished. Test FAILed. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-11-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19388 **[Test build #84006 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/84006/testReport)** for PR 19388 at commit

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-11-19 Thread rezasafi
Github user rezasafi commented on the issue: https://github.com/apache/spark/pull/19388 Thank you very much, @dongjoon-hyun. I forgot to pushed the fix for streaming failure. @vanzin My understanding based on reading the code before this change is that the logic behind it avoids

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-11-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19388 **[Test build #84006 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/84006/testReport)** for PR 19388 at commit

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-11-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19388 Merged build finished. Test FAILed. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-11-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19388 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/84002/ Test FAILed. ---

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-11-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19388 **[Test build #84002 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/84002/testReport)** for PR 19388 at commit

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-11-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19388 **[Test build #84002 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/84002/testReport)** for PR 19388 at commit

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-11-19 Thread dongjoon-hyun
Github user dongjoon-hyun commented on the issue: https://github.com/apache/spark/pull/19388 Retest this please. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail:

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-11-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19388 Merged build finished. Test FAILed. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-11-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19388 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/83976/ Test FAILed. ---

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-11-17 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19388 **[Test build #83976 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/83976/testReport)** for PR 19388 at commit

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-11-17 Thread vanzin
Github user vanzin commented on the issue: https://github.com/apache/spark/pull/19388 High level question: do you need to create the commit id based on the rdd + stage ids, or could it be something else (like a monotonically increasing value, or a uuid)? That would probably simplify

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-11-17 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19388 **[Test build #83976 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/83976/testReport)** for PR 19388 at commit

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-10-30 Thread squito
Github user squito commented on the issue: https://github.com/apache/spark/pull/19388 also @mridulm I think you worked on the committer protocol before, both in spark and MR, so would be great to get your thoughts. ---

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-10-27 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/19388 I haven't seen any issues with speculation, most people aren't on 2.2 yet though either. Unfortunately won't have time to look at this today in detail, will try to early next week. ---

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-10-26 Thread squito
Github user squito commented on the issue: https://github.com/apache/spark/pull/19388 @rezasafi and I talked about this for a while offline. Its a lot trickier than this seems at first. Here's whats going on: 1. `SparkHadoopWriter` is pretending `rddId` is `stageId`, but

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-10-25 Thread rezasafi
Github user rezasafi commented on the issue: https://github.com/apache/spark/pull/19388 Sorry for the delay. It seems that to be able to commit the same rdd in different stages we need to use stageId. So the jobId and other configurations in the write method of SparkHadoopWriter

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-09-29 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19388 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/82331/ Test FAILed. ---

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-09-29 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19388 Merged build finished. Test FAILed. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-09-29 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19388 **[Test build #82331 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82331/testReport)** for PR 19388 at commit

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-09-29 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19388 Merged build finished. Test FAILed. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-09-29 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19388 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/82325/ Test FAILed. ---

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-09-29 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19388 **[Test build #82325 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82325/testReport)** for PR 19388 at commit

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-09-29 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19388 **[Test build #82331 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82331/testReport)** for PR 19388 at commit

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-09-29 Thread gatorsmile
Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/19388 retest this please --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail:

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-09-29 Thread gatorsmile
Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/19388 cc @jiangxb1987 --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail:

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-09-29 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19388 **[Test build #82325 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82325/testReport)** for PR 19388 at commit

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-09-28 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19388 **[Test build #82304 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82304/testReport)** for PR 19388 at commit

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-09-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19388 Merged build finished. Test FAILed. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-09-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19388 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/82304/ Test FAILed. ---

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-09-28 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19388 **[Test build #82304 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82304/testReport)** for PR 19388 at commit

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-09-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19388 Merged build finished. Test FAILed. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-09-28 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19388 **[Test build #82300 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82300/testReport)** for PR 19388 at commit

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-09-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19388 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/82300/ Test FAILed. ---

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-09-28 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19388 **[Test build #82300 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82300/testReport)** for PR 19388 at commit

[GitHub] spark issue #19388: [SPARK-22162] Executors and the driver should use consis...

2017-09-28 Thread vanzin
Github user vanzin commented on the issue: https://github.com/apache/spark/pull/19388 ok to test --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: