[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2015-01-07 Thread JoshRosen
Github user JoshRosen closed the pull request at: https://github.com/apache/spark/pull/3687 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is e

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2015-01-07 Thread JoshRosen
Github user JoshRosen commented on the pull request: https://github.com/apache/spark/pull/3687#issuecomment-69080052 I'm going to close this for now; I'll open separate, smaller PRs for the remaining test cleanup.

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-25 Thread JoshRosen
Github user JoshRosen commented on a diff in the pull request: https://github.com/apache/spark/pull/3687#discussion_r22269847 --- Diff: streaming/src/test/scala/org/apache/spark/streaming/CheckpointSuite.scala --- @@ -281,34 +278,45 @@ class CheckpointSuite extends TestSuiteBase {

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-25 Thread JoshRosen
Github user JoshRosen commented on the pull request: https://github.com/apache/spark/pull/3687#issuecomment-68094079 Just realized that my last comment was a bit confusing, since SPARK-1600 is not related to the FileInputStream ManualClock fix. I'll file a new improvement JIRA to cov

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-24 Thread JoshRosen
Github user JoshRosen commented on the pull request: https://github.com/apache/spark/pull/3687#issuecomment-68091244 I agree that it's a good idea to split this up. For starters, I'm going to try splitting off only the fix for the FileInputDStream test (SPARK-1600), since that

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-24 Thread tdas
Github user tdas commented on the pull request: https://github.com/apache/spark/pull/3687#issuecomment-68082434 At a high level, I think, let's split this PR into two PRs: 1. Fixing saveAsHadoopFiles 2. Removing thread loops This would allow the discussion of one to procee

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-24 Thread JoshRosen
Github user JoshRosen commented on a diff in the pull request: https://github.com/apache/spark/pull/3687#discussion_r22265904 --- Diff: streaming/src/test/scala/org/apache/spark/streaming/TestSuiteBase.scala --- @@ -104,6 +106,77 @@ class TestOutputStreamWithPartitions[T: ClassTag

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-24 Thread tdas
Github user tdas commented on a diff in the pull request: https://github.com/apache/spark/pull/3687#discussion_r22265867 --- Diff: streaming/src/test/scala/org/apache/spark/streaming/TestSuiteBase.scala --- @@ -104,6 +106,77 @@ class TestOutputStreamWithPartitions[T: ClassTag](par

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-24 Thread tdas
Github user tdas commented on a diff in the pull request: https://github.com/apache/spark/pull/3687#discussion_r22265799 --- Diff: streaming/src/test/scala/org/apache/spark/streaming/CheckpointSuite.scala --- @@ -281,34 +278,45 @@ class CheckpointSuite extends TestSuiteBase {

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-24 Thread tdas
Github user tdas commented on a diff in the pull request: https://github.com/apache/spark/pull/3687#discussion_r22265787 --- Diff: streaming/src/test/scala/org/apache/spark/streaming/CheckpointSuite.scala --- @@ -46,8 +46,6 @@ class CheckpointSuite extends TestSuiteBase {

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-24 Thread tdas
Github user tdas commented on a diff in the pull request: https://github.com/apache/spark/pull/3687#discussion_r22265770 --- Diff: streaming/src/test/scala/org/apache/spark/streaming/BasicOperationsSuite.scala --- @@ -639,7 +639,7 @@ class BasicOperationsSuite extends TestSuiteBas

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-24 Thread tdas
Github user tdas commented on a diff in the pull request: https://github.com/apache/spark/pull/3687#discussion_r22265754 --- Diff: streaming/src/main/scala/org/apache/spark/streaming/util/Clock.scala --- @@ -59,9 +59,11 @@ class SystemClock() extends Clock { private[streaming]

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-24 Thread tdas
Github user tdas commented on a diff in the pull request: https://github.com/apache/spark/pull/3687#discussion_r22265733 --- Diff: streaming/src/main/scala/org/apache/spark/streaming/dstream/FileInputDStream.scala --- @@ -74,12 +74,15 @@ class FileInputDStream[K: ClassTag, V: Clas

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-16 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/3687#issuecomment-67273407 [Test build #24524 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24524/consoleFull) for PR 3687 at commit [`1304776`](https://gith

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-16 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/3687#issuecomment-67273413 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/24

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-16 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/3687#issuecomment-67272979 [Test build #24523 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24523/consoleFull) for PR 3687 at commit [`520bade`](https://gith

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-16 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/3687#issuecomment-67272984 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/24

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-16 Thread JoshRosen
Github user JoshRosen commented on a diff in the pull request: https://github.com/apache/spark/pull/3687#discussion_r21949212 --- Diff: streaming/src/test/scala/org/apache/spark/streaming/CheckpointSuite.scala --- @@ -431,15 +454,15 @@ class CheckpointSuite extends TestSuiteBase {

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-16 Thread JoshRosen
Github user JoshRosen commented on a diff in the pull request: https://github.com/apache/spark/pull/3687#discussion_r21949136 --- Diff: streaming/src/test/scala/org/apache/spark/streaming/TestSuiteBase.scala --- @@ -286,22 +359,23 @@ trait TestSuiteBase extends FunSuite with Befor

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-16 Thread JoshRosen
Github user JoshRosen commented on a diff in the pull request: https://github.com/apache/spark/pull/3687#discussion_r21949039 --- Diff: streaming/src/test/scala/org/apache/spark/streaming/CheckpointSuite.scala --- @@ -46,8 +46,6 @@ class CheckpointSuite extends TestSuiteBase {

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-16 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/3687#issuecomment-67268177 [Test build #24524 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24524/consoleFull) for PR 3687 at commit [`1304776`](https://githu

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-16 Thread JoshRosen
Github user JoshRosen commented on the pull request: https://github.com/apache/spark/pull/3687#issuecomment-67268086 Alright, I think I fixed up a few problems in my first version of the SPARK-1600 fix, so hopefully it works now. For those who are interested in the details:

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-16 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/3687#issuecomment-67267461 [Test build #24523 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24523/consoleFull) for PR 3687 at commit [`520bade`](https://githu

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-16 Thread JoshRosen
Github user JoshRosen commented on the pull request: https://github.com/apache/spark/pull/3687#issuecomment-67265698 It turns out that my current use of `file.setLastModified` is brittle since the underlying filesystem may only support one-second resolution: ``` * Al
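
For readers unfamiliar with the one-second-resolution issue mentioned above, here is a minimal Java sketch of one possible workaround: align the timestamp to a whole second before calling `File.setLastModified`, so that a filesystem with one-second resolution has nothing to truncate. The class name `LastModifiedSketch` and the helper `truncateToSeconds` are hypothetical and do not come from the pull request; the fix actually adopted there may differ, and filesystems with coarser resolution would still need a larger alignment.

```java
import java.io.File;
import java.io.IOException;
import java.nio.file.Files;

public class LastModifiedSketch {
    // Hypothetical helper: round a millisecond timestamp down to a whole second.
    static long truncateToSeconds(long millis) {
        return Math.floorDiv(millis, 1000L) * 1000L;
    }

    public static void main(String[] args) throws IOException {
        File file = Files.createTempFile("lastmod-sketch", ".txt").toFile();
        file.deleteOnExit();

        // A second-aligned timestamp should survive one-second filesystem resolution.
        long requested = truncateToSeconds(1_418_700_000_123L);
        boolean accepted = file.setLastModified(requested);

        // Report whether the filesystem stored exactly the value that was requested.
        System.out.println("accepted=" + accepted
            + " roundTrip=" + (file.lastModified() == requested));
    }
}
```

A test that compares modification times against a clock value can then apply the same truncation to both sides of the comparison.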

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-16 Thread JoshRosen
Github user JoshRosen commented on the pull request: https://github.com/apache/spark/pull/3687#issuecomment-67239409 This latest run failed three tests: - org.apache.spark.streaming.CheckpointSuite.recovery with file input stream - org.apache.spark.streaming.FailureSuite

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-16 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/3687#issuecomment-67137400 [Test build #24495 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24495/consoleFull) for PR 3687 at commit [`ee8c8f8`](https://gith

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-16 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/3687#issuecomment-67137409 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/24

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-16 Thread JoshRosen
Github user JoshRosen commented on a diff in the pull request: https://github.com/apache/spark/pull/3687#discussion_r21884849 --- Diff: streaming/src/test/scala/org/apache/spark/streaming/InputStreamsSuite.scala --- @@ -238,14 +190,17 @@ class InputStreamsSuite extends TestSuiteBas

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-16 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/3687#issuecomment-67130485 [Test build #24495 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24495/consoleFull) for PR 3687 at commit [`ee8c8f8`](https://githu

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-16 Thread JoshRosen
Github user JoshRosen commented on a diff in the pull request: https://github.com/apache/spark/pull/3687#discussion_r21884670 --- Diff: streaming/src/test/scala/org/apache/spark/streaming/CheckpointSuite.scala --- @@ -305,7 +305,7 @@ class CheckpointSuite extends TestSuiteBase {

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-16 Thread JoshRosen
Github user JoshRosen commented on a diff in the pull request: https://github.com/apache/spark/pull/3687#discussion_r21884615 --- Diff: streaming/src/main/scala/org/apache/spark/streaming/dstream/FileInputDStream.scala --- @@ -74,12 +74,14 @@ class FileInputDStream[K: ClassTag, V:

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-16 Thread JoshRosen
Github user JoshRosen commented on the pull request: https://github.com/apache/spark/pull/3687#issuecomment-67130169 I pushed some more changes (this is still WIP). In addition to our use of Thread.sleep, I think I should also look out for cases where streaming directly calls System.
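
The general technique behind removing `Thread.sleep` from tests can be sketched as follows, assuming the code under test reads time through an injectable clock rather than from the system directly. The names `Clock`, `FakeClock`, and `isExpired` are hypothetical illustrations and are not Spark's actual `Clock` or `ManualClock` API.

```java
import java.util.concurrent.atomic.AtomicLong;

public class ManualClockSketch {
    // Hypothetical abstraction: production code asks this for the time.
    interface Clock {
        long currentTimeMillis();
    }

    // Test-only clock whose time moves only when the test advances it.
    static final class FakeClock implements Clock {
        private final AtomicLong now;

        FakeClock(long startMillis) {
            this.now = new AtomicLong(startMillis);
        }

        @Override
        public long currentTimeMillis() {
            return now.get();
        }

        void advance(long millis) {
            now.addAndGet(millis);
        }
    }

    // Example of time-dependent logic that takes the clock as a parameter.
    static boolean isExpired(Clock clock, long deadlineMillis) {
        return clock.currentTimeMillis() >= deadlineMillis;
    }

    public static void main(String[] args) {
        FakeClock clock = new FakeClock(0L);
        long deadline = 500L;

        System.out.println(isExpired(clock, deadline)); // prints "false"
        clock.advance(500L); // no real waiting happens here
        System.out.println(isExpired(clock, deadline)); // prints "true"
    }
}
```

Because the test controls time explicitly, it runs instantly and does not depend on scheduler timing. Any code path that bypasses the injected clock would defeat this, which is presumably why such direct calls are worth auditing.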

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-13 Thread JoshRosen
Github user JoshRosen commented on the pull request: https://github.com/apache/spark/pull/3687#issuecomment-66892579 This latest round of test failures was due to the flaky `WriteAheadLogBackedBlockRDDSuite` tests ([SPARK-4826](https://issues.apache.org/jira/browse/SPARK-4826)). I'm

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-13 Thread tdas
Github user tdas commented on a diff in the pull request: https://github.com/apache/spark/pull/3687#discussion_r21787343 --- Diff: streaming/src/test/scala/org/apache/spark/streaming/TestSuiteBase.scala --- @@ -104,6 +106,49 @@ class TestOutputStreamWithPartitions[T: ClassTag](par

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-13 Thread tdas
Github user tdas commented on a diff in the pull request: https://github.com/apache/spark/pull/3687#discussion_r21787336 --- Diff: streaming/src/test/scala/org/apache/spark/streaming/TestSuiteBase.scala --- @@ -104,6 +106,49 @@ class TestOutputStreamWithPartitions[T: ClassTag](par

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-13 Thread tdas
Github user tdas commented on a diff in the pull request: https://github.com/apache/spark/pull/3687#discussion_r21787328 --- Diff: docs/configuration.md --- @@ -709,7 +709,9 @@ Apart from these, the following properties are also available, and may be useful If set to true,

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-13 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/3687#issuecomment-66871584 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/24

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-13 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/3687#issuecomment-66871580 [Test build #24435 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24435/consoleFull) for PR 3687 at commit [`12635b4`](https://gith

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-13 Thread tdas
Github user tdas commented on the pull request: https://github.com/apache/spark/pull/3687#issuecomment-66871384 @JoshRosen This is a wonderful and much-required refactoring. I haven't been able to see it in detail yet, will do when I reach home.

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-13 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/3687#issuecomment-66870761 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/24

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-13 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/3687#issuecomment-66870757 [Test build #24434 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24434/consoleFull) for PR 3687 at commit [`3db335f`](https://gith

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-13 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/3687#issuecomment-66870136 [Test build #24435 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24435/consoleFull) for PR 3687 at commit [`12635b4`](https://githu

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-13 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/3687#issuecomment-66869474 [Test build #24434 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24434/consoleFull) for PR 3687 at commit [`3db335f`](https://githu

[GitHub] spark pull request: [WIP] Remove many uses of Thread.sleep() from ...

2014-12-12 Thread JoshRosen
Github user JoshRosen commented on the pull request: https://github.com/apache/spark/pull/3687#issuecomment-66865631 /cc @tdas. This is a work-in-progress towards removing most of the Thread.sleep() calls. I'm making slow-but-steady progress; would love your feedback + any suggestio