spark git commit: [SPARK-12760][DOCS] invalid lambda expression in python example for …

2016-01-23 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 dca238af7 -> e8ae242f9 [SPARK-12760][DOCS] invalid lambda expression in python example for … …local vs cluster srowen thanks for the PR at https://github.com/apache/spark/pull/10866! sorry it took me a while. This is rela

spark git commit: [SPARK-12760][DOCS] invalid lambda expression in python example for …

2016-01-23 Thread srowen
Repository: spark Updated Branches: refs/heads/master 358a33bbf -> 56f57f894 [SPARK-12760][DOCS] invalid lambda expression in python example for … …local vs cluster srowen thanks for the PR at https://github.com/apache/spark/pull/10866! sorry it took me a while. This is rela

spark git commit: [SPARK-11137][STREAMING] Make StreamingContext.stop() exception-safe

2016-01-23 Thread srowen
Repository: spark Updated Branches: refs/heads/master aca2a0165 -> 5f5698012 [SPARK-11137][STREAMING] Make StreamingContext.stop() exception-safe Make StreamingContext.stop() exception-safe Author: jayadevanmurali Closes #10807 from

spark git commit: [SPARK-12760][DOCS] inaccurate description for difference between local vs cluster mode in closure handling

2016-01-23 Thread srowen
may or may not work as intended in local mode Author: Sean Owen <so...@cloudera.com> Closes #10866 from srowen/SPARK-12760. (cherry picked from commit aca2a0165405b9eba27ac5e4739e36a618b96676) Signed-off-by: Sean Owen <so...@cloudera.com> Project: http://git-wip-us.apache.org/repos

spark git commit: [SPARK-12760][DOCS] inaccurate description for difference between local vs cluster mode in closure handling

2016-01-23 Thread srowen
may or may not work as intended in local mode Author: Sean Owen <so...@cloudera.com> Closes #10866 from srowen/SPARK-12760. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/aca2a016 Tree: http://git-wip-us.apache.org/repos/

spark git commit: [SPARK-12932][JAVA API] improved error message for java type inference failure

2016-01-25 Thread srowen
Repository: spark Updated Branches: refs/heads/master 3adebfc9a -> d8e480521 [SPARK-12932][JAVA API] improved error message for java type inference failure Author: Andy Grove Closes #10865 from andygrove/SPARK-12932. Project:

spark git commit: [SPARK-12755][CORE] Stop the event logger before the DAG scheduler

2016-01-25 Thread srowen
Repository: spark Updated Branches: refs/heads/master d8e480521 -> 4ee8191e5 [SPARK-12755][CORE] Stop the event logger before the DAG scheduler [SPARK-12755][CORE] Stop the event logger before the DAG scheduler to avoid a race condition where the standalone master attempts to build the app's

spark git commit: [SPARK-12932][JAVA API] improved error message for java type inference failure

2016-01-25 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 88614dd0f -> 88114d3d8 [SPARK-12932][JAVA API] improved error message for java type inference failure Author: Andy Grove Closes #10865 from andygrove/SPARK-12932. (cherry picked from commit

spark git commit: [SPARK-12755][CORE] Stop the event logger before the DAG scheduler

2016-01-25 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 88114d3d8 -> b40e58cf2 [SPARK-12755][CORE] Stop the event logger before the DAG scheduler [SPARK-12755][CORE] Stop the event logger before the DAG scheduler to avoid a race condition where the standalone master attempts to build the

spark git commit: [SPARK-12755][CORE] Stop the event logger before the DAG scheduler

2016-01-25 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.5 132718ad7 -> ae6fcc6bc [SPARK-12755][CORE] Stop the event logger before the DAG scheduler [SPARK-12755][CORE] Stop the event logger before the DAG scheduler to avoid a race condition where the standalone master attempts to build the

spark git commit: [SPARK-12983][CORE][DOC] Correct metrics.properties.template

2016-01-27 Thread srowen
Repository: spark Updated Branches: refs/heads/master 4db255c7a -> 90b0e5624 [SPARK-12983][CORE][DOC] Correct metrics.properties.template There are some typos or plain unintelligible sentences in the metrics template. Author: BenFradet Closes #10902 from

spark git commit: [SPARK-1680][DOCS] Explain environment variables for running on YARN in cluster mode

2016-01-27 Thread srowen
Repository: spark Updated Branches: refs/heads/master 90b0e5624 -> 093291cf9 [SPARK-1680][DOCS] Explain environment variables for running on YARN in cluster mode JIRA 1680 added a property called spark.yarn.appMasterEnv. This PR draws users' attention to this special case by adding an

spark git commit: [SPARK-12995][GRAPHX] Remove deprecate APIs from Pregel

2016-02-15 Thread srowen
Repository: spark Updated Branches: refs/heads/master a8bbc4f50 -> 56d49397e [SPARK-12995][GRAPHX] Remove deprecate APIs from Pregel Author: Takeshi YAMAMURO Closes #10918 from maropu/RemoveDeprecateInPregel. Project:

spark git commit: [SPARK-13312][MLLIB] Update java train-validation-split example in ml-guide

2016-02-15 Thread srowen
Repository: spark Updated Branches: refs/heads/master 56d49397e -> adb548365 [SPARK-13312][MLLIB] Update java train-validation-split example in ml-guide Response to JIRA https://issues.apache.org/jira/browse/SPARK-13312. This contribution is my original work and I license the work to this

spark git commit: [SPARK-13312][MLLIB] Update java train-validation-split example in ml-guide

2016-02-15 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 ec40c5a59 -> 71f53edc0 [SPARK-13312][MLLIB] Update java train-validation-split example in ml-guide Response to JIRA https://issues.apache.org/jira/browse/SPARK-13312. This contribution is my original work and I license the work to

spark git commit: Correct SparseVector.parse documentation

2016-02-16 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 71f53edc0 -> d95089190 Correct SparseVector.parse documentation There's a small typo in the SparseVector.parse docstring (which says that it returns a DenseVector rather than a SparseVector), which seems to be incorrect. Author:

spark git commit: Correct SparseVector.parse documentation

2016-02-16 Thread srowen
Repository: spark Updated Branches: refs/heads/master fee739f07 -> 827ed1c06 Correct SparseVector.parse documentation There's a small typo in the SparseVector.parse docstring (which says that it returns a DenseVector rather than a SparseVector), which seems to be incorrect. Author: Miles

spark git commit: [SPARK-12247][ML][DOC] Documentation for spark.ml's ALS and collaborative filtering in general

2016-02-16 Thread srowen
Repository: spark Updated Branches: refs/heads/master 827ed1c06 -> 00c72d27b [SPARK-12247][ML][DOC] Documentation for spark.ml's ALS and collaborative filtering in general This documents the implementation of ALS in `spark.ml` with example code in scala, java and python. Author: BenFradet

spark git commit: [SPARK-13339][DOCS] Clarify commutative / associative operator requirements for reduce, fold

2016-02-19 Thread srowen
ull/11091 Author: Sean Owen <so...@cloudera.com> Closes #11217 from srowen/SPARK-13339. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/fb7e2179 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/fb7e2179 Diff: h

spark git commit: [SPARK-13302][PYSPARK][TESTS] Move the temp file creation and cleanup outside of the doctests

2016-02-20 Thread srowen
Repository: spark Updated Branches: refs/heads/master dfb2ae2f1 -> 9ca79c1ec [SPARK-13302][PYSPARK][TESTS] Move the temp file creation and cleanup outside of the doctests Some of the new doctests in ml/clustering.py have a lot of setup code, move the setup code to the general test init to

spark git commit: [SPARK-13248][STREAMING] Remove deprecated Streaming APIs.

2016-02-21 Thread srowen
Repository: spark Updated Branches: refs/heads/master d9efe63ec -> 1a340da8d [SPARK-13248][STREAMING] Remove deprecated Streaming APIs. Remove deprecated Streaming APIs and adjust sample applications. Author: Luciano Resende Closes #11139 from

spark git commit: [SPARK-13300][DOCUMENTATION] Added pygments.rb dependancy

2016-02-14 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 107290c94 -> ec40c5a59 [SPARK-13300][DOCUMENTATION] Added pygments.rb dependancy Looks like pygments.rb gem is also required for jekyll build to work. At least on Ubuntu/RHEL I could not do build without this dependency. So added this

spark git commit: [SPARK-13300][DOCUMENTATION] Added pygments.rb dependancy

2016-02-14 Thread srowen
Repository: spark Updated Branches: refs/heads/master 354d4c24b -> 331293c30 [SPARK-13300][DOCUMENTATION] Added pygments.rb dependancy Looks like pygments.rb gem is also required for jekyll build to work. At least on Ubuntu/RHEL I could not do build without this dependency. So added this to

spark git commit: [SPARK-13278][CORE] Launcher fails to start with JDK 9 EA

2016-02-14 Thread srowen
Repository: spark Updated Branches: refs/heads/master 331293c30 -> 22e9723d6 [SPARK-13278][CORE] Launcher fails to start with JDK 9 EA See http://openjdk.java.net/jeps/223 for more information about the JDK 9 version string scheme. Author: Claes Redestad Closes

svn commit: r1730318 - /spark/README.md

2016-02-14 Thread srowen
Author: srowen Date: Sun Feb 14 11:46:05 2016 New Revision: 1730318 URL: http://svn.apache.org/viewvc?rev=1730318=rev Log: Add note about installing Pygments with jekyll for Spark site (SPARK-13300) Modified: spark/README.md Modified: spark/README.md URL: http://svn.apache.org/viewvc/spark

spark git commit: [MINOR][DOCS] Fix all typos in markdown files of `doc` and similar patterns in other comments

2016-02-22 Thread srowen
Repository: spark Updated Branches: refs/heads/master 1b144455b -> 024482bf5 [MINOR][DOCS] Fix all typos in markdown files of `doc` and similar patterns in other comments ## What changes were proposed in this pull request? This PR tries to fix all typos in all markdown files under `docs`

spark git commit: [SPARK-13186][STREAMING] migrate away from SynchronizedMap

2016-02-22 Thread srowen
Repository: spark Updated Branches: refs/heads/master 39ff15457 -> 8f35d3eac [SPARK-13186][STREAMING] migrate away from SynchronizedMap trait SynchronizedMap in package mutable is deprecated: Synchronization via traits is deprecated as it is inherently unreliable. Change to

spark git commit: [SPARK-13399][STREAMING] Fix checkpointsuite type erasure warnings

2016-02-22 Thread srowen
Repository: spark Updated Branches: refs/heads/master ef1047fca -> 1b144455b [SPARK-13399][STREAMING] Fix checkpointsuite type erasure warnings ## What changes were proposed in this pull request? Change the checkpointsuite getting the outputstreams to explicitly be unchecked on the generic

spark git commit: [SPARK-12153][SPARK-7617][MLLIB] add support of arbitrary length sentence and other tuning for Word2Vec

2016-02-22 Thread srowen
Repository: spark Updated Branches: refs/heads/master 8f35d3eac -> ef1047fca [SPARK-12153][SPARK-7617][MLLIB] add support of arbitrary length sentence and other tuning for Word2Vec add support of arbitrary length sentence by using the nature representation of sentences in the input. add

[2/8] spark git commit: [SPARK-13595][BUILD] Move docker, extras modules into external

2016-03-09 Thread srowen
http://git-wip-us.apache.org/repos/asf/spark/blob/256704c7/extras/kinesis-asl/src/main/scala/org/apache/spark/streaming/kinesis/KinesisUtils.scala -- diff --git

[4/8] spark git commit: [SPARK-13595][BUILD] Move docker, extras modules into external

2016-03-09 Thread srowen
http://git-wip-us.apache.org/repos/asf/spark/blob/256704c7/extras/README.md -- diff --git a/extras/README.md b/extras/README.md deleted file mode 100644 index 1b4174b..000 --- a/extras/README.md +++ /dev/null @@ -1 +0,0 @@

[7/8] spark git commit: [SPARK-13595][BUILD] Move docker, extras modules into external

2016-03-09 Thread srowen
http://git-wip-us.apache.org/repos/asf/spark/blob/256704c7/external/docker-integration-tests/src/test/scala/org/apache/spark/util/DockerUtils.scala -- diff --git

[6/8] spark git commit: [SPARK-13595][BUILD] Move docker, extras modules into external

2016-03-09 Thread srowen
http://git-wip-us.apache.org/repos/asf/spark/blob/256704c7/external/kinesis-asl/src/main/java/org/apache/spark/examples/streaming/JavaKinesisWordCountASL.java -- diff --git

[1/8] spark git commit: [SPARK-13595][BUILD] Move docker, extras modules into external

2016-03-09 Thread srowen
Repository: spark Updated Branches: refs/heads/master 7791d0c3a -> 256704c77 http://git-wip-us.apache.org/repos/asf/spark/blob/256704c7/pom.xml -- diff --git a/pom.xml b/pom.xml index 90f4672..a3af20c 100644 --- a/pom.xml +++

[3/8] spark git commit: [SPARK-13595][BUILD] Move docker, extras modules into external

2016-03-09 Thread srowen
http://git-wip-us.apache.org/repos/asf/spark/blob/256704c7/extras/kinesis-asl/src/main/scala/org/apache/spark/examples/streaming/KinesisWordCountASL.scala -- diff --git

[8/8] spark git commit: [SPARK-13595][BUILD] Move docker, extras modules into external

2016-03-09 Thread srowen
;so...@cloudera.com> Closes #11523 from srowen/SPARK-13595. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/256704c7 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/256704c7 Diff: http://git-wip-us.apache.org/repos/asf

svn commit: r1734288 - in /spark: documentation.md site/documentation.html

2016-03-09 Thread srowen
Author: srowen Date: Wed Mar 9 17:06:53 2016 New Revision: 1734288 URL: http://svn.apache.org/viewvc?rev=1734288=rev Log: Add Big Data Analytics book to list of Spark resources Modified: spark/documentation.md spark/site/documentation.html Modified: spark/documentation.md URL: http

[5/8] spark git commit: [SPARK-13595][BUILD] Move docker, extras modules into external

2016-03-09 Thread srowen
http://git-wip-us.apache.org/repos/asf/spark/blob/256704c7/external/kinesis-asl/src/main/scala/org/apache/spark/streaming/kinesis/KinesisUtils.scala -- diff --git

spark git commit: [SPARK-13663][CORE] Upgrade Snappy Java to 1.1.2.1

2016-03-10 Thread srowen
tps://github.com/apache/spark/pull/11524 ## How was this patch tested? Jenkins tests. Author: Sean Owen <so...@cloudera.com> Closes #11631 from srowen/SPARK-13663. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/927e22ef T

spark git commit: [SPARK-13758][STREAMING][CORE] enhance exception message to avoid misleading

2016-03-10 Thread srowen
Repository: spark Updated Branches: refs/heads/master 927e22eff -> 74267beb3 [SPARK-13758][STREAMING][CORE] enhance exception message to avoid misleading We have a recoverable Spark streaming job with checkpoint enabled, it could be executed correctly at first time, but throw following

spark git commit: [SPARK-13663][CORE] Upgrade Snappy Java to 1.1.2.1

2016-03-10 Thread srowen
sts. Author: Sean Owen <so...@cloudera.com> Closes #11631 from srowen/SPARK-13663. (cherry picked from commit 927e22eff894cb9b4fe8e5de3578517997292a8a) Signed-off-by: Sean Owen <so...@cloudera.com> Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.a

spark git commit: [SPARK-12653][SQL] Re-enable test "SPARK-8489: MissingRequirementError during reflection"

2016-03-16 Thread srowen
Repository: spark Updated Branches: refs/heads/master 92024797a -> 431a3d04b [SPARK-12653][SQL] Re-enable test "SPARK-8489: MissingRequirementError during reflection" ## What changes were proposed in this pull request? The purpose of

[1/2] spark git commit: [MINOR][DOCS] Fix more typos in comments/strings.

2016-03-14 Thread srowen
Repository: spark Updated Branches: refs/heads/master e58fa19d1 -> acdf21970 http://git-wip-us.apache.org/repos/asf/spark/blob/acdf2197/sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/PartitioningUtils.scala

spark git commit: [SPARK-13746][TESTS] stop using deprecated SynchronizedSet

2016-03-14 Thread srowen
Repository: spark Updated Branches: refs/heads/master acdf21970 -> 31d069d4c [SPARK-13746][TESTS] stop using deprecated SynchronizedSet trait SynchronizedSet in package mutable is deprecated Author: Wilson Wu Closes #11580 from

spark git commit: [SPARK-13803] restore the changes in SPARK-3411

2016-03-15 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.5 7a24d94a8 -> 58fbdf6f7 [SPARK-13803] restore the changes in SPARK-3411 ## What changes were proposed in this pull request? This patch contains the functionality to balance the load of the cluster-mode drivers among workers This

spark git commit: [SPARK-13803] restore the changes in SPARK-3411

2016-03-15 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.4 54ff1f9e1 -> dae346172 [SPARK-13803] restore the changes in SPARK-3411 ## What changes were proposed in this pull request? This patch contains the functionality to balance the load of the cluster-mode drivers among workers This

spark git commit: [SPARK-13803] restore the changes in SPARK-3411

2016-03-15 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 6935b5080 -> 765307f41 [SPARK-13803] restore the changes in SPARK-3411 ## What changes were proposed in this pull request? This patch contains the functionality to balance the load of the cluster-mode drivers among workers This

spark git commit: [SPARK-13803] restore the changes in SPARK-3411

2016-03-15 Thread srowen
Repository: spark Updated Branches: refs/heads/master dafd70fbf -> bd5365bbe [SPARK-13803] restore the changes in SPARK-3411 ## What changes were proposed in this pull request? This patch contains the functionality to balance the load of the cluster-mode drivers among workers This patch

spark git commit: [SPARK-13769][CORE] Update Java Doc in Spark Submit

2016-03-09 Thread srowen
Repository: spark Updated Branches: refs/heads/master c3689bc24 -> 8e8633e0b [SPARK-13769][CORE] Update Java Doc in Spark Submit JIRA : https://issues.apache.org/jira/browse/SPARK-13769 The java doc here

spark git commit: [SPARK-13692][CORE][SQL] Fix trivial Coverity/Checkstyle defects

2016-03-09 Thread srowen
Repository: spark Updated Branches: refs/heads/master 035d3acdf -> f3201aeeb [SPARK-13692][CORE][SQL] Fix trivial Coverity/Checkstyle defects ## What changes were proposed in this pull request? This issue fixes the following potential bugs and Java coding style detected by Coverity and

spark git commit: [SPARK-13631][CORE] Thread-safe getLocationsWithLargestOutputs

2016-03-09 Thread srowen
ion with 10 threads, shows that this fixes it. I have not found a minimal test case to add to Spark, but I will attempt to do so if desired. The same test case was tripping up on SPARK-4454, which was fixed by making other DAGScheduler code thread-safe. shivaram srowen Author: Andy Sloane &l

spark git commit: [SPARK-13640][SQL] Synchronize ScalaReflection.mirror method.

2016-03-09 Thread srowen
Repository: spark Updated Branches: refs/heads/master f3201aeeb -> 2c5af7d4d [SPARK-13640][SQL] Synchronize ScalaReflection.mirror method. ## What changes were proposed in this pull request? `ScalaReflection.mirror` method should be synchronized when scala version is `2.10` because

[1/2] spark git commit: [SPARK-13702][CORE][SQL][MLLIB] Use diamond operator for generic instance creation in Java code.

2016-03-09 Thread srowen
Repository: spark Updated Branches: refs/heads/master cbff2803e -> c3689bc24 http://git-wip-us.apache.org/repos/asf/spark/blob/c3689bc2/sql/hive/src/test/java/org/apache/spark/sql/hive/test/Complex.java -- diff --git

[2/2] spark git commit: [SPARK-13702][CORE][SQL][MLLIB] Use diamond operator for generic instance creation in Java code.

2016-03-09 Thread srowen
[SPARK-13702][CORE][SQL][MLLIB] Use diamond operator for generic instance creation in Java code. ## What changes were proposed in this pull request? In order to make `docs/examples` (and other related code) more simple/readable/user-friendly, this PR replaces existing codes like the

spark git commit: [SPARK-13281][CORE] Switch broadcast of RDD to exception from warning

2016-03-19 Thread srowen
Repository: spark Updated Branches: refs/heads/master 9412547e7 -> 5f6bdf97c [SPARK-13281][CORE] Switch broadcast of RDD to exception from warning ## What changes were proposed in this pull request? In SparkContext, throw Illegalargumentexception when trying to broadcast rdd directly,

spark git commit: [MINOR][ML] When trainingSummary is None, it should throw RuntimeException.

2016-03-19 Thread srowen
Repository: spark Updated Branches: refs/heads/master bb1fda01f -> 7783b6f38 [MINOR][ML] When trainingSummary is None, it should throw RuntimeException. ## What changes were proposed in this pull request? When trainingSummary is None, it should throw ```RuntimeException```. cc mengxr ## How

spark git commit: [SPARK-13823][SPARK-13397][SPARK-13395][CORE] More warnings, StandardCharset follow up

2016-03-16 Thread srowen
5 more build warnings ## How was this patch tested? Jenkins tests Author: Sean Owen <so...@cloudera.com> Closes #11725 from srowen/SPARK-13823.2. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/3b461d9e Tree: http://gi

spark git commit: [SPARK-13793][CORE] PipedRDD doesn't propagate exceptions while reading parent RDD

2016-03-16 Thread srowen
Repository: spark Updated Branches: refs/heads/master 56d88247f -> 1d95fb678 [SPARK-13793][CORE] PipedRDD doesn't propagate exceptions while reading parent RDD ## What changes were proposed in this pull request? PipedRDD creates a child thread to read output of the parent stage and feed it

spark git commit: [SPARK-13396] Stop using our internal deprecated .metrics on Exceptio…

2016-03-16 Thread srowen
Repository: spark Updated Branches: refs/heads/master 3b461d9ec -> 56d88247f [SPARK-13396] Stop using our internal deprecated .metrics on Exceptio… JIRA: https://issues.apache.org/jira/browse/SPARK-13396 Stop using our internal deprecated .metrics on ExceptionFailure instead use

spark git commit: [SPARK-13906] Ensure that there are at least 2 dispatcher threads.

2016-03-16 Thread srowen
Repository: spark Updated Branches: refs/heads/master 431a3d04b -> 05ab2948a [SPARK-13906] Ensure that there are at least 2 dispatcher threads. ## What changes were proposed in this pull request? Force at least two dispatcher-event-loop threads. Since SparkDeploySchedulerBackend (in

spark git commit: [SPARK-13889][YARN] Fix integer overflow when calculating the max number of executor failure

2016-03-16 Thread srowen
Repository: spark Updated Branches: refs/heads/master 1d95fb678 -> 496d2a2b4 [SPARK-13889][YARN] Fix integer overflow when calculating the max number of executor failure ## What changes were proposed in this pull request? The max number of executor failure before failing the application is

spark git commit: [SPARK-13810][CORE] Add Port Configuration Suggestions on Bind Exceptions

2016-03-13 Thread srowen
Repository: spark Updated Branches: refs/heads/master db88d0204 -> 515e4afbc [SPARK-13810][CORE] Add Port Configuration Suggestions on Bind Exceptions ## What changes were proposed in this pull request? Currently, when a java.net.BindException is thrown, it displays the following message:

spark git commit: [SPARK-14154][MLLIB] Simplify the implementation for Kolmogorov–Smirnov test

2016-03-29 Thread srowen
Repository: spark Updated Branches: refs/heads/master a632bb56f -> d2a819a63 [SPARK-14154][MLLIB] Simplify the implementation for Kolmogorov–Smirnov test ## What changes were proposed in this pull request? jira: https://issues.apache.org/jira/browse/SPARK-14154 I just read the code for

spark git commit: Better host description for multi-master mesos

2016-04-07 Thread srowen
Repository: spark Updated Branches: refs/heads/master 9ca0760d6 -> db75ccb55 Better host description for multi-master mesos ## What changes were proposed in this pull request? Since not having the correct zk url causes job failure, the documentation should include all parameters ## How was

spark git commit: [SPARK-14470] Allow for overriding both httpclient and httpcore versions

2016-04-08 Thread srowen
Repository: spark Updated Branches: refs/heads/master 644709801 -> 583b5e053 [SPARK-14470] Allow for overriding both httpclient and httpcore versions ## What changes were proposed in this pull request? This splits commons.httpclient.version from commons.httpcore.version, since these two

spark git commit: [SPARK-14402][HOTFIX] Fix ExpressionDescription annotation

2016-04-08 Thread srowen
Repository: spark Updated Branches: refs/heads/master 73b56a3c6 -> 644709801 [SPARK-14402][HOTFIX] Fix ExpressionDescription annotation ## What changes were proposed in this pull request? Fix for the error introduced in

spark git commit: [MINOR][ML] Fixed MLlib build warnings

2016-04-11 Thread srowen
Repository: spark Updated Branches: refs/heads/master 26d7af911 -> e9e1adc03 [MINOR][ML] Fixed MLlib build warnings ## What changes were proposed in this pull request? Fixes to eliminate warnings during package and doc builds. ## How was this patch tested? Existing unit tests Author:

spark git commit: [SPARK-14301][EXAMPLES] Java examples code merge and clean up.

2016-04-09 Thread srowen
Repository: spark Updated Branches: refs/heads/master 00288ea2a -> 72e66bb27 [SPARK-14301][EXAMPLES] Java examples code merge and clean up. ## What changes were proposed in this pull request? This fix tries to remove duplicate Java code in examples/mllib and examples/ml. The following

spark git commit: Added omitted word in error message

2016-04-06 Thread srowen
Repository: spark Updated Branches: refs/heads/master 25a4c8e0c -> 24015199f Added omitted word in error message ## What changes were proposed in this pull request? Added an omitted word in the error message displayed by the Graphx Pregel API when `maxIterations <= 0` ## How was this patch

spark git commit: [SPARK-14430][BUILD] use https while downloading binaries from build/mvn

2016-04-06 Thread srowen
Repository: spark Updated Branches: refs/heads/master 24015199f -> 5e64dab86 [SPARK-14430][BUILD] use https while downloading binaries from build/mvn ## What changes were proposed in this pull request? `./build/mvn` file was downloading binaries in non HTTPS mode. This PR tends to fix it.

spark git commit: [MINOR][DOCS] Use `spark-submit` instead of `sparkR` to submit R script.

2016-03-19 Thread srowen
Repository: spark Updated Branches: refs/heads/master 1970d911d -> 2082a4956 [MINOR][DOCS] Use `spark-submit` instead of `sparkR` to submit R script. ## What changes were proposed in this pull request? Since `sparkR` is not used for submitting R Scripts from Spark 2.0, a user faces the

spark git commit: [SPARK-13776][WEBUI] Limit the max number of acceptors and selectors for Jetty

2016-03-19 Thread srowen
Repository: spark Updated Branches: refs/heads/master 1974d1d34 -> 65b75e66e [SPARK-13776][WEBUI] Limit the max number of acceptors and selectors for Jetty ## What changes were proposed in this pull request? As each acceptor/selector in Jetty will use one thread, the number of threads

spark git commit: [MINOR][DOC] Fix nits in JavaStreamingTestExample

2016-03-19 Thread srowen
Repository: spark Updated Branches: refs/heads/master 0f1015ffd -> 53f32a22d [MINOR][DOC] Fix nits in JavaStreamingTestExample ## What changes were proposed in this pull request? Fix some nits discussed in https://github.com/apache/spark/pull/11776#issuecomment-198207419 use !rdd.isEmpty

spark git commit: [SPARK-13901][CORE] correct the logDebug information when jump to the next locality level

2016-03-19 Thread srowen
Repository: spark Updated Branches: refs/heads/master 357d82d84 -> ea9ca6f04 [SPARK-13901][CORE] correct the logDebug information when jump to the next locality level JIRA Issue:https://issues.apache.org/jira/browse/SPARK-13901 In getAllowedLocalityLevel method of TaskSetManager,we get wrong

spark git commit: [SPARK-13823][HOTFIX] Increase tryAcquire timeout and assert it succeeds to fix failure on slow machines

2016-03-19 Thread srowen
out is still here to prevent the test from hanging forever, and to detect really slow response.) The timeout is increased to a second to allow plenty of time anyway. ## How was this patch tested? Jenkins tests Author: Sean Owen <so...@cloudera.com> Closes #11763 from srowen/SPARK-13823.

[1/3] spark git commit: [SPARK-14011][CORE][SQL] Enable `LineLength` Java checkstyle rule

2016-03-21 Thread srowen
Repository: spark Updated Branches: refs/heads/master e47408814 -> 20fd25410 http://git-wip-us.apache.org/repos/asf/spark/blob/20fd2541/sql/core/src/main/java/org/apache/spark/sql/execution/vectorized/ColumnarBatch.java --

spark git commit: [MINOR][DOCS] Add proper periods and spaces for CLI help messages and `config` doc.

2016-03-21 Thread srowen
Repository: spark Updated Branches: refs/heads/master 20fd25410 -> 761c2d1b6 [MINOR][DOCS] Add proper periods and spaces for CLI help messages and `config` doc. ## What changes were proposed in this pull request? This PR adds some proper periods and spaces to Spark CLI help messages and

spark git commit: [SPARK-14028][STREAMING][KINESIS][TESTS] Remove deprecated methods; fix two other warnings

2016-03-21 Thread srowen
Repository: spark Updated Branches: refs/heads/master 761c2d1b6 -> c35c60fa9 [SPARK-14028][STREAMING][KINESIS][TESTS] Remove deprecated methods; fix two other warnings ## What changes were proposed in this pull request? - Removed two methods that has been deprecated since 1.4 - Fixed two

spark git commit: [SPARK-14035][MLLIB] Make error message more verbose for mllib NaiveBayesSuite

2016-03-23 Thread srowen
Repository: spark Updated Branches: refs/heads/master abacf5f25 -> 4d955cd69 [SPARK-14035][MLLIB] Make error message more verbose for mllib NaiveBayesSuite ## What changes were proposed in this pull request? Print more info about failed NaiveBayesSuite tests which have exhibited flakiness.

spark git commit: [SPARK-2208] Fix for local metrics tests can fail on fast machines

2016-03-24 Thread srowen
Repository: spark Updated Branches: refs/heads/master dd9ca7b96 -> 5519760e0 [SPARK-2208] Fix for local metrics tests can fail on fast machines ## What changes were proposed in this pull request? A fix for local metrics tests that can fail on fast machines. This is probably what is suggested

spark git commit: [SPARK-13019][DOCS] fix for scala-2.10 build: Replace example code in mllib-statistics.md using include_example

2016-03-24 Thread srowen
Repository: spark Updated Branches: refs/heads/master 048a7594e -> dd9ca7b96 [SPARK-13019][DOCS] fix for scala-2.10 build: Replace example code in mllib-statistics.md using include_example ## What changes were proposed in this pull request? This PR for ticket SPARK-13019 is based on

spark git commit: [SPARK-13887][PYTHON][TRIVIAL][BUILD] Make lint-python script fail fast

2016-03-25 Thread srowen
Repository: spark Updated Branches: refs/heads/master e9b6e7d85 -> 55a605763 [SPARK-13887][PYTHON][TRIVIAL][BUILD] Make lint-python script fail fast ## What changes were proposed in this pull request? Change lint python script to stop on first error rather than building them up so its

spark git commit: Revert "[SPARK-2208] Fix for local metrics tests can fail on fast machines". The test appears to still be flaky after this change, or more flaky.

2016-03-24 Thread srowen
Repository: spark Updated Branches: refs/heads/master 5519760e0 -> 342079dc4 Revert "[SPARK-2208] Fix for local metrics tests can fail on fast machines". The test appears to still be flaky after this change, or more flaky. This reverts commit 5519760e0fe7d52170b38a52ce3d670d158e2aba.

spark git commit: [SPARK-13986][CORE][MLLIB] Remove `DeveloperApi`-annotations for non-publics

2016-03-21 Thread srowen
Repository: spark Updated Branches: refs/heads/master 17a3f0067 -> df61fbd97 [SPARK-13986][CORE][MLLIB] Remove `DeveloperApi`-annotations for non-publics ## What changes were proposed in this pull request? Spark uses `DeveloperApi` annotation, but sometimes it seems to conflict with

spark git commit: [SPARK-13901][CORE] correct the logDebug information when jump to the next locality level

2016-03-20 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 765307f41 -> 1fcd17f84 [SPARK-13901][CORE] correct the logDebug information when jump to the next locality level JIRA Issue:https://issues.apache.org/jira/browse/SPARK-13901 In getAllowedLocalityLevel method of TaskSetManager,we get

spark git commit: [MINOR] Fix newly added java-lint errors

2016-03-26 Thread srowen
Repository: spark Updated Branches: refs/heads/master d23ad7c1c -> 180846585 [MINOR] Fix newly added java-lint errors ## What changes were proposed in this pull request? This PR fixes some newly added java-lint errors(unused-imports, line-lengsth). ## How was this patch tested? Pass the

spark git commit: [SPARK-14089][CORE][MLLIB] Remove methods that has been deprecated since 1.1, 1.2, 1.3, 1.4, and 1.5

2016-03-26 Thread srowen
Repository: spark Updated Branches: refs/heads/master 180846585 -> 62a85eb09 [SPARK-14089][CORE][MLLIB] Remove methods that has been deprecated since 1.1, 1.2, 1.3, 1.4, and 1.5 ## What changes were proposed in this pull request? Removed methods that has been deprecated since 1.1, 1.2, 1.3,

spark git commit: [SPARK-13973][PYSPARK] ipython notebook` is going away

2016-03-26 Thread srowen
Repository: spark Updated Branches: refs/heads/master 62a85eb09 -> a91784fb6 [SPARK-13973][PYSPARK] ipython notebook` is going away ## What changes were proposed in this pull request? https://issues.apache.org/jira/browse/SPARK-13973 ## How was this patch tested? Pyspark Author: Rekha Joshi

spark git commit: [MINOR][SQL] Fix substr/substring testcases.

2016-03-27 Thread srowen
Repository: spark Updated Branches: refs/heads/master a01b6a92b -> cfcca732b [MINOR][SQL] Fix substr/substring testcases. ## What changes were proposed in this pull request? This PR fixes the following two testcases in order to test the correct usages. ``` checkSqlGeneration("SELECT

spark git commit: [SPARK-14030][MLLIB] Add parameter check to MLLIB

2016-03-24 Thread srowen
Repository: spark Updated Branches: refs/heads/master 1803bf633 -> 048a7594e [SPARK-14030][MLLIB] Add parameter check to MLLIB ## What changes were proposed in this pull request? add parameter verification to MLLIB, like numCorrections > 0 tolerance >= 0 iters > 0 regParam >= 0 ## How was

spark git commit: Fix typo in ALS.scala

2016-03-24 Thread srowen
Repository: spark Updated Branches: refs/heads/master 01849da08 -> 1803bf633 Fix typo in ALS.scala ## What changes were proposed in this pull request? Just a typo ## How was this patch tested? N/A Author: Juarez Bochi Closes #11896 from jbochi/patch-1. Project:

spark git commit: [SPARK-13705][DOCS] UpdateStateByKey Operation documentation incorrectly refers to StatefulNetworkWordCount

2016-03-07 Thread srowen
Repository: spark Updated Branches: refs/heads/master bc7a3ec29 -> 4b13896eb [SPARK-13705][DOCS] UpdateStateByKey Operation documentation incorrectly refers to StatefulNetworkWordCount ## What changes were proposed in this pull request? The reference to StatefulNetworkWordCount.scala from

spark git commit: Fixing the type of the sentiment happiness value

2016-03-07 Thread srowen
Repository: spark Updated Branches: refs/heads/master 4b13896eb -> 03f57a6c2 Fixing the type of the sentiment happiness value ## What changes were proposed in this pull request? Added the conversion to int for the 'happiness value' read from the file. Otherwise, later on line 75 the

spark git commit: [SPARK-13705][DOCS] UpdateStateByKey Operation documentation incorrectly refers to StatefulNetworkWordCount

2016-03-07 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 704a54c55 -> 18ef2f25e [SPARK-13705][DOCS] UpdateStateByKey Operation documentation incorrectly refers to StatefulNetworkWordCount ## What changes were proposed in this pull request? The reference to StatefulNetworkWordCount.scala

spark git commit: [SPARK-13599][BUILD] remove transitive groovy dependencies from spark-hive and spark-hiveserver (branch 1.6)

2016-03-07 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 18ef2f25e -> 2434f16cc [SPARK-13599][BUILD] remove transitive groovy dependencies from spark-hive and spark-hiveserver (branch 1.6) ## What changes were proposed in this pull request? This is just the patch of #11449 cherry picked to

spark git commit: [SPARK-13117][WEB UI] WebUI should use the local ip not 0.0.0.0

2016-03-08 Thread srowen
Repository: spark Updated Branches: refs/heads/master 7771c7314 -> 9bf76ddde [SPARK-13117][WEB UI] WebUI should use the local ip not 0.0.0.0 ## What changes were proposed in this pull request? In WebUI, now Jetty Server starts with SPARK_LOCAL_IP config value if it is configured otherwise it

spark git commit: [HOT-FIX][BUILD] Use the new location of `checkstyle-suppressions.xml`

2016-03-08 Thread srowen
Repository: spark Updated Branches: refs/heads/master e52e597db -> 7771c7314 [HOT-FIX][BUILD] Use the new location of `checkstyle-suppressions.xml` ## What changes were proposed in this pull request? This PR fixes `dev/lint-java` and `mvn checkstyle:check` failures due the recent file

spark git commit: [SPARK-13715][MLLIB] Remove last usages of jblas in tests

2016-03-08 Thread srowen
nes that are being modified. Author: Sean Owen <so...@cloudera.com> Closes #11560 from srowen/SPARK-13715. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/54040f8d Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/5404

spark git commit: Fix run-tests.py typos

2016-03-02 Thread srowen
Repository: spark Updated Branches: refs/heads/master 366f26d2d -> 75e618def Fix run-tests.py typos ## What changes were proposed in this pull request? The PR fixes typos in an error message in dev/run-tests.py. Author: Wojciech Jurczyk Closes #11467 from

spark git commit: [SPARK-13515] Make FormatNumber work irrespective of locale.

2016-03-02 Thread srowen
Repository: spark Updated Branches: refs/heads/master 75e618def -> d8afd45f8 [SPARK-13515] Make FormatNumber work irrespective of locale. ## What changes were proposed in this pull request? Change in class FormatNumber to make it work irrespective of locale. ## How was this patch tested?

<    5   6   7   8   9   10   11   12   13   14   >