spark git commit: [SPARK-18845][GRAPHX] PageRank has incorrect initialization value that leads to slow convergence

2016-12-15 Thread ankurdave
Repository: spark Updated Branches: refs/heads/master 172a52f5d -> 78062b852 [SPARK-18845][GRAPHX] PageRank has incorrect initialization value that leads to slow convergence ## What changes were proposed in this pull request? Change the initial value in all PageRank implementations to be

spark git commit: [SPARK-18892][SQL] Alias percentile_approx approx_percentile

2016-12-15 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.1 483624c2e -> d8548c8a7 [SPARK-18892][SQL] Alias percentile_approx approx_percentile ## What changes were proposed in this pull request? percentile_approx is the name used in Hive, and approx_percentile is the name used in Presto.

spark git commit: [SPARK-18892][SQL] Alias percentile_approx approx_percentile

2016-12-15 Thread rxin
Repository: spark Updated Branches: refs/heads/master 5a44f18a2 -> 172a52f5d [SPARK-18892][SQL] Alias percentile_approx approx_percentile ## What changes were proposed in this pull request? percentile_approx is the name used in Hive, and approx_percentile is the name used in Presto.

[1/2] spark git commit: Preparing Spark release v2.1.0-rc5

2016-12-15 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-2.1 b23220fa6 -> 483624c2e Preparing Spark release v2.1.0-rc5 Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/cd0a0836 Tree:

[2/2] spark git commit: Preparing development version 2.1.1-SNAPSHOT

2016-12-15 Thread pwendell
Preparing development version 2.1.1-SNAPSHOT Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/483624c2 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/483624c2 Diff:

[spark] Git Push Summary

2016-12-15 Thread pwendell
Repository: spark Updated Tags: refs/tags/v2.1.0-rc5 [created] cd0a08361 - To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org For additional commands, e-mail: commits-h...@spark.apache.org

spark git commit: [MINOR] Handle fact that mv is different on linux, mac

2016-12-15 Thread shivaram
Repository: spark Updated Branches: refs/heads/branch-2.1 62a6577bf -> b23220fa6 [MINOR] Handle fact that mv is different on linux, mac Follow up to https://github.com/apache/spark/commit/ae853e8f3bdbd16427e6f1ffade4f63abaf74abb as `mv` throws an error on the Jenkins machines if source and

spark git commit: [MINOR] Handle fact that mv is different on linux, mac

2016-12-15 Thread shivaram
Repository: spark Updated Branches: refs/heads/master 9634018c4 -> 5a44f18a2 [MINOR] Handle fact that mv is different on linux, mac Follow up to https://github.com/apache/spark/commit/ae853e8f3bdbd16427e6f1ffade4f63abaf74abb as `mv` throws an error on the Jenkins machines if source and

[2/2] spark git commit: Preparing development version 2.1.1-SNAPSHOT

2016-12-15 Thread pwendell
Preparing development version 2.1.1-SNAPSHOT Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/62a6577b Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/62a6577b Diff:

[spark] Git Push Summary

2016-12-15 Thread pwendell
Repository: spark Updated Tags: refs/tags/v2.1.0-rc4 [created] ec3172658 - To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org For additional commands, e-mail: commits-h...@spark.apache.org

[1/2] spark git commit: Preparing Spark release v2.1.0-rc4

2016-12-15 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-2.1 ae853e8f3 -> 62a6577bf Preparing Spark release v2.1.0-rc4 Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/ec317265 Tree:

spark git commit: [MINOR] Only rename SparkR tar.gz if names mismatch

2016-12-15 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.1 08e427287 -> ae853e8f3 [MINOR] Only rename SparkR tar.gz if names mismatch ## What changes were proposed in this pull request? For release builds the R_PACKAGE_VERSION and VERSION are the same (e.g., 2.1.0). Thus `cp` throws an error

spark git commit: [MINOR] Only rename SparkR tar.gz if names mismatch

2016-12-15 Thread rxin
Repository: spark Updated Branches: refs/heads/master 9c7f83b02 -> 9634018c4 [MINOR] Only rename SparkR tar.gz if names mismatch ## What changes were proposed in this pull request? For release builds the R_PACKAGE_VERSION and VERSION are the same (e.g., 2.1.0). Thus `cp` throws an error

spark git commit: [SPARK-18868][FLAKY-TEST] Deflake StreamingQueryListenerSuite: single listener, check trigger...

2016-12-15 Thread zsxwing
Repository: spark Updated Branches: refs/heads/master 32ff96452 -> 9c7f83b02 [SPARK-18868][FLAKY-TEST] Deflake StreamingQueryListenerSuite: single listener, check trigger... ## What changes were proposed in this pull request? Use `recentProgress` instead of `lastProgress` and filter out

spark git commit: [SPARK-18868][FLAKY-TEST] Deflake StreamingQueryListenerSuite: single listener, check trigger...

2016-12-15 Thread zsxwing
Repository: spark Updated Branches: refs/heads/branch-2.1 a7364a82e -> 08e427287 [SPARK-18868][FLAKY-TEST] Deflake StreamingQueryListenerSuite: single listener, check trigger... ## What changes were proposed in this pull request? Use `recentProgress` instead of `lastProgress` and filter out

spark git commit: [SPARK-8425][SCHEDULER][HOTFIX] fix scala 2.10 compile error

2016-12-15 Thread rxin
Repository: spark Updated Branches: refs/heads/master 0917c8ee0 -> 32ff96452 [SPARK-8425][SCHEDULER][HOTFIX] fix scala 2.10 compile error ## What changes were proposed in this pull request? https://github.com/apache/spark/commit/93cdb8a7d0f124b4db069fd8242207c82e263c52 Introduced a compile

[2/2] spark git commit: Preparing development version 2.1.1-SNAPSHOT

2016-12-15 Thread pwendell
Preparing development version 2.1.1-SNAPSHOT Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/a7364a82 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/a7364a82 Diff:

[1/2] spark git commit: Preparing Spark release v2.1.0-rc3

2016-12-15 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-2.1 b6a81f472 -> a7364a82e Preparing Spark release v2.1.0-rc3 Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/ef2ccf94 Tree:

[spark] Git Push Summary

2016-12-15 Thread pwendell
Repository: spark Updated Tags: refs/tags/v2.1.0-rc3 [created] ef2ccf942 - To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org For additional commands, e-mail: commits-h...@spark.apache.org

spark git commit: [SPARK-18888] partitionBy in DataStreamWriter in Python throws _to_seq not defined

2016-12-15 Thread tdas
Repository: spark Updated Branches: refs/heads/branch-2.1 900ce558a -> b6a81f472 [SPARK-1] partitionBy in DataStreamWriter in Python throws _to_seq not defined ## What changes were proposed in this pull request? `_to_seq` wasn't imported. ## How was this patch tested? Added

spark git commit: [SPARK-18888] partitionBy in DataStreamWriter in Python throws _to_seq not defined

2016-12-15 Thread tdas
Repository: spark Updated Branches: refs/heads/master 68a6dc974 -> 0917c8ee0 [SPARK-1] partitionBy in DataStreamWriter in Python throws _to_seq not defined ## What changes were proposed in this pull request? `_to_seq` wasn't imported. ## How was this patch tested? Added partitionBy to

spark git commit: [SPARK-18826][SS] Add 'latestFirst' option to FileStreamSource

2016-12-15 Thread tdas
Repository: spark Updated Branches: refs/heads/master 4f7292c87 -> 68a6dc974 [SPARK-18826][SS] Add 'latestFirst' option to FileStreamSource ## What changes were proposed in this pull request? When starting a stream with a lot of backfill and maxFilesPerTrigger, the user could often want to

spark git commit: [SPARK-18826][SS] Add 'latestFirst' option to FileStreamSource

2016-12-15 Thread tdas
Repository: spark Updated Branches: refs/heads/branch-2.1 e430915fa -> 900ce558a [SPARK-18826][SS] Add 'latestFirst' option to FileStreamSource ## What changes were proposed in this pull request? When starting a stream with a lot of backfill and maxFilesPerTrigger, the user could often want

spark git commit: [SPARK-18870] Disallowed Distinct Aggregations on Streaming Datasets

2016-12-15 Thread tdas
Repository: spark Updated Branches: refs/heads/branch-2.1 2a8de2e11 -> e430915fa [SPARK-18870] Disallowed Distinct Aggregations on Streaming Datasets ## What changes were proposed in this pull request? Check whether Aggregation operators on a streaming subplan have aggregate expressions

spark git commit: [SPARK-18870] Disallowed Distinct Aggregations on Streaming Datasets

2016-12-15 Thread tdas
Repository: spark Updated Branches: refs/heads/master 01e14bf30 -> 4f7292c87 [SPARK-18870] Disallowed Distinct Aggregations on Streaming Datasets ## What changes were proposed in this pull request? Check whether Aggregation operators on a streaming subplan have aggregate expressions with

spark git commit: [SPARK-17910][SQL] Allow users to update the comment of a column

2016-12-15 Thread rxin
Repository: spark Updated Branches: refs/heads/master 93cdb8a7d -> 01e14bf30 [SPARK-17910][SQL] Allow users to update the comment of a column ## What changes were proposed in this pull request? Right now, once a user set the comment of a column with create table command, he/she cannot