spark git commit: [SPARK-23245][SS][TESTS] Don't access `lastExecution.executedPlan` in StreamTest

2018-01-26 Thread zsxwing
Repository: spark Updated Branches: refs/heads/branch-2.3 234c854bd -> 65600bfdb [SPARK-23245][SS][TESTS] Don't access `lastExecution.executedPlan` in StreamTest ## What changes were proposed in this pull request? `lastExecution.executedPlan` is lazy val so accessing it in StreamTest may

spark git commit: [SPARK-23245][SS][TESTS] Don't access `lastExecution.executedPlan` in StreamTest

2018-01-26 Thread zsxwing
Repository: spark Updated Branches: refs/heads/master e7bc9f052 -> 6328868e5 [SPARK-23245][SS][TESTS] Don't access `lastExecution.executedPlan` in StreamTest ## What changes were proposed in this pull request? `lastExecution.executedPlan` is lazy val so accessing it in StreamTest may need

spark git commit: [MINOR][SS][DOC] Fix `Trigger` Scala/Java doc examples

2018-01-26 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.3 20c0efe48 -> 234c854bd [MINOR][SS][DOC] Fix `Trigger` Scala/Java doc examples ## What changes were proposed in this pull request? This PR fixes Scala/Java doc examples in `Trigger.java`. ## How was this patch tested? N/A. Author:

spark git commit: [MINOR][SS][DOC] Fix `Trigger` Scala/Java doc examples

2018-01-26 Thread srowen
Repository: spark Updated Branches: refs/heads/master 5b5447c68 -> e7bc9f052 [MINOR][SS][DOC] Fix `Trigger` Scala/Java doc examples ## What changes were proposed in this pull request? This PR fixes Scala/Java doc examples in `Trigger.java`. ## How was this patch tested? N/A. Author:

spark git commit: [SPARK-23214][SQL] cached data should not carry extra hint info

2018-01-26 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 7aaf23cf8 -> 20c0efe48 [SPARK-23214][SQL] cached data should not carry extra hint info ## What changes were proposed in this pull request? This is a regression introduced by https://github.com/apache/spark/pull/19864 When we lookup

spark git commit: [SPARK-23214][SQL] cached data should not carry extra hint info

2018-01-26 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 073744985 -> 5b5447c68 [SPARK-23214][SQL] cached data should not carry extra hint info ## What changes were proposed in this pull request? This is a regression introduced by https://github.com/apache/spark/pull/19864 When we lookup

spark git commit: [SPARK-23242][SS][TESTS] Don't run tests in KafkaSourceSuiteBase twice

2018-01-26 Thread zsxwing
Repository: spark Updated Branches: refs/heads/branch-2.3 30d16e116 -> 7aaf23cf8 [SPARK-23242][SS][TESTS] Don't run tests in KafkaSourceSuiteBase twice ## What changes were proposed in this pull request? KafkaSourceSuiteBase should be abstract class, otherwise KafkaSourceSuiteBase will also

spark git commit: [SPARK-23242][SS][TESTS] Don't run tests in KafkaSourceSuiteBase twice

2018-01-26 Thread zsxwing
Repository: spark Updated Branches: refs/heads/master 94c67a76e -> 073744985 [SPARK-23242][SS][TESTS] Don't run tests in KafkaSourceSuiteBase twice ## What changes were proposed in this pull request? KafkaSourceSuiteBase should be abstract class, otherwise KafkaSourceSuiteBase will also

spark git commit: [SPARK-23207][SQL] Shuffle+Repartition on a DataFrame could lead to incorrect answers

2018-01-26 Thread sameerag
Repository: spark Updated Branches: refs/heads/branch-2.3 f5911d489 -> 30d16e116 [SPARK-23207][SQL] Shuffle+Repartition on a DataFrame could lead to incorrect answers ## What changes were proposed in this pull request? Currently shuffle repartition uses RoundRobinPartitioning, the generated

spark git commit: [SPARK-23207][SQL] Shuffle+Repartition on a DataFrame could lead to incorrect answers

2018-01-26 Thread sameerag
Repository: spark Updated Branches: refs/heads/master a8a3e9b7c -> 94c67a76e [SPARK-23207][SQL] Shuffle+Repartition on a DataFrame could lead to incorrect answers ## What changes were proposed in this pull request? Currently shuffle repartition uses RoundRobinPartitioning, the generated

svn commit: r24477 - in /dev/spark/2.3.1-SNAPSHOT-2018_01_26_14_10-f5911d4-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-01-26 Thread pwendell
Author: pwendell Date: Fri Jan 26 22:23:43 2018 New Revision: 24477 Log: Apache Spark 2.3.1-SNAPSHOT-2018_01_26_14_10-f5911d4 docs [This commit notification would consist of 1441 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

spark git commit: Revert "[SPARK-22797][PYSPARK] Bucketizer support multi-column"

2018-01-26 Thread mlnick
Repository: spark Updated Branches: refs/heads/master dd8e257d1 -> a8a3e9b7c Revert "[SPARK-22797][PYSPARK] Bucketizer support multi-column" This reverts commit c22eaa94e85aaac649566495dcf763a5de3c8d06. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit:

spark git commit: Revert "[SPARK-22797][PYSPARK] Bucketizer support multi-column"

2018-01-26 Thread mlnick
Repository: spark Updated Branches: refs/heads/branch-2.3 ca3613be2 -> f5911d489 Revert "[SPARK-22797][PYSPARK] Bucketizer support multi-column" This reverts commit ab1b5d921b395cb7df3a3a2c4a7e5778d98e6f01. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit:

svn commit: r24473 - in /dev/spark/2.4.0-SNAPSHOT-2018_01_26_12_01-dd8e257-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-01-26 Thread pwendell
Author: pwendell Date: Fri Jan 26 20:15:18 2018 New Revision: 24473 Log: Apache Spark 2.4.0-SNAPSHOT-2018_01_26_12_01-dd8e257 docs [This commit notification would consist of 1442 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

svn commit: r24472 - in /dev/spark/2.3.1-SNAPSHOT-2018_01_26_10_01-ca3613b-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-01-26 Thread pwendell
Author: pwendell Date: Fri Jan 26 18:15:01 2018 New Revision: 24472 Log: Apache Spark 2.3.1-SNAPSHOT-2018_01_26_10_01-ca3613b docs [This commit notification would consist of 1441 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

spark git commit: [SPARK-23218][SQL] simplify ColumnVector.getArray

2018-01-26 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 3e2525147 -> dd8e257d1 [SPARK-23218][SQL] simplify ColumnVector.getArray ## What changes were proposed in this pull request? `ColumnVector` is very flexible about how to implement array type. As a result `ColumnVector` has 3 abstract

spark git commit: [SPARK-23218][SQL] simplify ColumnVector.getArray

2018-01-26 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 ab1b5d921 -> ca3613be2 [SPARK-23218][SQL] simplify ColumnVector.getArray ## What changes were proposed in this pull request? `ColumnVector` is very flexible about how to implement array type. As a result `ColumnVector` has 3 abstract

svn commit: r24466 - in /dev/spark/2.3.1-SNAPSHOT-2018_01_26_06_01-ab1b5d9-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-01-26 Thread pwendell
Author: pwendell Date: Fri Jan 26 14:15:11 2018 New Revision: 24466 Log: Apache Spark 2.3.1-SNAPSHOT-2018_01_26_06_01-ab1b5d9 docs [This commit notification would consist of 1442 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

svn commit: r24465 - in /dev/spark/2.4.0-SNAPSHOT-2018_01_26_04_01-3e25251-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-01-26 Thread pwendell
Author: pwendell Date: Fri Jan 26 12:20:03 2018 New Revision: 24465 Log: Apache Spark 2.4.0-SNAPSHOT-2018_01_26_04_01-3e25251 docs [This commit notification would consist of 1443 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

spark git commit: [SPARK-22068][CORE] Reduce the duplicate code between putIteratorAsValues and putIteratorAsBytes

2018-01-26 Thread wenchen
Repository: spark Updated Branches: refs/heads/master c22eaa94e -> 3e2525147 [SPARK-22068][CORE] Reduce the duplicate code between putIteratorAsValues and putIteratorAsBytes ## What changes were proposed in this pull request? The code logic between `MemoryStore.putIteratorAsValues` and

spark git commit: [SPARK-22797][PYSPARK] Bucketizer support multi-column

2018-01-26 Thread mlnick
Repository: spark Updated Branches: refs/heads/branch-2.3 d6cdc699e -> ab1b5d921 [SPARK-22797][PYSPARK] Bucketizer support multi-column ## What changes were proposed in this pull request? Bucketizer support multi-column in the python side ## How was this patch tested? existing tests and

spark git commit: [SPARK-22797][PYSPARK] Bucketizer support multi-column

2018-01-26 Thread mlnick
Repository: spark Updated Branches: refs/heads/master cd3956df0 -> c22eaa94e [SPARK-22797][PYSPARK] Bucketizer support multi-column ## What changes were proposed in this pull request? Bucketizer support multi-column in the python side ## How was this patch tested? existing tests and added

spark git commit: [SPARK-22799][ML] Bucketizer should throw exception if single- and multi-column params are both set

2018-01-26 Thread mlnick
Repository: spark Updated Branches: refs/heads/branch-2.3 fdf140e25 -> d6cdc699e [SPARK-22799][ML] Bucketizer should throw exception if single- and multi-column params are both set ## What changes were proposed in this pull request? Currently there is a mixed situation when both single- and

spark git commit: [SPARK-22799][ML] Bucketizer should throw exception if single- and multi-column params are both set

2018-01-26 Thread mlnick
Repository: spark Updated Branches: refs/heads/master d1721816d -> cd3956df0 [SPARK-22799][ML] Bucketizer should throw exception if single- and multi-column params are both set ## What changes were proposed in this pull request? Currently there is a mixed situation when both single- and

svn commit: r24458 - in /dev/spark/2.4.0-SNAPSHOT-2018_01_26_00_01-d172181-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-01-26 Thread pwendell
Author: pwendell Date: Fri Jan 26 08:15:20 2018 New Revision: 24458 Log: Apache Spark 2.4.0-SNAPSHOT-2018_01_26_00_01-d172181 docs [This commit notification would consist of 1443 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]