spark git commit: [SPARK-22060][ML] Fix CrossValidator/TrainValidationSplit param persist/load bug

2017-09-22 Thread jkbradley
Repository: spark Updated Branches: refs/heads/master 3e6a714c9 -> f180b6534 [SPARK-22060][ML] Fix CrossValidator/TrainValidationSplit param persist/load bug ## What changes were proposed in this pull request? Currently the param of CrossValidator/TrainValidationSplit persist/loading is

[2/2] spark git commit: Preparing development version 2.1.3-SNAPSHOT

2017-09-22 Thread holden
Preparing development version 2.1.3-SNAPSHOT Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/03db7214 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/03db7214 Diff:

[1/2] spark git commit: Preparing Spark release v2.1.2-rc2

2017-09-22 Thread holden
Repository: spark Updated Branches: refs/heads/branch-2.1 d930bbb40 -> 03db72149 Preparing Spark release v2.1.2-rc2 Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/fabbb7f5 Tree:

[spark] Git Push Summary

2017-09-22 Thread holden
Repository: spark Updated Tags: refs/tags/v2.1.2-rc2 [created] fabbb7f59 - To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org For additional commands, e-mail: commits-h...@spark.apache.org

spark git commit: [SPARK-21766][PYSPARK][SQL] DataFrame toPandas() raises ValueError with nullable int columns

2017-09-22 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master d2b2932d8 -> 3e6a714c9 [SPARK-21766][PYSPARK][SQL] DataFrame toPandas() raises ValueError with nullable int columns ## What changes were proposed in this pull request? When calling `DataFrame.toPandas()` (without Arrow enabled), if there

spark git commit: [SPARK-22092] Reallocation in OffHeapColumnVector.reserveInternal corrupts struct and array data

2017-09-22 Thread hvanhovell
Repository: spark Updated Branches: refs/heads/master 10e37f6eb -> d2b2932d8 [SPARK-22092] Reallocation in OffHeapColumnVector.reserveInternal corrupts struct and array data ## What changes were proposed in this pull request? `OffHeapColumnVector.reserveInternal()` will only copy already

spark git commit: [UI][STREAMING] Modify the title, 'Records' instead of 'Input Size'

2017-09-22 Thread srowen
Repository: spark Updated Branches: refs/heads/master 27fc536d9 -> 10e37f6eb [UI][STREAMING] Modify the title, 'Records' instead of 'Input Size' ## What changes were proposed in this pull request? Spark Streaming is processing data should be record, so the title should be 'Records', and

spark git commit: [SPARK-21190][PYSPARK] Python Vectorized UDFs

2017-09-22 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 8f130ad40 -> 27fc536d9 [SPARK-21190][PYSPARK] Python Vectorized UDFs This PR adds vectorized UDFs to the Python API **Proposed API** Introduce a flag to turn on vectorization for a defined UDF, for example: ``` pandas_udf(DoubleType())

spark git commit: [SPARK-22072][SPARK-22071][BUILD] Improve release build scripts

2017-09-22 Thread holden
Repository: spark Updated Branches: refs/heads/branch-2.2 090b987e6 -> de6274a58 [SPARK-22072][SPARK-22071][BUILD] Improve release build scripts ## What changes were proposed in this pull request? Check JDK version (with javac) and use SPARK_VERSION for publish-release ## How was this patch

spark git commit: [SPARK-22072][SPARK-22071][BUILD] Improve release build scripts

2017-09-22 Thread holden
Repository: spark Updated Branches: refs/heads/branch-2.1 1a4b6eea8 -> d930bbb40 [SPARK-22072][SPARK-22071][BUILD] Improve release build scripts ## What changes were proposed in this pull request? Check JDK version (with javac) and use SPARK_VERSION for publish-release ## How was this patch

spark git commit: [SPARK-22072][SPARK-22071][BUILD] Improve release build scripts

2017-09-22 Thread holden
Repository: spark Updated Branches: refs/heads/master 5960686e7 -> 8f130ad40 [SPARK-22072][SPARK-22071][BUILD] Improve release build scripts ## What changes were proposed in this pull request? Check JDK version (with javac) and use SPARK_VERSION for publish-release ## How was this patch

spark git commit: [SPARK-21998][SQL] SortMergeJoinExec did not calculate its outputOrdering correctly during physical planning

2017-09-22 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 5ac96854c -> 5960686e7 [SPARK-21998][SQL] SortMergeJoinExec did not calculate its outputOrdering correctly during physical planning ## What changes were proposed in this pull request? Right now the calculation of SortMergeJoinExec's