[2/2] spark git commit: Preparing development version 2.0.0-SNAPSHOT

2016-05-17 Thread pwendell
Preparing development version 2.0.0-SNAPSHOT Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/b5450091 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/b5450091 Diff:

[spark] Git Push Summary

2016-05-17 Thread pwendell
Repository: spark Updated Tags: refs/tags/2.0.0-preview [created] 8f5a04b62

[1/2] spark git commit: Preparing Spark release 2.0.0-preview

2016-05-17 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-2.0 c8be3da66 -> b54500913 Preparing Spark release 2.0.0-preview Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/8f5a04b6 Tree:

spark git commit: Prepare branch for 2.0.0-preview.

2016-05-17 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 5f5270ead -> c8be3da66 Prepare branch for 2.0.0-preview. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/c8be3da6 Tree:

[3/3] spark git commit: [SPARK-15171][SQL] Remove the references to deprecated method dataset.registerTempTable

2016-05-17 Thread lian
[SPARK-15171][SQL] Remove the references to deprecated method dataset.registerTempTable ## What changes were proposed in this pull request? Update the unit test code, examples, and documents to remove calls to deprecated method `dataset.registerTempTable`. ## How was this patch tested? This

[2/3] spark git commit: [SPARK-15171][SQL] Remove the references to deprecated method dataset.registerTempTable

2016-05-17 Thread lian
http://git-wip-us.apache.org/repos/asf/spark/blob/5f5270ea/sql/core/src/test/scala/org/apache/spark/sql/execution/PlannerSuite.scala -- diff --git a/sql/core/src/test/scala/org/apache/spark/sql/execution/PlannerSuite.scala

spark git commit: [SPARK-14346] Fix scala-2.10 build

2016-05-17 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-2.0 2dddec40d -> 1db37417c [SPARK-14346] Fix scala-2.10 build ## What changes were proposed in this pull request? Scala 2.10 build was broken by #13079. I am reverting the change of that line. Author: Yin Huai

spark git commit: [SPARK-14346] Fix scala-2.10 build

2016-05-17 Thread yhuai
Repository: spark Updated Branches: refs/heads/master 25b315e6c -> 2a5db9c14 [SPARK-14346] Fix scala-2.10 build ## What changes were proposed in this pull request? Scala 2.10 build was broken by #13079. I am reverting the change of that line. Author: Yin Huai Closes

[2/3] spark git commit: [SPARK-15171][SQL] Remove the references to deprecated method dataset.registerTempTable

2016-05-17 Thread lian
http://git-wip-us.apache.org/repos/asf/spark/blob/25b315e6/sql/core/src/test/scala/org/apache/spark/sql/execution/PlannerSuite.scala -- diff --git a/sql/core/src/test/scala/org/apache/spark/sql/execution/PlannerSuite.scala

[3/3] spark git commit: [SPARK-15171][SQL] Remove the references to deprecated method dataset.registerTempTable

2016-05-17 Thread lian
[SPARK-15171][SQL] Remove the references to deprecated method dataset.registerTempTable ## What changes were proposed in this pull request? Update the unit test code, examples, and documents to remove calls to deprecated method `dataset.registerTempTable`. ## How was this patch tested? This

[1/3] spark git commit: [SPARK-15171][SQL] Remove the references to deprecated method dataset.registerTempTable

2016-05-17 Thread lian
Repository: spark Updated Branches: refs/heads/master b674e67c2 -> 25b315e6c http://git-wip-us.apache.org/repos/asf/spark/blob/25b315e6/sql/hive/src/test/scala/org/apache/spark/sql/hive/execution/SQLQuerySuite.scala -- diff

spark git commit: [SPARK-14346][SQL] Native SHOW CREATE TABLE for Hive tables/views

2016-05-17 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-2.0 7b62b7c11 -> 2dddec40d [SPARK-14346][SQL] Native SHOW CREATE TABLE for Hive tables/views ## What changes were proposed in this pull request? This is a follow-up of #12781. It adds native `SHOW CREATE TABLE` support for Hive tables

spark git commit: [SPARK-14346][SQL] Native SHOW CREATE TABLE for Hive tables/views

2016-05-17 Thread yhuai
Repository: spark Updated Branches: refs/heads/master 8e8bc9f95 -> b674e67c2 [SPARK-14346][SQL] Native SHOW CREATE TABLE for Hive tables/views ## What changes were proposed in this pull request? This is a follow-up of #12781. It adds native `SHOW CREATE TABLE` support for Hive tables and

spark git commit: [SPARK-11735][CORE][SQL] Add a check in the constructor of SQLContext/SparkSession to make sure its SparkContext is not stopped

2016-05-17 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-2.0 c0bb77132 -> 7b62b7c11 [SPARK-11735][CORE][SQL] Add a check in the constructor of SQLContext/SparkSession to make sure its SparkContext is not stopped ## What changes were proposed in this pull request? Add a check in the constructor

spark git commit: [SPARK-11735][CORE][SQL] Add a check in the constructor of SQLContext/SparkSession to make sure its SparkContext is not stopped

2016-05-17 Thread yhuai
Repository: spark Updated Branches: refs/heads/master 0f576a574 -> 8e8bc9f95 [SPARK-11735][CORE][SQL] Add a check in the constructor of SQLContext/SparkSession to make sure its SparkContext is not stopped ## What changes were proposed in this pull request? Add a check in the constructor of
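
The check described in SPARK-11735 follows a fail-fast pattern: reject a stopped context when the session is constructed rather than at first use. Below is a minimal sketch of that idea, not the actual Spark code. `Context`, `Session`, and `StoppedContextError` are hypothetical stand-ins invented for illustration; they are not Spark classes.

```python
class StoppedContextError(RuntimeError):
    """Hypothetical error: a session was built on an already-stopped context."""


class Context:
    """Hypothetical stand-in for SparkContext; it only tracks a stopped flag."""

    def __init__(self):
        self.stopped = False

    def stop(self):
        self.stopped = True


class Session:
    """Hypothetical stand-in for SQLContext/SparkSession."""

    def __init__(self, context):
        # Fail fast in the constructor instead of surfacing an
        # obscure error later, when the session is first used.
        if context.stopped:
            raise StoppedContextError(
                "Cannot create a session on a stopped context"
            )
        self.context = context
```

With this sketch, constructing `Session(ctx)` succeeds while `ctx` is live and raises `StoppedContextError` once `ctx.stop()` has been called.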

spark git commit: [SPARK-15244] [PYTHON] Type of column name created with createDataFrame is not consistent.

2016-05-17 Thread davies
Repository: spark Updated Branches: refs/heads/master e2efe0529 -> 0f576a574 [SPARK-15244] [PYTHON] Type of column name created with createDataFrame is not consistent. ## What changes were proposed in this pull request? **createDataFrame** returns inconsistent types for column names.

spark git commit: [SPARK-15244] [PYTHON] Type of column name created with createDataFrame is not consistent.

2016-05-17 Thread davies
Repository: spark Updated Branches: refs/heads/branch-2.0 ff1cfce18 -> c0bb77132 [SPARK-15244] [PYTHON] Type of column name created with createDataFrame is not consistent. ## What changes were proposed in this pull request? **createDataFrame** returns inconsistent types for column names.

[3/4] spark git commit: [SPARK-14615][ML] Use the new ML Vector and Matrix in the ML pipeline based algorithms

2016-05-17 Thread meng
http://git-wip-us.apache.org/repos/asf/spark/blob/ff1cfce1/mllib/src/main/scala/org/apache/spark/ml/feature/StandardScaler.scala -- diff --git a/mllib/src/main/scala/org/apache/spark/ml/feature/StandardScaler.scala

[4/4] spark git commit: [SPARK-14615][ML] Use the new ML Vector and Matrix in the ML pipeline based algorithms

2016-05-17 Thread meng
[SPARK-14615][ML] Use the new ML Vector and Matrix in the ML pipeline based algorithms ## What changes were proposed in this pull request? Once SPARK-14487 and SPARK-14549 are merged, we will migrate to use the new vector and matrix type in the new ml pipeline based apis. ## How was this

[2/4] spark git commit: [SPARK-14615][ML] Use the new ML Vector and Matrix in the ML pipeline based algorithms

2016-05-17 Thread meng
http://git-wip-us.apache.org/repos/asf/spark/blob/ff1cfce1/mllib/src/test/scala/org/apache/spark/ml/classification/LogisticRegressionSuite.scala -- diff --git

[3/4] spark git commit: [SPARK-14615][ML] Use the new ML Vector and Matrix in the ML pipeline based algorithms

2016-05-17 Thread meng
http://git-wip-us.apache.org/repos/asf/spark/blob/e2efe052/mllib/src/main/scala/org/apache/spark/ml/feature/StandardScaler.scala -- diff --git a/mllib/src/main/scala/org/apache/spark/ml/feature/StandardScaler.scala

[1/4] spark git commit: [SPARK-14615][ML] Use the new ML Vector and Matrix in the ML pipeline based algorithms

2016-05-17 Thread meng
Repository: spark Updated Branches: refs/heads/master 9f176dd39 -> e2efe0529 http://git-wip-us.apache.org/repos/asf/spark/blob/e2efe052/mllib/src/test/scala/org/apache/spark/mllib/tree/DecisionTreeSuite.scala -- diff --git

[4/4] spark git commit: [SPARK-14615][ML] Use the new ML Vector and Matrix in the ML pipeline based algorithms

2016-05-17 Thread meng
[SPARK-14615][ML] Use the new ML Vector and Matrix in the ML pipeline based algorithms ## What changes were proposed in this pull request? Once SPARK-14487 and SPARK-14549 are merged, we will migrate to use the new vector and matrix type in the new ml pipeline based apis. ## How was this

[2/4] spark git commit: [SPARK-14615][ML] Use the new ML Vector and Matrix in the ML pipeline based algorithms

2016-05-17 Thread meng
http://git-wip-us.apache.org/repos/asf/spark/blob/e2efe052/mllib/src/test/scala/org/apache/spark/ml/classification/LogisticRegressionSuite.scala -- diff --git

spark git commit: [MINOR][DOCS] Replace remaining 'sqlContext' in ScalaDoc/JavaDoc.

2016-05-17 Thread mlnick
Repository: spark Updated Branches: refs/heads/branch-2.0 025b3e9f1 -> 1ad3bbd0a [MINOR][DOCS] Replace remaining 'sqlContext' in ScalaDoc/JavaDoc. ## What changes were proposed in this pull request? According to the recent change, this PR replaces all the remaining `sqlContext` usage with

spark git commit: [MINOR][DOCS] Replace remaining 'sqlContext' in ScalaDoc/JavaDoc.

2016-05-17 Thread mlnick
Repository: spark Updated Branches: refs/heads/master 3308a862b -> 9f176dd39 [MINOR][DOCS] Replace remaining 'sqlContext' in ScalaDoc/JavaDoc. ## What changes were proposed in this pull request? According to the recent change, this PR replaces all the remaining `sqlContext` usage with

spark git commit: [SPARK-10216][SQL] Avoid creating empty files during overwriting with group by query

2016-05-17 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-2.0 adc1c2685 -> af37bdd3a [SPARK-10216][SQL] Avoid creating empty files during overwriting with group by query ## What changes were proposed in this pull request? Currently, `INSERT INTO` with `GROUP BY` query tries to make at least 200

spark git commit: [SPARK-10216][SQL] Avoid creating empty files during overwriting with group by query

2016-05-17 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 20a89478e -> 8d05a7a98 [SPARK-10216][SQL] Avoid creating empty files during overwriting with group by query ## What changes were proposed in this pull request? Currently, `INSERT INTO` with `GROUP BY` query tries to make at least 200

spark git commit: [SPARK-14346][SQL][FOLLOW-UP] add tests for CREATE TABLE USING with partition and bucket

2016-05-17 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-2.0 110876b9a -> adc1c2685 [SPARK-14346][SQL][FOLLOW-UP] add tests for CREATE TABLE USING with partition and bucket ## What changes were proposed in this pull request? https://github.com/apache/spark/pull/12781 introduced PARTITIONED BY,

spark git commit: [SPARK-15165] [SQL] Codegen can break because toCommentSafeString is not actually safe

2016-05-17 Thread davies
Repository: spark Updated Branches: refs/heads/branch-2.0 670f48222 -> 110876b9a [SPARK-15165] [SQL] Codegen can break because toCommentSafeString is not actually safe ## What changes were proposed in this pull request? toCommentSafeString method replaces "\u" with "\\u" to avoid codegen

spark git commit: [SPARK-15318][ML][EXAMPLE] spark.ml Collaborative Filtering example does not work in spark-shell

2016-05-17 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 273f3d052 -> 670f48222 [SPARK-15318][ML][EXAMPLE] spark.ml Collaborative Filtering example does not work in spark-shell ## What changes were proposed in this pull request? copy & paste

spark git commit: [SPARK-15318][ML][EXAMPLE] spark.ml Collaborative Filtering example does not work in spark-shell

2016-05-17 Thread srowen
Repository: spark Updated Branches: refs/heads/master 932d80029 -> bebe5f981 [SPARK-15318][ML][EXAMPLE] spark.ml Collaborative Filtering example does not work in spark-shell ## What changes were proposed in this pull request? copy & paste

spark git commit: [SPARK-15333][DOCS] Reorganize building-spark.md; rationalize vs wiki

2016-05-17 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 b031ea7dc -> 273f3d052 [SPARK-15333][DOCS] Reorganize building-spark.md; rationalize vs wiki ## What changes were proposed in this pull request? See JIRA for the motivation. The changes are almost entirely movement of text and edits

spark git commit: [SPARK-15333][DOCS] Reorganize building-spark.md; rationalize vs wiki

2016-05-17 Thread srowen
Repository: spark Updated Branches: refs/heads/master 4134ff0c6 -> 932d80029 [SPARK-15333][DOCS] Reorganize building-spark.md; rationalize vs wiki ## What changes were proposed in this pull request? See JIRA for the motivation. The changes are almost entirely movement of text and edits to

spark git commit: [SPARK-14434][ML] User guide doc and examples for GaussianMixture in spark.ml

2016-05-17 Thread mlnick
Repository: spark Updated Branches: refs/heads/branch-2.0 c0bcecf91 -> b031ea7dc [SPARK-14434][ML] User guide doc and examples for GaussianMixture in spark.ml ## What changes were proposed in this pull request? Add guide doc and examples for

spark git commit: [SPARK-14434][ML] User guide doc and examples for GaussianMixture in spark.ml

2016-05-17 Thread mlnick
Repository: spark Updated Branches: refs/heads/master c36ca651f -> 4134ff0c6 [SPARK-14434][ML] User guide doc and examples for GaussianMixture in spark.ml ## What changes were proposed in this pull request? Add guide doc and examples for

spark git commit: [SPARK-15351][SQL] RowEncoder should support array as the external type for ArrayType

2016-05-17 Thread wenchen
Repository: spark Updated Branches: refs/heads/branch-2.0 1426235bf -> c0bcecf91 [SPARK-15351][SQL] RowEncoder should support array as the external type for ArrayType ## What changes were proposed in this pull request? This PR improves `RowEncoder` and `MapObjects`, to support array as the

spark git commit: [SPARK-15351][SQL] RowEncoder should support array as the external type for ArrayType

2016-05-17 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 122302cbf -> c36ca651f [SPARK-15351][SQL] RowEncoder should support array as the external type for ArrayType ## What changes were proposed in this pull request? This PR improves `RowEncoder` and `MapObjects`, to support array as the

spark git commit: [SPARK-15290][BUILD] Move annotations, like @Since / @DeveloperApi, into spark-tags

2016-05-17 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 6d10b2826 -> 1426235bf [SPARK-15290][BUILD] Move annotations, like @Since / @DeveloperApi, into spark-tags ## What changes were proposed in this pull request? (See https://github.com/apache/spark/pull/12416 where most of this was

spark git commit: [SPARK-15290][BUILD] Move annotations, like @Since / @DeveloperApi, into spark-tags

2016-05-17 Thread srowen
Repository: spark Updated Branches: refs/heads/master 8ad9f08c9 -> 122302cbf [SPARK-15290][BUILD] Move annotations, like @Since / @DeveloperApi, into spark-tags ## What changes were proposed in this pull request? (See https://github.com/apache/spark/pull/12416 where most of this was already

spark git commit: [SPARK-12972][CORE][TEST-MAVEN][TEST-HADOOP2.2] Update org.apache.httpcomponents.httpclient, commons-io

2016-05-17 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 0d5e29655 -> 6d10b2826 [SPARK-12972][CORE][TEST-MAVEN][TEST-HADOOP2.2] Update org.apache.httpcomponents.httpclient, commons-io ## What changes were proposed in this pull request? This is sort of a hot-fix for

spark git commit: [SPARK-12972][CORE] Update org.apache.httpcomponents.httpclient

2016-05-17 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 8e3ee683b -> 0d5e29655 [SPARK-12972][CORE] Update org.apache.httpcomponents.httpclient ## What changes were proposed in this pull request? (Retry of https://github.com/apache/spark/pull/13049) - update to httpclient 4.5 / httpcore

spark git commit: [SPARK-14906][ML] Copy linalg in PySpark to new ML package

2016-05-17 Thread meng
Repository: spark Updated Branches: refs/heads/branch-2.0 0dd1f8720 -> 8e3ee683b [SPARK-14906][ML] Copy linalg in PySpark to new ML package ## What changes were proposed in this pull request? Copy the linalg (Vector/Matrix and VectorUDT/MatrixUDT) in PySpark to new ML package. ## How was