spark git commit: [SPARK-21976][DOC] Fix wrong documentation for Mean Absolute Error.

2017-09-12 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.2 b1b5a7fdc -> 10c68366e [SPARK-21976][DOC] Fix wrong documentation for Mean Absolute Error. ## What changes were proposed in this pull request? Fixed wrong documentation for Mean Absolute Error. Even though the code is correct for the

spark git commit: [SPARK-21976][DOC] Fix wrong documentation for Mean Absolute Error.

2017-09-12 Thread srowen
Repository: spark Updated Branches: refs/heads/master dc74c0e67 -> e2ac2f1c7 [SPARK-21976][DOC] Fix wrong documentation for Mean Absolute Error. ## What changes were proposed in this pull request? Fixed wrong documentation for Mean Absolute Error. Even though the code is correct for the MAE:

spark git commit: [SPARK-21976][DOC] Fix wrong documentation for Mean Absolute Error.

2017-09-12 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.1 ae4e8ae41 -> e7696ebef [SPARK-21976][DOC] Fix wrong documentation for Mean Absolute Error. ## What changes were proposed in this pull request? Fixed wrong documentation for Mean Absolute Error. Even though the code is correct for the

spark git commit: [SPARK-14516][ML] Adding ClusteringEvaluator with the implementation of Cosine silhouette and squared Euclidean silhouette.

2017-09-12 Thread yliang
Repository: spark Updated Branches: refs/heads/master e2ac2f1c7 -> dd7816758 [SPARK-14516][ML] Adding ClusteringEvaluator with the implementation of Cosine silhouette and squared Euclidean silhouette. ## What changes were proposed in this pull request? This PR adds the ClusteringEvaluator Ev

spark git commit: [SPARK-21610][SQL][FOLLOWUP] Corrupt records are not handled properly when creating a dataframe from a file

2017-09-12 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master dd7816758 -> 7d0a3ef4c [SPARK-21610][SQL][FOLLOWUP] Corrupt records are not handled properly when creating a dataframe from a file ## What changes were proposed in this pull request? When the `requiredSchema` only contains `_corrupt_recor

spark git commit: [DOCS] Fix unreachable links in the document

2017-09-12 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.2 10c68366e -> 63098dc31 [DOCS] Fix unreachable links in the document ## What changes were proposed in this pull request? Recently, I found two unreachable links in the document and fixed them. Because of small changes related to the doc

spark git commit: [DOCS] Fix unreachable links in the document

2017-09-12 Thread srowen
Repository: spark Updated Branches: refs/heads/master 7d0a3ef4c -> 957558235 [DOCS] Fix unreachable links in the document ## What changes were proposed in this pull request? Recently, I found two unreachable links in the document and fixed them. Because of small changes related to the documen

spark git commit: [SPARK-17642][SQL] support DESC EXTENDED/FORMATTED table column commands

2017-09-12 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 957558235 -> 515910e9b [SPARK-17642][SQL] support DESC EXTENDED/FORMATTED table column commands ## What changes were proposed in this pull request? Support DESC (EXTENDED | FORMATTED) ? TABLE COLUMN command. Support DESC EXTENDED | FORMATT

spark git commit: [SPARK-21027][ML][PYTHON] Added tunable parallelism to one vs. rest in both Scala mllib and Pyspark

2017-09-12 Thread jkbradley
Repository: spark Updated Branches: refs/heads/master 515910e9b -> 720c94fe7 [SPARK-21027][ML][PYTHON] Added tunable parallelism to one vs. rest in both Scala mllib and Pyspark # What changes were proposed in this pull request? Added tunable parallelism to the pyspark implementation of one v

spark git commit: [SPARK-21368][SQL] TPCDSQueryBenchmark can't refer query files.

2017-09-12 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 720c94fe7 -> b9b54b1c8 [SPARK-21368][SQL] TPCDSQueryBenchmark can't refer query files. ## What changes were proposed in this pull request? TPCDSQueryBenchmark packaged into a jar doesn't work with spark-submit. It's because of the failure

spark git commit: [SPARK-18608][ML] Fix double caching

2017-09-12 Thread jkbradley
Repository: spark Updated Branches: refs/heads/master b9b54b1c8 -> c5f9b89dd [SPARK-18608][ML] Fix double caching ## What changes were proposed in this pull request? `df.rdd.getStorageLevel` => `df.storageLevel` using cmd `find . -name '*.scala' | xargs -i bash -c 'egrep -in "\.rdd\.getStora

spark git commit: [SPARK-18608][ML] Fix double caching

2017-09-12 Thread jkbradley
Repository: spark Updated Branches: refs/heads/branch-2.2 63098dc31 -> b606dc177 [SPARK-18608][ML] Fix double caching ## What changes were proposed in this pull request? `df.rdd.getStorageLevel` => `df.storageLevel` using cmd `find . -name '*.scala' | xargs -i bash -c 'egrep -in "\.rdd\.getS

spark git commit: [SPARK-21979][SQL] Improve QueryPlanConstraints framework

2017-09-12 Thread lixiao
Repository: spark Updated Branches: refs/heads/master c5f9b89dd -> 1a9857476 [SPARK-21979][SQL] Improve QueryPlanConstraints framework ## What changes were proposed in this pull request? Improve QueryPlanConstraints framework, make it robust and simple. In https://github.com/apache/spark/pull

spark git commit: [SPARK-21513][SQL] Allow UDF to_json support converting MapType to json

2017-09-12 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 1a9857476 -> 371e4e205 [SPARK-21513][SQL] Allow UDF to_json support converting MapType to json # What changes were proposed in this pull request? UDF to_json only supports converting `StructType` or `ArrayType` of `StructType`s to a json o