spark git commit: [SPARK-15553][SQL] Dataset.createTempView should use CreateViewCommand

2016-05-27 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 ada319844 -> 36045106d [SPARK-15553][SQL] Dataset.createTempView should use CreateViewCommand ## What changes were proposed in this pull request? Let `Dataset.createTempView` and `Dataset.createOrReplaceTempView` use

spark git commit: [SPARK-15553][SQL] Dataset.createTempView should use CreateViewCommand

2016-05-27 Thread rxin
Repository: spark Updated Branches: refs/heads/master 73178c755 -> f1b220eee [SPARK-15553][SQL] Dataset.createTempView should use CreateViewCommand ## What changes were proposed in this pull request? Let `Dataset.createTempView` and `Dataset.createOrReplaceTempView` use `CreateViewCommand`,

[1/2] spark git commit: [SPARK-15633][MINOR] Make package name for Java tests consistent

2016-05-27 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 3801fb4f3 -> ada319844 http://git-wip-us.apache.org/repos/asf/spark/blob/ada31984/external/java8-tests/src/test/java/test/org/apache/spark/java8/dstream/Java8APISuite.java

[2/2] spark git commit: [SPARK-15633][MINOR] Make package name for Java tests consistent

2016-05-27 Thread rxin
[SPARK-15633][MINOR] Make package name for Java tests consistent ## What changes were proposed in this pull request? This is a simple patch that makes package names for Java 8 test suites consistent. I moved everything to test.org.apache.spark to we can test package private APIs properly. Also

[1/2] spark git commit: [SPARK-15633][MINOR] Make package name for Java tests consistent

2016-05-27 Thread rxin
Repository: spark Updated Branches: refs/heads/master 9893dc975 -> 73178c755 http://git-wip-us.apache.org/repos/asf/spark/blob/73178c75/external/java8-tests/src/test/java/test/org/apache/spark/java8/dstream/Java8APISuite.java

spark git commit: [SPARK-15610][ML] update error message for k in pca

2016-05-27 Thread srowen
Repository: spark Updated Branches: refs/heads/master 88c9c467a -> 9893dc975 [SPARK-15610][ML] update error message for k in pca ## What changes were proposed in this pull request? Fix the wrong bound of `k` in `PCA` `require(k <= sources.first().size, ...` -> `require(k <

spark git commit: [SPARK-15610][ML] update error message for k in pca

2016-05-27 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 6d82e0c1b -> 3801fb4f3 [SPARK-15610][ML] update error message for k in pca ## What changes were proposed in this pull request? Fix the wrong bound of `k` in `PCA` `require(k <= sources.first().size, ...` -> `require(k <

spark git commit: [SPARK-15562][ML] Delete temp directory after program exit in DataFrameExample

2016-05-27 Thread srowen
Repository: spark Updated Branches: refs/heads/master 5d4dafe8f -> 88c9c467a [SPARK-15562][ML] Delete temp directory after program exit in DataFrameExample ## What changes were proposed in this pull request? Temp directory used to save records is not deleted after program exit in

spark git commit: [SPARK-15562][ML] Delete temp directory after program exit in DataFrameExample

2016-05-27 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 8467e2102 -> 6d82e0c1b [SPARK-15562][ML] Delete temp directory after program exit in DataFrameExample ## What changes were proposed in this pull request? Temp directory used to save records is not deleted after program exit in

spark git commit: [SPARK-15449][MLLIB][EXAMPLE] Wrong Data Format - Documentation Issue

2016-05-27 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 80a40e8e2 -> 8467e2102 [SPARK-15449][MLLIB][EXAMPLE] Wrong Data Format - Documentation Issue ## What changes were proposed in this pull request? (Please fill in changes proposed in this fix) In the MLLib naivebayes example, scala and

spark git commit: [SPARK-15449][MLLIB][EXAMPLE] Wrong Data Format - Documentation Issue

2016-05-27 Thread srowen
Repository: spark Updated Branches: refs/heads/master 4a2fb8b87 -> 5d4dafe8f [SPARK-15449][MLLIB][EXAMPLE] Wrong Data Format - Documentation Issue ## What changes were proposed in this pull request? (Please fill in changes proposed in this fix) In the MLLib naivebayes example, scala and

spark git commit: [SPARK-15594][SQL] ALTER TABLE SERDEPROPERTIES does not respect partition spec

2016-05-27 Thread yhuai
Repository: spark Updated Branches: refs/heads/master 776d183c8 -> 4a2fb8b87 [SPARK-15594][SQL] ALTER TABLE SERDEPROPERTIES does not respect partition spec ## What changes were proposed in this pull request? These commands ignore the partition spec and change the storage properties of the

spark git commit: [SPARK-15594][SQL] ALTER TABLE SERDEPROPERTIES does not respect partition spec

2016-05-27 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-2.0 dc6e94157 -> 80a40e8e2 [SPARK-15594][SQL] ALTER TABLE SERDEPROPERTIES does not respect partition spec ## What changes were proposed in this pull request? These commands ignore the partition spec and change the storage properties of

spark git commit: [SPARK-9876][SQL] Update Parquet to 1.8.1.

2016-05-27 Thread lian
Repository: spark Updated Branches: refs/heads/branch-2.0 9c137b2e3 -> dc6e94157 [SPARK-9876][SQL] Update Parquet to 1.8.1. ## What changes were proposed in this pull request? This includes minimal changes to get Spark using the current release of Parquet, 1.8.1. ## How was this patch

spark git commit: [SPARK-9876][SQL] Update Parquet to 1.8.1.

2016-05-27 Thread lian
Repository: spark Updated Branches: refs/heads/master 019afd9c7 -> 776d183c8 [SPARK-9876][SQL] Update Parquet to 1.8.1. ## What changes were proposed in this pull request? This includes minimal changes to get Spark using the current release of Parquet, 1.8.1. ## How was this patch tested?

spark git commit: [SPARK-15431][SQL][BRANCH-2.0-TEST] rework the clisuite test cases

2016-05-27 Thread yhuai
Repository: spark Updated Branches: refs/heads/master 21b2605dc -> 019afd9c7 [SPARK-15431][SQL][BRANCH-2.0-TEST] rework the clisuite test cases ## What changes were proposed in this pull request? This PR reworks on the CliSuite test cases for `LIST FILES/JARS` commands. CC yhuai Thanks!

spark git commit: [SPARK-15431][SQL][BRANCH-2.0-TEST] rework the clisuite test cases

2016-05-27 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-2.0 dcf498e8a -> 9c137b2e3 [SPARK-15431][SQL][BRANCH-2.0-TEST] rework the clisuite test cases ## What changes were proposed in this pull request? This PR reworks on the CliSuite test cases for `LIST FILES/JARS` commands. CC yhuai Thanks!

spark git commit: [SPARK-15413][ML][MLLIB] Change `toBreeze` to `asBreeze` in Vector and Matrix

2016-05-27 Thread jkbradley
Repository: spark Updated Branches: refs/heads/branch-2.0 a778d3c90 -> dcf498e8a [SPARK-15413][ML][MLLIB] Change `toBreeze` to `asBreeze` in Vector and Matrix ## What changes were proposed in this pull request? We're using `asML` to convert the mllib vector/matrix to ml vector/matrix now.

spark git commit: [SPARK-15413][ML][MLLIB] Change `toBreeze` to `asBreeze` in Vector and Matrix

2016-05-27 Thread jkbradley
Repository: spark Updated Branches: refs/heads/master 130b8d07b -> 21b2605dc [SPARK-15413][ML][MLLIB] Change `toBreeze` to `asBreeze` in Vector and Matrix ## What changes were proposed in this pull request? We're using `asML` to convert the mllib vector/matrix to ml vector/matrix now. Using

spark git commit: [SPARK-15008][ML][PYSPARK] Add integration test for OneVsRest

2016-05-27 Thread jkbradley
Repository: spark Updated Branches: refs/heads/branch-2.0 e6e2f293d -> a778d3c90 [SPARK-15008][ML][PYSPARK] Add integration test for OneVsRest ## What changes were proposed in this pull request? 1. Add `_transfer_param_map_to/from_java` for OneVsRest; 2. Add `_compare_params` in ml/tests.py

spark git commit: [SPARK-15008][ML][PYSPARK] Add integration test for OneVsRest

2016-05-27 Thread jkbradley
Repository: spark Updated Branches: refs/heads/master a3550e374 -> 130b8d07b [SPARK-15008][ML][PYSPARK] Add integration test for OneVsRest ## What changes were proposed in this pull request? 1. Add `_transfer_param_map_to/from_java` for OneVsRest; 2. Add `_compare_params` in ml/tests.py to

spark git commit: [SPARK-11959][SPARK-15484][DOC][ML] Document WLS and IRLS

2016-05-27 Thread jkbradley
Repository: spark Updated Branches: refs/heads/branch-2.0 5dd1423f4 -> e6e2f293d [SPARK-11959][SPARK-15484][DOC][ML] Document WLS and IRLS ## What changes were proposed in this pull request? * Document ```WeightedLeastSquares```(normal equation) and ```IterativelyReweightedLeastSquares```. *

spark git commit: [SPARK-11959][SPARK-15484][DOC][ML] Document WLS and IRLS

2016-05-27 Thread jkbradley
Repository: spark Updated Branches: refs/heads/master c96244f5a -> a3550e374 [SPARK-11959][SPARK-15484][DOC][ML] Document WLS and IRLS ## What changes were proposed in this pull request? * Document ```WeightedLeastSquares```(normal equation) and ```IterativelyReweightedLeastSquares```. *

spark git commit: [SPARK-15186][ML][DOCS] Add user guide for generalized linear regression

2016-05-27 Thread jkbradley
Repository: spark Updated Branches: refs/heads/branch-2.0 d76e066d3 -> 5dd1423f4 [SPARK-15186][ML][DOCS] Add user guide for generalized linear regression ## What changes were proposed in this pull request? This patch adds a user guide section for generalized linear regression and includes

spark git commit: [SPARK-15186][ML][DOCS] Add user guide for generalized linear regression

2016-05-27 Thread jkbradley
Repository: spark Updated Branches: refs/heads/master a96e4151a -> c96244f5a [SPARK-15186][ML][DOCS] Add user guide for generalized linear regression ## What changes were proposed in this pull request? This patch adds a user guide section for generalized linear regression and includes the

spark git commit: [SPARK-14400][SQL] ScriptTransformation does not fail the job for bad user command

2016-05-27 Thread rxin
Repository: spark Updated Branches: refs/heads/master b376a4eab -> a96e4151a [SPARK-14400][SQL] ScriptTransformation does not fail the job for bad user command ## What changes were proposed in this pull request? - Refer to the Jira for the problem: jira :

spark git commit: [SPARK-14400][SQL] ScriptTransformation does not fail the job for bad user command

2016-05-27 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 5ea58898c -> d76e066d3 [SPARK-14400][SQL] ScriptTransformation does not fail the job for bad user command ## What changes were proposed in this pull request? - Refer to the Jira for the problem: jira :

spark git commit: [HOTFIX] Scala 2.10 compile GaussianMixtureModel

2016-05-27 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-2.0 17f43cc87 -> 5ea58898c [HOTFIX] Scala 2.10 compile GaussianMixtureModel Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/5ea58898 Tree:

spark git commit: [HOTFIX] Scala 2.10 compile GaussianMixtureModel

2016-05-27 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 1b98fa2e4 -> b376a4eab [HOTFIX] Scala 2.10 compile GaussianMixtureModel Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/b376a4ea Tree:

spark git commit: [YARN][DOC][MINOR] Remove several obsolete env variables and update the doc

2016-05-27 Thread vanzin
Repository: spark Updated Branches: refs/heads/branch-2.0 074989af9 -> 17f43cc87 [YARN][DOC][MINOR] Remove several obsolete env variables and update the doc ## What changes were proposed in this pull request? Remove several obsolete env variables not supported for Spark on YARN now, also

spark git commit: [YARN][DOC][MINOR] Remove several obsolete env variables and update the doc

2016-05-27 Thread vanzin
Repository: spark Updated Branches: refs/heads/master 623aae590 -> 1b98fa2e4 [YARN][DOC][MINOR] Remove several obsolete env variables and update the doc ## What changes were proposed in this pull request? Remove several obsolete env variables not supported for Spark on YARN now, also

spark git commit: [SPARK-15531][DEPLOY] spark-class tries to use too much memory when running Launcher

2016-05-27 Thread vanzin
Repository: spark Updated Branches: refs/heads/branch-2.0 30e87b55b -> 074989af9 [SPARK-15531][DEPLOY] spark-class tries to use too much memory when running Launcher ## What changes were proposed in this pull request? Explicitly limit launcher JVM memory to modest 128m ## How was this

spark git commit: [SPARK-15531][DEPLOY] spark-class tries to use too much memory when running Launcher

2016-05-27 Thread vanzin
Repository: spark Updated Branches: refs/heads/master ce756daa4 -> 623aae590 [SPARK-15531][DEPLOY] spark-class tries to use too much memory when running Launcher ## What changes were proposed in this pull request? Explicitly limit launcher JVM memory to modest 128m ## How was this patch

spark git commit: [SPARK-15569] Reduce frequency of updateBytesWritten function in Disk…

2016-05-27 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 5bdbedf22 -> ce756daa4 [SPARK-15569] Reduce frequency of updateBytesWritten function in Disk… ## What changes were proposed in this pull request? Profiling a Spark job spilling large amount of intermediate data we found that

spark git commit: [SPARK-15569] Reduce frequency of updateBytesWritten function in Disk…

2016-05-27 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-2.0 89fdb6972 -> 30e87b55b [SPARK-15569] Reduce frequency of updateBytesWritten function in Disk… ## What changes were proposed in this pull request? Profiling a Spark job spilling large amount of intermediate data we found that

spark git commit: [MINOR][DOCS] Typo fixes in Dataset scaladoc

2016-05-27 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-2.0 f52a95248 -> 89fdb6972 [MINOR][DOCS] Typo fixes in Dataset scaladoc ## What changes were proposed in this pull request? Minor typo fixes in Dataset scaladoc * Corrected context type as SparkSession, not SQLContext. liancheng rxin

spark git commit: [MINOR][DOCS] Typo fixes in Dataset scaladoc

2016-05-27 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master a52e68133 -> 5bdbedf22 [MINOR][DOCS] Typo fixes in Dataset scaladoc ## What changes were proposed in this pull request? Minor typo fixes in Dataset scaladoc * Corrected context type as SparkSession, not SQLContext. liancheng rxin

spark git commit: [SPARK-15597][SQL] Add SparkSession.emptyDataset

2016-05-27 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-2.0 e69639f43 -> f52a95248 [SPARK-15597][SQL] Add SparkSession.emptyDataset ## What changes were proposed in this pull request? This patch adds a new function emptyDataset to SparkSession, for creating an empty dataset. ## How was this

spark git commit: [SPARK-15597][SQL] Add SparkSession.emptyDataset

2016-05-27 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 635fb30f8 -> a52e68133 [SPARK-15597][SQL] Add SparkSession.emptyDataset ## What changes were proposed in this pull request? This patch adds a new function emptyDataset to SparkSession, for creating an empty dataset. ## How was this patch

spark git commit: [SPARK-15599][SQL][DOCS] API docs for `createDataset` functions in SparkSession

2016-05-27 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-2.0 a14c88acc -> e69639f43 [SPARK-15599][SQL][DOCS] API docs for `createDataset` functions in SparkSession ## What changes were proposed in this pull request? Adds API docs and usage examples for the 3 `createDataset` calls in

spark git commit: [SPARK-15599][SQL][DOCS] API docs for `createDataset` functions in SparkSession

2016-05-27 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 4538443e2 -> 635fb30f8 [SPARK-15599][SQL][DOCS] API docs for `createDataset` functions in SparkSession ## What changes were proposed in this pull request? Adds API docs and usage examples for the 3 `createDataset` calls in `SparkSession`

spark git commit: [SPARK-15584][SQL] Abstract duplicate code: `spark.sql.sources.` properties

2016-05-27 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master d24e25157 -> 4538443e2 [SPARK-15584][SQL] Abstract duplicate code: `spark.sql.sources.` properties ## What changes were proposed in this pull request? This PR replaces `spark.sql.sources.` strings with `CreateDataSourceTableUtils.*`

spark git commit: [SPARK-15584][SQL] Abstract duplicate code: `spark.sql.sources.` properties

2016-05-27 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-2.0 a355edeef -> a14c88acc [SPARK-15584][SQL] Abstract duplicate code: `spark.sql.sources.` properties ## What changes were proposed in this pull request? This PR replaces `spark.sql.sources.` strings with `CreateDataSourceTableUtils.*`

spark git commit: [SPARK-15603][MLLIB] Replace SQLContext with SparkSession in ML/MLLib

2016-05-27 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-2.0 2cb84dd23 -> a355edeef [SPARK-15603][MLLIB] Replace SQLContext with SparkSession in ML/MLLib ## What changes were proposed in this pull request? This PR replaces all deprecated `SQLContext` occurrences with `SparkSession` in

spark git commit: [SPARK-15603][MLLIB] Replace SQLContext with SparkSession in ML/MLLib

2016-05-27 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master c17272902 -> d24e25157 [SPARK-15603][MLLIB] Replace SQLContext with SparkSession in ML/MLLib ## What changes were proposed in this pull request? This PR replaces all deprecated `SQLContext` occurrences with `SparkSession` in `ML/MLLib`

spark git commit: [SPARK-15565][SQL] Add the File Scheme to the Default Value of WAREHOUSE_PATH

2016-05-27 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-2.0 b430aa98c -> 2cb84dd23 [SPARK-15565][SQL] Add the File Scheme to the Default Value of WAREHOUSE_PATH What changes were proposed in this pull request? The default value of `spark.sql.warehouse.dir` is

spark git commit: [SPARK-15565][SQL] Add the File Scheme to the Default Value of WAREHOUSE_PATH

2016-05-27 Thread yhuai
Repository: spark Updated Branches: refs/heads/master 6f95c6c03 -> c17272902 [SPARK-15565][SQL] Add the File Scheme to the Default Value of WAREHOUSE_PATH What changes were proposed in this pull request? The default value of `spark.sql.warehouse.dir` is

spark git commit: [SPARK-15431][SQL][HOTFIX] ignore 'list' command testcase from CliSuite for now

2016-05-27 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-2.0 b3845fede -> b430aa98c [SPARK-15431][SQL][HOTFIX] ignore 'list' command testcase from CliSuite for now ## What changes were proposed in this pull request? The test cases for `list` command added in `CliSuite` by PR #13212 can not run

spark git commit: [SPARK-15431][SQL][HOTFIX] ignore 'list' command testcase from CliSuite for now

2016-05-27 Thread yhuai
Repository: spark Updated Branches: refs/heads/master d5911d117 -> 6f95c6c03 [SPARK-15431][SQL][HOTFIX] ignore 'list' command testcase from CliSuite for now ## What changes were proposed in this pull request? The test cases for `list` command added in `CliSuite` by PR #13212 can not run in