spark git commit: [SPARK-8530] [ML] add python API for MinMaxScaler

2015-09-11 Thread jkbradley
Repository: spark Updated Branches: refs/heads/master 6ce0886eb -> 5f4644476 [SPARK-8530] [ML] add python API for MinMaxScaler jira: https://issues.apache.org/jira/browse/SPARK-8530 add python API for MinMaxScaler jira for MinMaxScaler: https://issues.apache.org/jira/browse/SPARK-7514
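For readers unfamiliar with what MinMaxScaler computes, here is a minimal pure-Python sketch of min-max rescaling. It is not the PySpark API added by this commit; the function name `min_max_scale` and its parameters `lower` and `upper` are illustrative assumptions. It rescales each value linearly into a target range and maps a constant feature to the midpoint of that range.

```python
def min_max_scale(values, lower=0.0, upper=1.0):
    """Rescale a list of numbers linearly into [lower, upper].

    Illustrative helper only; the name and signature are assumptions,
    not part of Spark.
    """
    e_min, e_max = min(values), max(values)
    if e_max == e_min:
        # Constant feature: map every value to the midpoint of the range.
        return [0.5 * (lower + upper) for _ in values]
    span = e_max - e_min
    return [(v - e_min) / span * (upper - lower) + lower for v in values]


print(min_max_scale([1.0, 2.0, 3.0]))  # [0.0, 0.5, 1.0]
```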

spark git commit: [SPARK-10540] [SQL] Ignore HadoopFsRelationTest's "test all data types" if it is too flaky

2015-09-11 Thread yhuai
Repository: spark Updated Branches: refs/heads/master 2e3a28075 -> 6ce0886eb [SPARK-10540] [SQL] Ignore HadoopFsRelationTest's "test all data types" if it is too flaky If hadoopFsRelationSuites's "test all data types" is too flaky we can disable it for now.

spark git commit: [SPARK-10540] [SQL] Ignore HadoopFsRelationTest's "test all data types" if it is too flaky

2015-09-11 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-1.5 4af925664 -> 295281f52 [SPARK-10540] [SQL] Ignore HadoopFsRelationTest's "test all data types" if it is too flaky If hadoopFsRelationSuites's "test all data types" is too flaky we can disable it for now.

spark git commit: [SPARK-9924] [WEB UI] Don't schedule checkForLogs while some of them …

2015-09-11 Thread vanzin
Repository: spark Updated Branches: refs/heads/branch-1.5 295281f52 -> 7f10bd620 [SPARK-9924] [WEB UI] Don't schedule checkForLogs while some of them … Cherry-pick this to branch 1.5. Author: Rohit Agarwal Closes #8701 from tgravescs/SPARK-9924-1.5 and squashes the

spark git commit: [SPARK-10546] Check partitionId's range in ExternalSorter#spill()

2015-09-11 Thread srowen
Repository: spark Updated Branches: refs/heads/master 5f4644476 -> b231ab893 [SPARK-10546] Check partitionId's range in ExternalSorter#spill() See this thread for background: http://search-hadoop.com/m/q3RTt0rWvIkHAE81 We should check the range of partition Id and provide meaningful message
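The kind of check described here can be sketched in a few lines of Python. This is not the code from ExternalSorter#spill(); the function name `check_partition_id` and the wording of the message are assumptions chosen for illustration. The idea is to fail early with a message that names the offending value and the valid range.

```python
def check_partition_id(partition_id, num_partitions):
    """Return partition_id if it lies in [0, num_partitions), else raise.

    Hypothetical helper; name and message text are illustrative only.
    """
    if not 0 <= partition_id < num_partitions:
        raise ValueError(
            "partition Id %d should be in the range [0, %d)"
            % (partition_id, num_partitions)
        )
    return partition_id


print(check_partition_id(3, 8))  # 3
```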

spark git commit: [PYTHON] Fixed typo in exception message

2015-09-11 Thread srowen
Repository: spark Updated Branches: refs/heads/master b231ab893 -> c37386677 [PYTHON] Fixed typo in exception message Just fixing a typo in exception message, raised when attempting to pickle SparkContext. Author: Icaro Medeiros Closes #8724 from

spark git commit: [SPARK-9990] [SQL] Local hash join follow-ups

2015-09-11 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master e626ac5f5 -> c2af42b5f [SPARK-9990] [SQL] Local hash join follow-ups 1. Hide `LocalNodeIterator` behind the `LocalNode#asIterator` method 2. Add tests for this Author: Andrew Or Closes #8708 from

spark git commit: [SPARK-7142] [SQL] Minor enhancement to BooleanSimplification Optimizer rule. Incorporate review comments

2015-09-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master d5d647380 -> 1eede3b25 [SPARK-7142] [SQL] Minor enhancement to BooleanSimplification Optimizer rule. Incorporate review comments Adding changes suggested by cloud-fan in #5700 cc marmbrus Author: Yash Datta

spark git commit: [SPARK-10564] ThreadingSuite: assertion failures in threads don't fail the test

2015-09-11 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master c2af42b5f -> d74c6a143 [SPARK-10564] ThreadingSuite: assertion failures in threads don't fail the test This commit ensures if an assertion fails within a thread, it will ultimately fail the test. Otherwise we end up potentially masking
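The general problem is not specific to Scala: an exception raised inside a worker thread does not propagate to the thread that started it. A minimal Python sketch of one common remedy follows; it is not the ThreadingSuite change itself, and `run_in_thread` and `failing_check` are hypothetical names. The worker records any exception, and the caller re-raises it after joining.

```python
import threading


def run_in_thread(target):
    """Run target in a new thread and re-raise its exception in the caller."""
    errors = []

    def wrapper():
        try:
            target()
        except BaseException as exc:  # includes AssertionError
            errors.append(exc)

    worker = threading.Thread(target=wrapper)
    worker.start()
    worker.join()
    if errors:
        # Surface the failure in the calling (test) thread.
        raise errors[0]


def failing_check():
    # Stand-in for an assertion that fails inside a thread.
    raise AssertionError("assertion failed inside worker thread")
```

Calling `run_in_thread(failing_check)` raises `AssertionError` in the caller, so a test runner would mark the test as failed instead of silently passing.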

spark git commit: [SPARK-10564] ThreadingSuite: assertion failures in threads don't fail the test

2015-09-11 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.5 7f10bd620 -> fcb24387a [SPARK-10564] ThreadingSuite: assertion failures in threads don't fail the test This commit ensures if an assertion fails within a thread, it will ultimately fail the test. Otherwise we end up potentially

spark git commit: [SPARK-9014] [SQL] Allow Python spark API to use built-in exponential operator

2015-09-11 Thread davies
Repository: spark Updated Branches: refs/heads/master d74c6a143 -> c34fc1976 [SPARK-9014] [SQL] Allow Python spark API to use built-in exponential operator This PR addresses SPARK-9014 (https://issues.apache.org/jira/browse/SPARK-9014) Added functionality: `Column` object in Python now
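Python lets a class support the built-in `**` operator by defining `__pow__` and `__rpow__`. The sketch below shows that mechanism on a hypothetical `Expr` class that builds an expression string; it is not `pyspark.sql.Column`, and the `POWER(...)` text it produces is only an illustrative assumption.

```python
class Expr:
    """Hypothetical stand-in for a column expression (not pyspark's Column)."""

    def __init__(self, text):
        self.text = text

    def __pow__(self, other):
        # Handles: expr ** other
        return Expr("POWER(%s, %s)" % (self.text, _text(other)))

    def __rpow__(self, other):
        # Handles: other ** expr, when `other` does not know about Expr
        return Expr("POWER(%s, %s)" % (_text(other), self.text))


def _text(value):
    """Render an operand as expression text."""
    return value.text if isinstance(value, Expr) else repr(value)


print((Expr("age") ** 2).text)  # POWER(age, 2)
print((2 ** Expr("age")).text)  # POWER(2, age)
```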

spark git commit: [SPARK-10442] [SQL] fix string to boolean cast

2015-09-11 Thread yhuai
Repository: spark Updated Branches: refs/heads/master c37386677 -> d5d647380 [SPARK-10442] [SQL] fix string to boolean cast When we cast string to boolean in hive, it returns `true` if the length of string is > 0, and spark SQL follows this behavior. However, this behavior is very different
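The length-based rule described above can be sketched in Python to show why it surprises users. This models only the old behavior as stated in the commit message; the function name is hypothetical, and returning `None` for a missing input is an assumption.

```python
def length_based_string_to_boolean(s):
    """Old rule as described above: true if the string is non-empty."""
    if s is None:
        return None  # assumption: a missing value stays missing
    return len(s) > 0


print(length_based_string_to_boolean("true"))   # True
print(length_based_string_to_boolean("false"))  # True
print(length_based_string_to_boolean(""))       # False
```

Under this rule the string "false" casts to true, since only the length is inspected.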

spark git commit: [SPARK-9992] [SPARK-9994] [SPARK-9998] [SQL] Implement the local TopK, sample and intersect operators

2015-09-11 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 1eede3b25 -> e626ac5f5 [SPARK-9992] [SPARK-9994] [SPARK-9998] [SQL] Implement the local TopK, sample and intersect operators This PR is in conflict with #8535. I will update this one when #8535 gets merged. Author: zsxwing

spark git commit: [SPARK-10472] [SQL] Fixes DataType.typeName for UDT

2015-09-11 Thread lian
Repository: spark Updated Branches: refs/heads/master a140dd77c -> e1d7f6429 [SPARK-10472] [SQL] Fixes DataType.typeName for UDT Before this fix, `MyDenseVectorUDT.typeName` gives `mydensevecto`, which is not desirable. Author: Cheng Lian Closes #8640 from
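One way to get `mydensevecto` from `MyDenseVectorUDT` is to derive the name by dropping the last four characters of the class name, which works for names ending in `Type` but truncates names ending in `UDT`. The Python sketch below reproduces that symptom and shows one possible remedy; both functions are hypothetical illustrations, and neither is claimed to be the code in this commit.

```python
def type_name_drop_last_four(class_name):
    """Assumed faulty rule: drop four characters, then lowercase."""
    return class_name[:-4].lower()


def type_name_strip_suffix(class_name):
    """Possible remedy: strip a known suffix instead of a fixed length."""
    for suffix in ("Type", "UDT"):
        if class_name.endswith(suffix):
            class_name = class_name[: -len(suffix)]
            break
    return class_name.lower()


print(type_name_drop_last_four("IntegerType"))       # integer
print(type_name_drop_last_four("MyDenseVectorUDT"))  # mydensevecto
print(type_name_strip_suffix("MyDenseVectorUDT"))    # mydensevector
```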

spark git commit: [SPARK-10556] Remove explicit Scala version for sbt project build files

2015-09-11 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.5 89d351b5a -> 4af925664 [SPARK-10556] Remove explicit Scala version for sbt project build files Previously, project/plugins.sbt explicitly set scalaVersion to 2.10.4. This can cause issues when using a version of sbt that is compiled

spark git commit: [SPARK-10556] Remove explicit Scala version for sbt project build files

2015-09-11 Thread srowen
Repository: spark Updated Branches: refs/heads/master e1d7f6429 -> 9bbe33f31 [SPARK-10556] Remove explicit Scala version for sbt project build files Previously, project/plugins.sbt explicitly set scalaVersion to 2.10.4. This can cause issues when using a version of sbt that is compiled

spark git commit: [SPARK-10556] Remove explicit Scala version for sbt project build files

2015-09-11 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.4 53f119cc5 -> af69c4c90 [SPARK-10556] Remove explicit Scala version for sbt project build files Previously, project/plugins.sbt explicitly set scalaVersion to 2.10.4. This can cause issues when using a version of sbt that is compiled

spark git commit: [SPARK-10556] Remove explicit Scala version for sbt project build files

2015-09-11 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.3 d0d7ada9f -> 8c8d7ab20 [SPARK-10556] Remove explicit Scala version for sbt project build files Previously, project/plugins.sbt explicitly set scalaVersion to 2.10.4. This can cause issues when using a version of sbt that is compiled

spark git commit: [SPARK-10518] [DOCS] Update code examples in spark.ml user guide to use LIBSVM data source instead of MLUtils

2015-09-11 Thread meng
Repository: spark Updated Branches: refs/heads/master 9bbe33f31 -> c268ca4dd [SPARK-10518] [DOCS] Update code examples in spark.ml user guide to use LIBSVM data source instead of MLUtils I fixed to use LIBSVM data source in the example code in spark.ml instead of MLUtils Author: y-shimizu

spark git commit: [SPARK-10026] [ML] [PySpark] Implement some common Params for regression in PySpark

2015-09-11 Thread meng
Repository: spark Updated Branches: refs/heads/master c268ca4dd -> b656e6134 [SPARK-10026] [ML] [PySpark] Implement some common Params for regression in PySpark LinearRegression and LogisticRegression lack some Params in Python, and some Params are not shared classes which lead we need

spark git commit: [SPARK-9773] [ML] [PySpark] Add Python API for MultilayerPerceptronClassifier

2015-09-11 Thread meng
Repository: spark Updated Branches: refs/heads/master b656e6134 -> b01b26260 [SPARK-9773] [ML] [PySpark] Add Python API for MultilayerPerceptronClassifier Add Python API for ```MultilayerPerceptronClassifier```. Author: Yanbo Liang Closes #8067 from

spark git commit: [MINOR] [MLLIB] [ML] [DOC] Minor doc fixes for StringIndexer and MetadataUtils

2015-09-11 Thread meng
Repository: spark Updated Branches: refs/heads/master 960d2d0ac -> 2e3a28075 [MINOR] [MLLIB] [ML] [DOC] Minor doc fixes for StringIndexer and MetadataUtils Changes: * Make Scala doc for StringIndexerInverse clearer. Also remove Scala doc from transformSchema, so that the doc is inherited. *