spark git commit: [SPARK-12692][BUILD][SQL] Scala style: Fix the style violation (Space before ", ")

2016-01-12 Thread rxin
Repository: spark Updated Branches: refs/heads/master dc7b3870f -> cb7b864a2 [SPARK-12692][BUILD][SQL] Scala style: Fix the style violation (Space before ",") Fix the style violation (space before , and :). This PR is a followup for #10643 and rework of #10685 . Author: Kousuke Saruta Clos

spark git commit: [SPARK-12558][SQL] AnalysisException when multiple functions applied in GROUP BY clause

2016-01-12 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-1.6 f71e5cc12 -> dcdc864cf [SPARK-12558][SQL] AnalysisException when multiple functions applied in GROUP BY clause cloud-fan Can you please take a look ? In this case, we are failing during check analysis while validating the aggregation

spark git commit: [SPARK-12558][SQL] AnalysisException when multiple functions applied in GROUP BY clause

2016-01-12 Thread yhuai
Repository: spark Updated Branches: refs/heads/master f14922cff -> dc7b3870f [SPARK-12558][SQL] AnalysisException when multiple functions applied in GROUP BY clause cloud-fan Can you please take a look ? In this case, we are failing during check analysis while validating the aggregation exp

spark git commit: [SPARK-12692][BUILD][CORE] Scala style: Fix the style violation (Space before ", ")

2016-01-12 Thread rxin
Repository: spark Updated Branches: refs/heads/master b3b9ad23c -> f14922cff [SPARK-12692][BUILD][CORE] Scala style: Fix the style violation (Space before ",") Fix the style violation (space before , and :). This PR is a followup for #10643 Author: Kousuke Saruta Closes #10719 from sarutak

spark git commit: [HOT-FIX] bypass hive test when parse logical plan to json

2016-01-12 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.6 03e523e52 -> f71e5cc12 [HOT-FIX] bypass hive test when parse logical plan to json https://github.com/apache/spark/pull/10311 introduces some rare, non-deterministic flakiness for hive udf tests, see https://github.com/apache/spark/pul

spark git commit: [SPARK-12788][SQL] Simplify BooleanEquality by using casts.

2016-01-12 Thread rxin
Repository: spark Updated Branches: refs/heads/master 924708496 -> b3b9ad23c [SPARK-12788][SQL] Simplify BooleanEquality by using casts. Author: Reynold Xin Closes #10730 from rxin/SPARK-12788. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/

spark git commit: [SPARK-12785][SQL] Add ColumnarBatch, an in memory columnar format for execution.

2016-01-12 Thread rxin
Repository: spark Updated Branches: refs/heads/master 4f60651cb -> 924708496 [SPARK-12785][SQL] Add ColumnarBatch, an in memory columnar format for execution. There are many potential benefits of having an efficient in memory columnar format as an alternate to UnsafeRow. This patch introduce

spark git commit: Revert "[SPARK-12645][SPARKR] SparkR support hash function"

2016-01-12 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-1.6 94b39f777 -> 03e523e52 Revert "[SPARK-12645][SPARKR] SparkR support hash function" This reverts commit 8b5f23043322254c725c703c618ba3d3cc4a4240. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apac

spark git commit: [SPARK-12652][PYSPARK] Upgrade Py4J to 0.9.1

2016-01-12 Thread zsxwing
Repository: spark Updated Branches: refs/heads/master 8ed5f12d2 -> 4f60651cb [SPARK-12652][PYSPARK] Upgrade Py4J to 0.9.1 - [x] Upgrade Py4J to 0.9.1 - [x] SPARK-12657: Revert SPARK-12617 - [x] SPARK-12658: Revert SPARK-12511 - Still keep the change that only reading checkpoint once. This is

spark git commit: [SPARK-12724] SQL generation support for persisted data source tables

2016-01-12 Thread lian
Repository: spark Updated Branches: refs/heads/master 0d543b98f -> 8ed5f12d2 [SPARK-12724] SQL generation support for persisted data source tables This PR implements SQL generation support for persisted data source tables. A new field `metastoreTableIdentifier: Option[TableIdentifier]` is ad

spark git commit: Revert "[SPARK-12692][BUILD][SQL] Scala style: Fix the style violation (Space before ", " or ":")"

2016-01-12 Thread rxin
Repository: spark Updated Branches: refs/heads/master 0ed430e31 -> 0d543b98f Revert "[SPARK-12692][BUILD][SQL] Scala style: Fix the style violation (Space before "," or ":")" This reverts commit 8cfa218f4f1b05f4d076ec15dd0a033ad3e4500d. Project: http://git-wip-us.apache.org/repos/asf/spark/

spark git commit: [SPARK-12768][SQL] Remove CaseKeyWhen expression

2016-01-12 Thread rxin
Repository: spark Updated Branches: refs/heads/master 508592b1b -> 0ed430e31 [SPARK-12768][SQL] Remove CaseKeyWhen expression This patch removes CaseKeyWhen expression and replaces it with a factory method that generates the equivalent CaseWhen. This reduces the amount of code we'd need to m

spark git commit: [SPARK-9843][SQL] Make catalyst optimizer pass pluggable at runtime

2016-01-12 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 1d8887953 -> 508592b1b [SPARK-9843][SQL] Make catalyst optimizer pass pluggable at runtime Let me know whether you'd like to see it in other place Author: Robert Kruszewski Closes #10210 from robert3005/feature/pluggable-optimizer. Pro

spark git commit: [SPARK-12762][SQL] Add unit test for SimplifyConditionals optimization rule

2016-01-12 Thread rxin
Repository: spark Updated Branches: refs/heads/master 7e15044d9 -> 1d8887953 [SPARK-12762][SQL] Add unit test for SimplifyConditionals optimization rule This pull request does a few small things: 1. Separated if simplification from BooleanSimplification and created a new rule SimplifyConditi

spark git commit: [SPARK-7615][MLLIB] MLLIB Word2Vec wordVectors divided by Euclidean Norm equals to zero

2016-01-12 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 4c67d55c0 -> 94b39f777 [SPARK-7615][MLLIB] MLLIB Word2Vec wordVectors divided by Euclidean Norm equals to zero Cosine similarity with 0 vector should be 0 Related to https://github.com/apache/spark/pull/10152 Author: Sean Owen Clos

spark git commit: [SPARK-5273][MLLIB][DOCS] Improve documentation examples for LinearRegression

2016-01-12 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 3221a7d91 -> 4c67d55c0 [SPARK-5273][MLLIB][DOCS] Improve documentation examples for LinearRegression Use a much smaller step size in LinearRegressionWithSGD MLlib examples to achieve a reasonable RMSE. Our training folks hit this exac

spark git commit: [SPARK-12582][TEST] IndexShuffleBlockResolverSuite fails in windows

2016-01-12 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 46fc7a12a -> 3221a7d91 [SPARK-12582][TEST] IndexShuffleBlockResolverSuite fails in windows [SPARK-12582][Test] IndexShuffleBlockResolverSuite fails in windows * IndexShuffleBlockResolverSuite fails in windows due to file is not closed.

spark git commit: [SPARK-12582][TEST] IndexShuffleBlockResolverSuite fails in windows

2016-01-12 Thread srowen
Repository: spark Updated Branches: refs/heads/master 9f0995bb0 -> 7e15044d9 [SPARK-12582][TEST] IndexShuffleBlockResolverSuite fails in windows [SPARK-12582][Test] IndexShuffleBlockResolverSuite fails in windows * IndexShuffleBlockResolverSuite fails in windows due to file is not closed. * m

spark git commit: [SPARK-12638][API DOC] Parameter explanation not very accurate for rdd function "aggregate"

2016-01-12 Thread srowen
Repository: spark Updated Branches: refs/heads/master 9c7f34af3 -> 9f0995bb0 [SPARK-12638][API DOC] Parameter explanation not very accurate for rdd function "aggregate" Currently, RDD function aggregate's parameter doesn't explain well, especially parameter "zeroValue". It's helpful to let j

spark git commit: [SPARK-12638][API DOC] Parameter explanation not very accurate for rdd function "aggregate"

2016-01-12 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 a6c9c68d8 -> 46fc7a12a [SPARK-12638][API DOC] Parameter explanation not very accurate for rdd function "aggregate" Currently, RDD function aggregate's parameter doesn't explain well, especially parameter "zeroValue". It's helpful to l

spark git commit: [SPARK-5273][MLLIB][DOCS] Improve documentation examples for LinearRegression

2016-01-12 Thread srowen
Repository: spark Updated Branches: refs/heads/master c48f2a3a5 -> 9c7f34af3 [SPARK-5273][MLLIB][DOCS] Improve documentation examples for LinearRegression Use a much smaller step size in LinearRegressionWithSGD MLlib examples to achieve a reasonable RMSE. Our training folks hit this exact sa

spark git commit: [SPARK-7615][MLLIB] MLLIB Word2Vec wordVectors divided by Euclidean Norm equals to zero

2016-01-12 Thread srowen
Repository: spark Updated Branches: refs/heads/master 8cfa218f4 -> c48f2a3a5 [SPARK-7615][MLLIB] MLLIB Word2Vec wordVectors divided by Euclidean Norm equals to zero Cosine similarity with 0 vector should be 0 Related to https://github.com/apache/spark/pull/10152 Author: Sean Owen Closes #

spark git commit: [SPARK-12692][BUILD][SQL] Scala style: Fix the style violation (Space before ", " or ":")

2016-01-12 Thread rxin
Repository: spark Updated Branches: refs/heads/master 112abf910 -> 8cfa218f4 [SPARK-12692][BUILD][SQL] Scala style: Fix the style violation (Space before "," or ":") Fix the style violation (space before , and :). This PR is a followup for #10643. Author: Kousuke Saruta Closes #10718 from