spark git commit: [SPARK-23267][SQL] Increase spark.sql.codegen.hugeMethodLimit to 65535

2018-01-30 Thread lixiao
Repository: spark Updated Branches: refs/heads/master a23187f53 -> 31c00ad8b [SPARK-23267][SQL] Increase spark.sql.codegen.hugeMethodLimit to 65535 ## What changes were proposed in this pull request? Still saw the performance regression introduced by `spark.sql.codegen.hugeMethodLimit` in

spark git commit: [SPARK-23157][SQL] Explain restriction on column expression in withColumn()

2018-01-29 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 a81ace196 -> bb7502f9a [SPARK-23157][SQL] Explain restriction on column expression in withColumn() ## What changes were proposed in this pull request? It's not obvious from the comments that any added column must be a function of the

spark git commit: [SPARK-23157][SQL] Explain restriction on column expression in withColumn()

2018-01-29 Thread lixiao
Repository: spark Updated Branches: refs/heads/master b375397b1 -> 8b983243e [SPARK-23157][SQL] Explain restriction on column expression in withColumn() ## What changes were proposed in this pull request? It's not obvious from the comments that any added column must be a function of the

spark git commit: [SPARK-22221][SQL][FOLLOWUP] Externalize spark.sql.execution.arrow.maxRecordsPerBatch

2018-01-29 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 75131ee86 -> 2858eaafa [SPARK-1][SQL][FOLLOWUP] Externalize spark.sql.execution.arrow.maxRecordsPerBatch ## What changes were proposed in this pull request? This is a followup to #19575 which added a section on setting max Arrow

spark git commit: [SPARK-22221][SQL][FOLLOWUP] Externalize spark.sql.execution.arrow.maxRecordsPerBatch

2018-01-29 Thread lixiao
Repository: spark Updated Branches: refs/heads/master b834446ec -> f235df66a [SPARK-1][SQL][FOLLOWUP] Externalize spark.sql.execution.arrow.maxRecordsPerBatch ## What changes were proposed in this pull request? This is a followup to #19575 which added a section on setting max Arrow

spark git commit: [SPARK-22916][SQL][FOLLOW-UP] Update the Description of Join Selection

2018-01-29 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 6588e007e -> 438631031 [SPARK-22916][SQL][FOLLOW-UP] Update the Description of Join Selection ## What changes were proposed in this pull request? This PR is to update the description of the join algorithm changes. ## How was this

spark git commit: [SPARK-22916][SQL][FOLLOW-UP] Update the Description of Join Selection

2018-01-29 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 0d60b3213 -> e30b34f7b [SPARK-22916][SQL][FOLLOW-UP] Update the Description of Join Selection ## What changes were proposed in this pull request? This PR is to update the description of the join algorithm changes. ## How was this patch

spark git commit: [SPARK-22221][DOCS] Adding User Documentation for Arrow

2018-01-29 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 d68198d26 -> 6588e007e [SPARK-1][DOCS] Adding User Documentation for Arrow ## What changes were proposed in this pull request? Adding user facing documentation for working with Arrow in Spark Author: Bryan Cutler

spark git commit: [SPARK-22221][DOCS] Adding User Documentation for Arrow

2018-01-29 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 2d903cf9d -> 0d60b3213 [SPARK-1][DOCS] Adding User Documentation for Arrow ## What changes were proposed in this pull request? Adding user facing documentation for working with Arrow in Spark Author: Bryan Cutler

spark git commit: [SPARK-23059][SQL][TEST] Correct some improper with view related method usage

2018-01-29 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 54dd7cf4e -> fbce2ed0f [SPARK-23059][SQL][TEST] Correct some improper with view related method usage ## What changes were proposed in this pull request? Correct some improper with view related method usage Only change test cases like:

spark git commit: [SPARK-23199][SQL] improved Removes repetition from group expressions in Aggregate

2018-01-29 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 de66abafc -> 4059454f9 [SPARK-23199][SQL] improved Removes repetition from group expressions in Aggregate ## What changes were proposed in this pull request? Currently, all Aggregate operations will go into

spark git commit: [SPARK-23199][SQL] improved Removes repetition from group expressions in Aggregate

2018-01-29 Thread lixiao
Repository: spark Updated Branches: refs/heads/master badf0d0e0 -> 54dd7cf4e [SPARK-23199][SQL] improved Removes repetition from group expressions in Aggregate ## What changes were proposed in this pull request? Currently, all Aggregate operations will go into

spark git commit: [SPARK-23233][PYTHON] Reset the cache in asNondeterministic to set deterministic properly

2018-01-27 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 65600bfdb -> 3b6fc286d [SPARK-23233][PYTHON] Reset the cache in asNondeterministic to set deterministic properly ## What changes were proposed in this pull request? Reproducer: ```python from pyspark.sql.functions import udf f =

spark git commit: [SPARK-23233][PYTHON] Reset the cache in asNondeterministic to set deterministic properly

2018-01-27 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 6328868e5 -> 3227d14fe [SPARK-23233][PYTHON] Reset the cache in asNondeterministic to set deterministic properly ## What changes were proposed in this pull request? Reproducer: ```python from pyspark.sql.functions import udf f =

spark git commit: [SPARK-23214][SQL] cached data should not carry extra hint info

2018-01-26 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 7aaf23cf8 -> 20c0efe48 [SPARK-23214][SQL] cached data should not carry extra hint info ## What changes were proposed in this pull request? This is a regression introduced by https://github.com/apache/spark/pull/19864 When we lookup

spark git commit: [SPARK-23214][SQL] cached data should not carry extra hint info

2018-01-26 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 073744985 -> 5b5447c68 [SPARK-23214][SQL] cached data should not carry extra hint info ## What changes were proposed in this pull request? This is a regression introduced by https://github.com/apache/spark/pull/19864 When we lookup

spark git commit: [SPARK-23218][SQL] simplify ColumnVector.getArray

2018-01-26 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 3e2525147 -> dd8e257d1 [SPARK-23218][SQL] simplify ColumnVector.getArray ## What changes were proposed in this pull request? `ColumnVector` is very flexible about how to implement array type. As a result `ColumnVector` has 3 abstract

spark git commit: [SPARK-23218][SQL] simplify ColumnVector.getArray

2018-01-26 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 ab1b5d921 -> ca3613be2 [SPARK-23218][SQL] simplify ColumnVector.getArray ## What changes were proposed in this pull request? `ColumnVector` is very flexible about how to implement array type. As a result `ColumnVector` has 3 abstract

spark git commit: [SPARK-23032][SQL] Add a per-query codegenStageId to WholeStageCodegenExec

2018-01-25 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 8480c0c57 -> e57f39481 [SPARK-23032][SQL] Add a per-query codegenStageId to WholeStageCodegenExec ## What changes were proposed in this pull request? **Proposal** Add a per-query ID to the codegen stages as represented by

spark git commit: [SPARK-23032][SQL] Add a per-query codegenStageId to WholeStageCodegenExec

2018-01-25 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 2f65c20ea -> 26a8b4e39 [SPARK-23032][SQL] Add a per-query codegenStageId to WholeStageCodegenExec ## What changes were proposed in this pull request? **Proposal** Add a per-query ID to the codegen stages as represented by

spark git commit: [MINOR][SQL] add new unit test to LimitPushdown

2018-01-24 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 500c94434 -> a857ad566 [MINOR][SQL] add new unit test to LimitPushdown ## What changes were proposed in this pull request? This PR is repaired as follows 1、update y -> x in "left outer join" test case ,maybe is mistake. 2、add a

spark git commit: [MINOR][SQL] add new unit test to LimitPushdown

2018-01-24 Thread lixiao
Repository: spark Updated Branches: refs/heads/master bc9641d90 -> 6f0ba8472 [MINOR][SQL] add new unit test to LimitPushdown ## What changes were proposed in this pull request? This PR is repaired as follows 1、update y -> x in "left outer join" test case ,maybe is mistake. 2、add a new

spark git commit: [SPARK-22837][SQL] Session timeout checker does not work in SessionManager.

2018-01-24 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 2221a3035 -> 30272c668 [SPARK-22837][SQL] Session timeout checker does not work in SessionManager. ## What changes were proposed in this pull request? Currently we do not call the `super.init(hiveConf)` in

spark git commit: [SPARK-22837][SQL] Session timeout checker does not work in SessionManager.

2018-01-24 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 8c273b416 -> bbb87b350 [SPARK-22837][SQL] Session timeout checker does not work in SessionManager. ## What changes were proposed in this pull request? Currently we do not call the `super.init(hiveConf)` in `SparkSQLSessionManager.init`.

spark git commit: Revert "[SPARK-23195][SQL] Keep the Hint of Cached Data"

2018-01-23 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 9cfe90e5a -> d656be74b Revert "[SPARK-23195][SQL] Keep the Hint of Cached Data" This reverts commit a23f6b13e8a4f0471ee33879a14746786bbf0435. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit:

spark git commit: Revert "[SPARK-23195][SQL] Keep the Hint of Cached Data"

2018-01-23 Thread lixiao
Repository: spark Updated Branches: refs/heads/master f54b65c15 -> 4e7b49041 Revert "[SPARK-23195][SQL] Keep the Hint of Cached Data" This reverts commit 44cc4daf3a03f1a220eef8ce3c86867745db9ab7. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit:

spark git commit: [SPARK-23195][SQL] Keep the Hint of Cached Data

2018-01-23 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 851c30386 -> a23f6b13e [SPARK-23195][SQL] Keep the Hint of Cached Data ## What changes were proposed in this pull request? The broadcast hint of the cached plan is lost if we cache the plan. This PR is to correct it. ```Scala val

spark git commit: [SPARK-23195][SQL] Keep the Hint of Cached Data

2018-01-23 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 613c29033 -> 44cc4daf3 [SPARK-23195][SQL] Keep the Hint of Cached Data ## What changes were proposed in this pull request? The broadcast hint of the cached plan is lost if we cache the plan. This PR is to correct it. ```Scala val df1 =

spark git commit: [SPARK-23192][SQL] Keep the Hint after Using Cached Data

2018-01-23 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 05839d164 -> 613c29033 [SPARK-23192][SQL] Keep the Hint after Using Cached Data ## What changes were proposed in this pull request? The hint of the plan segment is lost, if the plan segment is replaced by the cached data. ```Scala

spark git commit: [SPARK-23192][SQL] Keep the Hint after Using Cached Data

2018-01-23 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 f8f522c01 -> 851c30386 [SPARK-23192][SQL] Keep the Hint after Using Cached Data ## What changes were proposed in this pull request? The hint of the plan segment is lost, if the plan segment is replaced by the cached data. ```Scala

spark git commit: [SPARK-22389][SQL] data source v2 partitioning reporting interface

2018-01-22 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 76b8b840d -> 51eb75026 [SPARK-22389][SQL] data source v2 partitioning reporting interface ## What changes were proposed in this pull request? a new interface which allows data source to report partitioning and avoid shuffle at Spark

spark git commit: [SPARK-22389][SQL] data source v2 partitioning reporting interface

2018-01-22 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 566ef93a6 -> 7241556d8 [SPARK-22389][SQL] data source v2 partitioning reporting interface ## What changes were proposed in this pull request? a new interface which allows data source to report partitioning and avoid shuffle at Spark

spark git commit: [MINOR][SQL][TEST] Test case cleanups for recent PRs

2018-01-22 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 78801881c -> 896e45af5 [MINOR][SQL][TEST] Test case cleanups for recent PRs ## What changes were proposed in this pull request? Revert the unneeded test case changes we made in SPARK-23000 Also fixes the test suites that do not call

spark git commit: [MINOR][SQL][TEST] Test case cleanups for recent PRs

2018-01-22 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 d933fcea6 -> 1069fad41 [MINOR][SQL][TEST] Test case cleanups for recent PRs ## What changes were proposed in this pull request? Revert the unneeded test case changes we made in SPARK-23000 Also fixes the test suites that do not call

spark git commit: [SPARK-23170][SQL] Dump the statistics of effective runs of analyzer and optimizer rules

2018-01-22 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 73281161f -> 78801881c [SPARK-23170][SQL] Dump the statistics of effective runs of analyzer and optimizer rules ## What changes were proposed in this pull request? Dump the statistics of effective runs of analyzer and optimizer rules.

spark git commit: [SPARK-23170][SQL] Dump the statistics of effective runs of analyzer and optimizer rules

2018-01-22 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 743b9173f -> d933fcea6 [SPARK-23170][SQL] Dump the statistics of effective runs of analyzer and optimizer rules ## What changes were proposed in this pull request? Dump the statistics of effective runs of analyzer and optimizer

spark git commit: [SPARK-23122][PYSPARK][FOLLOW-UP] Update the docs for UDF Registration

2018-01-22 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 cf078a205 -> 743b9173f [SPARK-23122][PYSPARK][FOLLOW-UP] Update the docs for UDF Registration ## What changes were proposed in this pull request? This PR is to update the docs for UDF registration ## How was this patch tested? N/A

spark git commit: [SPARK-23122][PYSPARK][FOLLOW-UP] Update the docs for UDF Registration

2018-01-22 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 60175e959 -> 73281161f [SPARK-23122][PYSPARK][FOLLOW-UP] Update the docs for UDF Registration ## What changes were proposed in this pull request? This PR is to update the docs for UDF registration ## How was this patch tested? N/A

spark git commit: [SPARK-23087][SQL] CheckCartesianProduct too restrictive when condition is false/null

2018-01-20 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 b9c1367b7 -> e0ef30f77 [SPARK-23087][SQL] CheckCartesianProduct too restrictive when condition is false/null ## What changes were proposed in this pull request? CheckCartesianProduct raises an AnalysisException also when the join

spark git commit: [SPARK-23087][SQL] CheckCartesianProduct too restrictive when condition is false/null

2018-01-20 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 00d169156 -> 121dc96f0 [SPARK-23087][SQL] CheckCartesianProduct too restrictive when condition is false/null ## What changes were proposed in this pull request? CheckCartesianProduct raises an AnalysisException also when the join

spark git commit: [SPARK-21786][SQL] The 'spark.sql.parquet.compression.codec' and 'spark.sql.orc.compression.codec' configuration doesn't take effect on hive table writing

2018-01-20 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 e11d5eaf7 -> b9c1367b7 [SPARK-21786][SQL] The 'spark.sql.parquet.compression.codec' and 'spark.sql.orc.compression.codec' configuration doesn't take effect on hive table writing [SPARK-21786][SQL] The

spark git commit: [SPARK-21786][SQL] The 'spark.sql.parquet.compression.codec' and 'spark.sql.orc.compression.codec' configuration doesn't take effect on hive table writing

2018-01-20 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 84a076e0e -> 00d169156 [SPARK-21786][SQL] The 'spark.sql.parquet.compression.codec' and 'spark.sql.orc.compression.codec' configuration doesn't take effect on hive table writing [SPARK-21786][SQL] The

spark git commit: [SPARK-23165][DOC] Spelling mistake fix in quick-start doc.

2018-01-20 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 0cde5212a -> e11d5eaf7 [SPARK-23165][DOC] Spelling mistake fix in quick-start doc. ## What changes were proposed in this pull request? Fix spelling in quick-start doc. ## How was this patch tested? Doc only. Author: Shashwat Anand

spark git commit: [SPARK-23165][DOC] Spelling mistake fix in quick-start doc.

2018-01-20 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 396cdfbea -> 84a076e0e [SPARK-23165][DOC] Spelling mistake fix in quick-start doc. ## What changes were proposed in this pull request? Fix spelling in quick-start doc. ## How was this patch tested? Doc only. Author: Shashwat Anand

spark git commit: [SPARK-23091][ML] Incorrect unit test for approxQuantile

2018-01-19 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 793841c6b -> 396cdfbea [SPARK-23091][ML] Incorrect unit test for approxQuantile ## What changes were proposed in this pull request? Narrow bound on approx quantile test to epsilon from 2*epsilon to match paper ## How was this patch

spark git commit: [SPARK-23091][ML] Incorrect unit test for approxQuantile

2018-01-19 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 c647f918b -> 0cde5212a [SPARK-23091][ML] Incorrect unit test for approxQuantile ## What changes were proposed in this pull request? Narrow bound on approx quantile test to epsilon from 2*epsilon to match paper ## How was this patch

spark git commit: [SPARK-21771][SQL] remove useless hive client in SparkSQLEnv

2018-01-19 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 f9ad00a5a -> c647f918b [SPARK-21771][SQL] remove useless hive client in SparkSQLEnv ## What changes were proposed in this pull request? Once a meta hive client is created, it generates its SessionState which creates a lot of session

spark git commit: [SPARK-21771][SQL] remove useless hive client in SparkSQLEnv

2018-01-19 Thread lixiao
Repository: spark Updated Branches: refs/heads/master f6da41b01 -> 793841c6b [SPARK-21771][SQL] remove useless hive client in SparkSQLEnv ## What changes were proposed in this pull request? Once a meta hive client is created, it generates its SessionState which creates a lot of session

spark git commit: [SPARK-23149][SQL] polish ColumnarBatch

2018-01-19 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 606a7485f -> d8aaa771e [SPARK-23149][SQL] polish ColumnarBatch ## What changes were proposed in this pull request? Several cleanups in `ColumnarBatch` * remove `schema`. The `ColumnVector`s inside `ColumnarBatch` already have the data

spark git commit: [SPARK-23149][SQL] polish ColumnarBatch

2018-01-19 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 8d6845cf9 -> 55efeffd7 [SPARK-23149][SQL] polish ColumnarBatch ## What changes were proposed in this pull request? Several cleanups in `ColumnarBatch` * remove `schema`. The `ColumnVector`s inside `ColumnarBatch` already have the

spark git commit: [SPARK-23062][SQL] Improve EXCEPT documentation

2018-01-17 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 8ef323c57 -> bfbc2d41b [SPARK-23062][SQL] Improve EXCEPT documentation ## What changes were proposed in this pull request? Make the default behavior of EXCEPT (i.e. EXCEPT DISTINCT) more explicit in the documentation, and call out the

spark git commit: [SPARK-23062][SQL] Improve EXCEPT documentation

2018-01-17 Thread lixiao
Repository: spark Updated Branches: refs/heads/master a0aedb0de -> 1f3d933e0 [SPARK-23062][SQL] Improve EXCEPT documentation ## What changes were proposed in this pull request? Make the default behavior of EXCEPT (i.e. EXCEPT DISTINCT) more explicit in the documentation, and call out the

spark git commit: [SPARK-22361][SQL][TEST] Add unit test for Window Frames

2018-01-16 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 41d1a323c -> 08252bb38 [SPARK-22361][SQL][TEST] Add unit test for Window Frames ## What changes were proposed in this pull request? There are already quite a few integration tests using window frames, but the unit tests coverage is

spark git commit: [SPARK-22361][SQL][TEST] Add unit test for Window Frames

2018-01-16 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 0c2ba427b -> a9b845ebb [SPARK-22361][SQL][TEST] Add unit test for Window Frames ## What changes were proposed in this pull request? There are already quite a few integration tests using window frames, but the unit tests coverage is not

spark git commit: [SPARK-23095][SQL] Decorrelation of scalar subquery fails with java.util.NoSuchElementException

2018-01-16 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.2 7022ef800 -> d09eecccf [SPARK-23095][SQL] Decorrelation of scalar subquery fails with java.util.NoSuchElementException ## What changes were proposed in this pull request? The following SQL involving scalar correlated query returns a

spark git commit: [SPARK-23095][SQL] Decorrelation of scalar subquery fails with java.util.NoSuchElementException

2018-01-16 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 5ae91 -> 0c2ba427b [SPARK-23095][SQL] Decorrelation of scalar subquery fails with java.util.NoSuchElementException ## What changes were proposed in this pull request? The following SQL involving scalar correlated query returns a map

spark git commit: [SPARK-23095][SQL] Decorrelation of scalar subquery fails with java.util.NoSuchElementException

2018-01-16 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 833a584bb -> 41d1a323c [SPARK-23095][SQL] Decorrelation of scalar subquery fails with java.util.NoSuchElementException ## What changes were proposed in this pull request? The following SQL involving scalar correlated query returns a

spark git commit: [SPARK-23000] Use fully qualified table names in HiveMetastoreCatalogSuite

2018-01-15 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 bb8e5addc -> e2ffb9781 [SPARK-23000] Use fully qualified table names in HiveMetastoreCatalogSuite ## What changes were proposed in this pull request? In another attempt to fix DataSourceWithHiveMetastoreCatalogSuite, this patch uses

spark git commit: [SPARK-23000] Use fully qualified table names in HiveMetastoreCatalogSuite

2018-01-15 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 8ab2d7ea9 -> c7572b79d [SPARK-23000] Use fully qualified table names in HiveMetastoreCatalogSuite ## What changes were proposed in this pull request? In another attempt to fix DataSourceWithHiveMetastoreCatalogSuite, this patch uses

spark git commit: [SPARK-23035][SQL] Fix improper information of TempTableAlreadyExistsException

2018-01-15 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 c6a3b9297 -> 706a308bd [SPARK-23035][SQL] Fix improper information of TempTableAlreadyExistsException ## What changes were proposed in this pull request? Problem: it throw TempTableAlreadyExistsException and output "Temporary table

spark git commit: [SPARK-23070] Bump previousSparkVersion in MimaBuild.scala to be 2.2.0

2018-01-15 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 3491ca4fb -> c6a3b9297 [SPARK-23070] Bump previousSparkVersion in MimaBuild.scala to be 2.2.0 ## What changes were proposed in this pull request? Bump previousSparkVersion in MimaBuild.scala to be 2.2.0 and add the missing exclusions

spark git commit: [SPARK-23070] Bump previousSparkVersion in MimaBuild.scala to be 2.2.0

2018-01-15 Thread lixiao
Repository: spark Updated Branches: refs/heads/master a38c887ac -> bd08a9e7a [SPARK-23070] Bump previousSparkVersion in MimaBuild.scala to be 2.2.0 ## What changes were proposed in this pull request? Bump previousSparkVersion in MimaBuild.scala to be 2.2.0 and add the missing exclusions to

spark git commit: [SPARK-22999][SQL] show databases like command' can remove the like keyword

2018-01-14 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 9051e1a26 -> 2879236b9 [SPARK-22999][SQL] show databases like command' can remove the like keyword ## What changes were proposed in this pull request? SHOW DATABASES (LIKE pattern = STRING)? Can be like the back increase? When using

spark git commit: [SPARK-22999][SQL] show databases like command' can remove the like keyword

2018-01-14 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 60eeecd77 -> 42a1a15d7 [SPARK-22999][SQL] show databases like command' can remove the like keyword ## What changes were proposed in this pull request? SHOW DATABASES (LIKE pattern = STRING)? Can be like the back increase? When using this

spark git commit: [SPARK-23021][SQL] AnalysisBarrier should override innerChildren to print correct explain output

2018-01-14 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 0d425c336 -> 5fbbd94d5 [SPARK-23021][SQL] AnalysisBarrier should override innerChildren to print correct explain output ## What changes were proposed in this pull request? `AnalysisBarrier` in the current master cuts off explain

spark git commit: [SPARK-23021][SQL] AnalysisBarrier should override innerChildren to print correct explain output

2018-01-14 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 66738d29c -> 990f05c80 [SPARK-23021][SQL] AnalysisBarrier should override innerChildren to print correct explain output ## What changes were proposed in this pull request? `AnalysisBarrier` in the current master cuts off explain results

spark git commit: [SPARK-21213][SQL][FOLLOWUP] Use compatible types for comparisons in compareAndGetNewStats

2018-01-13 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 0fc5533e5 -> bcd87ae07 [SPARK-21213][SQL][FOLLOWUP] Use compatible types for comparisons in compareAndGetNewStats ## What changes were proposed in this pull request? This pr fixed code to compare values in `compareAndGetNewStats`. The

spark git commit: [SPARK-21213][SQL][FOLLOWUP] Use compatible types for comparisons in compareAndGetNewStats

2018-01-13 Thread lixiao
Repository: spark Updated Branches: refs/heads/master ba891ec99 -> 0066d6f6f [SPARK-21213][SQL][FOLLOWUP] Use compatible types for comparisons in compareAndGetNewStats ## What changes were proposed in this pull request? This pr fixed code to compare values in `compareAndGetNewStats`. The

spark git commit: [SPARK-22790][SQL] add a configurable factor to describe HadoopFsRelation's size

2018-01-13 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 8d32ed5f2 -> 0fc5533e5 [SPARK-22790][SQL] add a configurable factor to describe HadoopFsRelation's size ## What changes were proposed in this pull request? as per discussion in

spark git commit: [SPARK-22790][SQL] add a configurable factor to describe HadoopFsRelation's size

2018-01-13 Thread lixiao
Repository: spark Updated Branches: refs/heads/master bd4a21b48 -> ba891ec99 [SPARK-22790][SQL] add a configurable factor to describe HadoopFsRelation's size ## What changes were proposed in this pull request? as per discussion in

spark git commit: [SPARK-23036][SQL][TEST] Add withGlobalTempView for testing

2018-01-13 Thread lixiao
Repository: spark Updated Branches: refs/heads/master fc6fe8a1d -> bd4a21b48 [SPARK-23036][SQL][TEST] Add withGlobalTempView for testing ## What changes were proposed in this pull request? Add withGlobalTempView when create global temp view, like withTempView and withView. And correct some

spark git commit: [SPARK-23036][SQL][TEST] Add withGlobalTempView for testing

2018-01-13 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 801ffd799 -> 8d32ed5f2 [SPARK-23036][SQL][TEST] Add withGlobalTempView for testing ## What changes were proposed in this pull request? Add withGlobalTempView when create global temp view, like withTempView and withView. And correct

spark git commit: [SPARK-23025][SQL] Support Null type in scala reflection

2018-01-12 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 d512d873b -> 6152da389 [SPARK-23025][SQL] Support Null type in scala reflection ## What changes were proposed in this pull request? Add support for `Null` type in the `schemaFor` method for Scala reflection. ## How was this patch

spark git commit: [SPARK-23025][SQL] Support Null type in scala reflection

2018-01-12 Thread lixiao
Repository: spark Updated Branches: refs/heads/master a7d98d53c -> 505086806 [SPARK-23025][SQL] Support Null type in scala reflection ## What changes were proposed in this pull request? Add support for `Null` type in the `schemaFor` method for Scala reflection. ## How was this patch tested?

spark git commit: [SPARK-23000][TEST-HADOOP2.6] Fix Flaky test suite DataSourceWithHiveMetastoreCatalogSuite

2018-01-11 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 799598905 -> 9ca0f6eaf [SPARK-23000][TEST-HADOOP2.6] Fix Flaky test suite DataSourceWithHiveMetastoreCatalogSuite ## What changes were proposed in this pull request? The Spark 2.3 branch still failed due to the flaky test suite

spark git commit: [SPARK-23000][TEST-HADOOP2.6] Fix Flaky test suite DataSourceWithHiveMetastoreCatalogSuite

2018-01-11 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 0552c36e0 -> 76892bcf2 [SPARK-23000][TEST-HADOOP2.6] Fix Flaky test suite DataSourceWithHiveMetastoreCatalogSuite ## What changes were proposed in this pull request? The Spark 2.3 branch still failed due to the flaky test suite

spark git commit: [SPARK-22972] Couldn't find corresponding Hive SerDe for data source provider org.apache.spark.sql.hive.orc

2018-01-10 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.2 24f1f2a54 -> 0d943d96b [SPARK-22972] Couldn't find corresponding Hive SerDe for data source provider org.apache.spark.sql.hive.orc ## What changes were proposed in this pull request? Fix the warning: Couldn't find corresponding Hive

spark git commit: [SPARK-22972] Couldn't find corresponding Hive SerDe for data source provider org.apache.spark.sql.hive.orc

2018-01-08 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 eecd83cb2 -> 8032cf852 [SPARK-22972] Couldn't find corresponding Hive SerDe for data source provider org.apache.spark.sql.hive.orc ## What changes were proposed in this pull request? Fix the warning: Couldn't find corresponding Hive

spark git commit: [SPARK-22972] Couldn't find corresponding Hive SerDe for data source provider org.apache.spark.sql.hive.orc

2018-01-08 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 4f7e75883 -> 68ce792b5 [SPARK-22972] Couldn't find corresponding Hive SerDe for data source provider org.apache.spark.sql.hive.orc ## What changes were proposed in this pull request? Fix the warning: Couldn't find corresponding Hive SerDe

spark git commit: [SPARK-22983] Don't push filters beneath aggregates with empty grouping expressions

2018-01-08 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.2 41f705a57 -> 7c30ae39f [SPARK-22983] Don't push filters beneath aggregates with empty grouping expressions ## What changes were proposed in this pull request? The following SQL query should return zero rows, but in Spark it actually

spark git commit: [SPARK-22983] Don't push filters beneath aggregates with empty grouping expressions

2018-01-08 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 8bf24e9fe -> 6964dfe47 [SPARK-22983] Don't push filters beneath aggregates with empty grouping expressions ## What changes were proposed in this pull request? The following SQL query should return zero rows, but in Spark it actually

spark git commit: [SPARK-22983] Don't push filters beneath aggregates with empty grouping expressions

2018-01-08 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 8fdeb4b99 -> 2c73d2a94 [SPARK-22983] Don't push filters beneath aggregates with empty grouping expressions ## What changes were proposed in this pull request? The following SQL query should return zero rows, but in Spark it actually

spark git commit: [SPARK-22985] Fix argument escaping bug in from_utc_timestamp / to_utc_timestamp codegen

2018-01-07 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 285d342c4 -> 7673e9c56 [SPARK-22985] Fix argument escaping bug in from_utc_timestamp / to_utc_timestamp codegen ## What changes were proposed in this pull request? This patch adds additional escaping in `from_utc_timestamp` /

spark git commit: [SPARK-22985] Fix argument escaping bug in from_utc_timestamp / to_utc_timestamp codegen

2018-01-07 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 18e941499 -> 71d65a321 [SPARK-22985] Fix argument escaping bug in from_utc_timestamp / to_utc_timestamp codegen ## What changes were proposed in this pull request? This patch adds additional escaping in `from_utc_timestamp` /

spark git commit: [HOTFIX] Fix style checking failure

2018-01-06 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 b66700a5e -> f9e7b0c8a [HOTFIX] Fix style checking failure ## What changes were proposed in this pull request? This PR is to fix the style checking failure. ## How was this patch tested? N/A Author: gatorsmile

spark git commit: [HOTFIX] Fix style checking failure

2018-01-06 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 993f21567 -> 9a7048b28 [HOTFIX] Fix style checking failure ## What changes were proposed in this pull request? This PR is to fix the style checking failure. ## How was this patch tested? N/A Author: gatorsmile

spark git commit: [SPARK-22901][PYTHON][FOLLOWUP] Adds the doc for asNondeterministic for wrapped UDF function

2018-01-06 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 037775598 -> b66700a5e [SPARK-22901][PYTHON][FOLLOWUP] Adds the doc for asNondeterministic for wrapped UDF function ## What changes were proposed in this pull request? This PR wraps the `asNondeterministic` attribute in the wrapped

spark git commit: [SPARK-22901][PYTHON][FOLLOWUP] Adds the doc for asNondeterministic for wrapped UDF function

2018-01-06 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 7b7804142 -> 993f21567 [SPARK-22901][PYTHON][FOLLOWUP] Adds the doc for asNondeterministic for wrapped UDF function ## What changes were proposed in this pull request? This PR wraps the `asNondeterministic` attribute in the wrapped UDF

spark git commit: [SPARK-21786][SQL] When acquiring 'compressionCodecClassName' in 'ParquetOptions', `parquet.compression` needs to be considered.

2018-01-06 Thread lixiao
Repository: spark Updated Branches: refs/heads/master be9a804f2 -> 7b7804142 [SPARK-21786][SQL] When acquiring 'compressionCodecClassName' in 'ParquetOptions', `parquet.compression` needs to be considered. [SPARK-21786][SQL] When acquiring 'compressionCodecClassName' in 'ParquetOptions',

spark git commit: [SPARK-21786][SQL] When acquiring 'compressionCodecClassName' in 'ParquetOptions', `parquet.compression` needs to be considered.

2018-01-06 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 e6449e816 -> 037775598 [SPARK-21786][SQL] When acquiring 'compressionCodecClassName' in 'ParquetOptions', `parquet.compression` needs to be considered. [SPARK-21786][SQL] When acquiring 'compressionCodecClassName' in

spark git commit: [SPARK-22793][SQL] Memory leak in Spark Thrift Server

2018-01-06 Thread lixiao
Repository: spark Updated Branches: refs/heads/master f2dd8b923 -> be9a804f2 [SPARK-22793][SQL] Memory leak in Spark Thrift Server # What changes were proposed in this pull request? 1. Start HiveThriftServer2. 2. Connect to thriftserver through beeline. 3. Close the beeline. 4. repeat step2

spark git commit: [SPARK-22793][SQL] Memory leak in Spark Thrift Server

2018-01-06 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 7236914e5 -> e6449e816 [SPARK-22793][SQL] Memory leak in Spark Thrift Server # What changes were proposed in this pull request? 1. Start HiveThriftServer2. 2. Connect to thriftserver through beeline. 3. Close the beeline. 4. repeat

spark git commit: [SPARK-22930][PYTHON][SQL] Improve the description of Vectorized UDFs for non-deterministic cases

2018-01-06 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 3e3e9386e -> 7236914e5 [SPARK-22930][PYTHON][SQL] Improve the description of Vectorized UDFs for non-deterministic cases ## What changes were proposed in this pull request? Add tests for using non deterministic UDFs in aggregate.

spark git commit: [SPARK-22930][PYTHON][SQL] Improve the description of Vectorized UDFs for non-deterministic cases

2018-01-06 Thread lixiao
Repository: spark Updated Branches: refs/heads/master bf65cd3cd -> f2dd8b923 [SPARK-22930][PYTHON][SQL] Improve the description of Vectorized UDFs for non-deterministic cases ## What changes were proposed in this pull request? Add tests for using non deterministic UDFs in aggregate. Update

spark git commit: [SPARK-22937][SQL] SQL elt output binary for binary inputs

2018-01-05 Thread lixiao
Repository: spark Updated Branches: refs/heads/master ea9568330 -> e8af7e8ae [SPARK-22937][SQL] SQL elt output binary for binary inputs ## What changes were proposed in this pull request? This pr modified `elt` to output binary for binary inputs. `elt` in the current master always output data

spark git commit: [SPARK-22937][SQL] SQL elt output binary for binary inputs

2018-01-05 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 55afac4e7 -> bf853018c [SPARK-22937][SQL] SQL elt output binary for binary inputs ## What changes were proposed in this pull request? This pr modified `elt` to output binary for binary inputs. `elt` in the current master always output

spark git commit: [SPARK-22961][REGRESSION] Constant columns should generate QueryPlanConstraints

2018-01-05 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 f9dcdbcef -> fd4e30476 [SPARK-22961][REGRESSION] Constant columns should generate QueryPlanConstraints ## What changes were proposed in this pull request? #19201 introduced the following regression: given something like

spark git commit: [SPARK-22961][REGRESSION] Constant columns should generate QueryPlanConstraints

2018-01-05 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 6cff7d19f -> 51c33bd0d [SPARK-22961][REGRESSION] Constant columns should generate QueryPlanConstraints ## What changes were proposed in this pull request? #19201 introduced the following regression: given something like

spark git commit: [SPARK-22771][SQL] Add a missing return statement in Concat.checkInputDataTypes

2018-01-04 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 1f5e3540c -> bcfeef5a9 [SPARK-22771][SQL] Add a missing return statement in Concat.checkInputDataTypes ## What changes were proposed in this pull request? This pr is a follow-up to fix a bug left in #19977. ## How was this patch

spark git commit: [SPARK-22771][SQL] Add a missing return statement in Concat.checkInputDataTypes

2018-01-04 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 5aadbc929 -> 6f68316e9 [SPARK-22771][SQL] Add a missing return statement in Concat.checkInputDataTypes ## What changes were proposed in this pull request? This pr is a follow-up to fix a bug left in #19977. ## How was this patch tested?

<    4   5   6   7   8   9   10   11   12   13   >