[GitHub] spark pull request #13444: [SPARK-15530][SQL] Set #parallelism for file list...

2016-06-10 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/13444#discussion_r66682974 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/ListingFileCatalog.scala --- @@ -66,30 +66,13 @@ class ListingFileCatalog

[GitHub] spark pull request #13542: [SPARK-15730][SQL][WIP] Respect the --hiveconf in...

2016-06-10 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/13542#discussion_r66678717 --- Diff: sql/hive-thriftserver/src/test/scala/org/apache/spark/sql/hive/thriftserver/CliSuite.scala --- @@ -91,6 +91,8 @@ class CliSuite extends

[GitHub] spark issue #12313: [SPARK-14543] [SQL] Improve InsertIntoTable column resol...

2016-06-10 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/12313 @rdblue Thank you for updating the patch. I was out of town late last week and was busy on spark summit early this week. Sorry for my late reply. Having name-based resolution is very useful! Since

[GitHub] spark pull request #13413: [SPARK-15663][SQL] SparkSession.catalog.listFunct...

2016-06-09 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/13413#discussion_r66563299 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/SQLQuerySuite.scala --- @@ -58,15 +60,39 @@ class SQLQuerySuite extends QueryTest

[GitHub] spark pull request #13413: [SPARK-15663][SQL] SparkSession.catalog.listFunct...

2016-06-09 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/13413#discussion_r66563267 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/catalog/SessionCatalog.scala --- @@ -855,7 +855,8 @@ class SessionCatalog

[GitHub] spark pull request #13413: [SPARK-15663][SQL] SparkSession.catalog.listFunct...

2016-06-09 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/13413#discussion_r66563161 --- Diff: python/pyspark/sql/tests.py --- @@ -1481,17 +1481,7 @@ def test_list_functions(self): spark.sql("CREATE DATABASE so

[GitHub] spark issue #13371: [SPARK-15639][SQL] Try to push down filter at RowGroups ...

2016-06-09 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/13371 @viirya I took a look at parquet's code. Seems parquet only evaluates row group level filters when generating splits (https://github.com/apache/parquet-mr/blob/apache-parquet-1.7.0/parquet-hadoop/src

[GitHub] spark issue #13573: [SPARK-15839] Fix Maven doc-jar generation when JAVA_7_H...

2016-06-09 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/13573 I am going to trigger a snapshot build. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

spark git commit: [SPARK-15839] Fix Maven doc-jar generation when JAVA_7_HOME is set

2016-06-09 Thread yhuai
mvn clean install -DskipTests=true` when `JAVA_7_HOME` was set. Also manually inspected the effective POM diff to verify that the final POM changes were scoped correctly: https://gist.github.com/JoshRosen/f889d1c236fad14fa25ac4be01654653 /cc vanzin and yhuai for review. Author: Josh Rosen <

spark git commit: [SPARK-15839] Fix Maven doc-jar generation when JAVA_7_HOME is set

2016-06-09 Thread yhuai
ean install -DskipTests=true` when `JAVA_7_HOME` was set. Also manually inspected the effective POM diff to verify that the final POM changes were scoped correctly: https://gist.github.com/JoshRosen/f889d1c236fad14fa25ac4be01654653 /cc vanzin and yhuai for review. Author: Josh Rosen <

[GitHub] spark issue #13573: [SPARK-15839] Fix Maven doc-jar generation when JAVA_7_H...

2016-06-09 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/13573 Merging to master and branch 2.0.

[GitHub] spark issue #13573: [SPARK-15839] Fix Maven doc-jar generation when JAVA_7_H...

2016-06-09 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/13573 lgtm

[GitHub] spark issue #13549: [SPARK-15812][SQ][Streaming] Added support for sorting a...

2016-06-09 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/13549 test this please

[GitHub] spark issue #13573: [SPARK-15839] Fix Maven doc-jar generation when JAVA_7_H...

2016-06-09 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/13573 test this please

[GitHub] spark issue #13189: [SPARK-14670][SQL] allow updating driver side sql metric...

2016-06-07 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/13189 Seems it is fine to not have metrics when we use hiveResultString.

[GitHub] spark pull request #13534: [SPARK-15789][SQL] Allow reserved keywords in mos...

2016-06-07 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/13534#discussion_r66171754 --- Diff: sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/parser/DataTypeParserSuite.scala --- @@ -133,4 +133,8 @@ class

[GitHub] spark pull request #13450: [SPARK-9876] [BRANCH-2.0] Revert "[SPARK-9876][SQ...

2016-06-06 Thread yhuai
Github user yhuai closed the pull request at: https://github.com/apache/spark/pull/13450

[GitHub] spark pull request #13444: [SPARK-15530][SQL] Set #parallelism for file list...

2016-06-05 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/13444#discussion_r65823862 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/fileSourceInterfaces.scala --- @@ -409,13 +409,24 @@ private[sql] object

[GitHub] spark pull request #13444: [SPARK-15530][SQL] Set #parallelism for file list...

2016-06-05 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/13444#discussion_r65823818 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/ListingFileCatalog.scala --- @@ -75,7 +75,7 @@ class ListingFileCatalog

[GitHub] spark issue #13270: [SPARK-15269][SQL] Removes unexpected empty table direct...

2016-06-01 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/13270 LGTM. @liancheng Can you merge this?

[GitHub] spark issue #13455: [SPARK-15719][SQL] Disables writing Parquet summary file...

2016-06-01 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/13455 lgtm pending jenkins.

[GitHub] spark pull request #13290: [SPARK-13484] [SQL] Prevent illegal NULL propagat...

2016-06-01 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/13290#discussion_r65450769 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala --- @@ -1448,6 +1450,38 @@ class Analyzer

spark git commit: [SPARK-15596][SPARK-15635][SQL] ALTER TABLE RENAME fixes

2016-06-01 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-2.0 46d5f7f38 -> 44052a707 [SPARK-15596][SPARK-15635][SQL] ALTER TABLE RENAME fixes ## What changes were proposed in this pull request? **SPARK-15596**: Even after we renamed a cached table, the plan would remain in the cache with the

spark git commit: [SPARK-15596][SPARK-15635][SQL] ALTER TABLE RENAME fixes

2016-06-01 Thread yhuai
Repository: spark Updated Branches: refs/heads/master 5b08ee639 -> 9e2643b21 [SPARK-15596][SPARK-15635][SQL] ALTER TABLE RENAME fixes ## What changes were proposed in this pull request? **SPARK-15596**: Even after we renamed a cached table, the plan would remain in the cache with the old

[GitHub] spark issue #13416: [SPARK-15596][SPARK-15635][SQL] ALTER TABLE RENAME fixes

2016-06-01 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/13416 merging to master and branch 2.0

[GitHub] spark issue #13416: [SPARK-15596][SPARK-15635][SQL] ALTER TABLE RENAME fixes

2016-06-01 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/13416 lgtm

[GitHub] spark pull request #13445: [SPARK-9876] Revert "[SPARK-9876][SQL] Update Par...

2016-06-01 Thread yhuai
Github user yhuai closed the pull request at: https://github.com/apache/spark/pull/13445

[GitHub] spark issue #13450: [SPARK-9876] [BRANCH-2.0] Revert "[SPARK-9876][SQL] Upda...

2016-06-01 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/13450 @rdblue @liancheng Can you review this PR? This is for branch 2.0.

[GitHub] spark pull request #13450: [SPARK-9876] [BRANCH-2.0] Revert "[SPARK-9876][SQ...

2016-06-01 Thread yhuai
GitHub user yhuai opened a pull request: https://github.com/apache/spark/pull/13450 [SPARK-9876] [BRANCH-2.0] Revert "[SPARK-9876][SQL] Update Parquet to 1.8.1." ## What changes were proposed in this pull request? Since we are pretty late in the 2.0 rel

[GitHub] spark issue #13445: [SPARK-9876] Revert "[SPARK-9876][SQL] Update Parquet to...

2016-06-01 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/13445 I am closing this for now.

[GitHub] spark issue #13445: [SPARK-9876] Revert "[SPARK-9876][SQL] Update Parquet to...

2016-06-01 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/13445 OK. Let me create another PR for branch 2.0. We will merge that one first.

[GitHub] spark issue #13445: [SPARK-9876] Revert "[SPARK-9876][SQL] Update Parquet to...

2016-06-01 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/13445 @rdblue How about we merge this to master and branch 2.0? Feel free to open your PR again. We can figure out the perf thing with @liancheng together.

[GitHub] spark issue #10793: [SPARK-12853][SQL] ignore bucketing information if we do...

2016-06-01 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/10793 close this for now?

[GitHub] spark issue #13216: [SPARK-15434][SQL] improve EmbedSerializerInFilter rule

2016-06-01 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/13216 close this for now?

[GitHub] spark issue #13029: [SPARK-15252][CORE][WIP] add accumulator wrapper to have...

2016-06-01 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/13029 close this for now?

[GitHub] spark issue #13445: [SPARK-9876] Revert "[SPARK-9876][SQL] Update Parquet to...

2016-06-01 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/13445 btw, we observed an error when filter pushdown is enabled. Unfortunately, we missed the exception...

[GitHub] spark issue #13445: [SPARK-9876] Revert "[SPARK-9876][SQL] Update Parquet to...

2016-06-01 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/13445 @rdblue Since 2.0 branch has been cut, I am a little bit concerned about potential merge conflicts when we cherry-pick bug fixes into 2.0 branch before the release if we do not revert it from

[GitHub] spark pull request #13445: [SPARK-9876] Revert "[SPARK-9876][SQL] Update Par...

2016-06-01 Thread yhuai
GitHub user yhuai opened a pull request: https://github.com/apache/spark/pull/13445 [SPARK-9876] Revert "[SPARK-9876][SQL] Update Parquet to 1.8.1." ## What changes were proposed in this pull request? Since we are pretty late in the 2.0 release cycle, it is

[GitHub] spark pull request: [SPARK-9876][SQL]: Update Parquet to 1.8.1.

2016-06-01 Thread yhuai
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/13280 Hello @rdblue, we are pretty late in this release cycle. I am afraid that we cannot actually upgrade Parquet to 1.8.1 because of the following two reasons: 1. Since this change was merged

[GitHub] spark pull request: [SPARK-15646] [SQL] When spark.sql.hive.convertCTAS is t...

2016-05-31 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/13386#discussion_r65306005 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/SparkSqlParser.scala --- @@ -936,7 +936,39 @@ class SparkSqlAstBuilder(conf: SQLConf) extends

[GitHub] spark pull request: [SPARK-15646] [SQL] When spark.sql.hive.convertCTAS is t...

2016-05-31 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/13386#discussion_r65305666 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/SparkSqlParser.scala --- @@ -936,7 +936,39 @@ class SparkSqlAstBuilder(conf: SQLConf) extends

[GitHub] spark pull request: [SPARK-15646] [SQL] When spark.sql.hive.convertCTAS is t...

2016-05-31 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/13386#discussion_r65305383 --- Diff: sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveSessionState.scala --- @@ -139,22 +139,6 @@ private[hive] class HiveSessionState(sparkSession

[GitHub] spark pull request: [SPARK-15646] [SQL] When spark.sql.hive.convertCTAS is t...

2016-05-31 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/13386#discussion_r65305344 --- Diff: sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveMetastoreCatalog.scala --- @@ -447,52 +447,20 @@ private[hive] class HiveMetastoreCatalog

[GitHub] spark pull request: [SPARK-15639][SQL] Try to push down filter at RowGroups ...

2016-05-31 Thread yhuai
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/13371 It is a good idea to add it if parquet supports it (I have an impression that parquet does not support it. But maybe I am wrong). I think having benchmark results is a good practice, so we can

[GitHub] spark pull request: [SPARK-15639][SQL] Try to push down filter at RowGroups ...

2016-05-31 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/13371#discussion_r65302925 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala --- @@ -344,6 +344,11 @@ private[sql] class

[GitHub] spark pull request: [SPARK-15639][SQL] Try to push down filter at RowGroups ...

2016-05-31 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/13371#discussion_r65302899 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala --- @@ -344,6 +344,11 @@ private[sql] class

[GitHub] spark pull request: [SPARK-14949][SQL] Remove HiveConf dependency from Inser...

2016-05-31 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/12728#discussion_r65302512 --- Diff: sql/hive/src/main/scala/org/apache/spark/sql/hive/execution/InsertIntoHiveTable.scala --- @@ -114,13 +110,15 @@ case class InsertIntoHiveTable

[GitHub] spark pull request: [SPARK-15639][SQL] Try to push down filter at RowGroups ...

2016-05-31 Thread yhuai
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/13371 Can you provide a test case that shows the problem? Also, can you provide benchmark results of the performance benefit?

[GitHub] spark pull request: [SPARK-15639][SQL] Try to push down filter at RowGroups ...

2016-05-31 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/13371#discussion_r65301654 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala --- @@ -344,6 +344,11 @@ private[sql] class

[GitHub] spark pull request: [SPARK-15639][SQL] Try to push down filter at RowGroups ...

2016-05-31 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/13371#discussion_r65301661 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala --- @@ -578,62 +583,6 @@ private[sql] object

[GitHub] spark pull request: [SPARK-12988][SQL]Can't drop top level columns that cont...

2016-05-31 Thread yhuai
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/13306 This PR makes the behavior of drop and withColumn consistent. Let's decide what we do for backticks in a separate JIRA.

spark git commit: [SPARK-12988][SQL] Can't drop top level columns that contain dots

2016-05-31 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-2.0 7f240eaee -> b8de4ad7d [SPARK-12988][SQL] Can't drop top level columns that contain dots ## What changes were proposed in this pull request? Fixes "Can't drop top level columns that contain dots". This work is based on dilipbiswal's

spark git commit: [SPARK-12988][SQL] Can't drop top level columns that contain dots

2016-05-31 Thread yhuai
Repository: spark Updated Branches: refs/heads/master 0f2471346 -> 06514d689 [SPARK-12988][SQL] Can't drop top level columns that contain dots ## What changes were proposed in this pull request? Fixes "Can't drop top level columns that contain dots". This work is based on dilipbiswal's

[GitHub] spark pull request: [SPARK-12988][SQL]Can't drop top level columns that cont...

2016-05-31 Thread yhuai
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/13306 LGTM merging to master and branch 2.0.

[GitHub] spark pull request: [SPARK-15646] [SQL] When spark.sql.hive.convertCTAS is t...

2016-05-31 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/13386#discussion_r65265253 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/SparkSqlParser.scala --- @@ -936,7 +936,39 @@ class SparkSqlAstBuilder(conf: SQLConf) extends

[GitHub] spark pull request: [SPARK-15269][SQL] Removes unexpected empty table direct...

2016-05-31 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/13270#discussion_r65263898 --- Diff: sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveExternalCatalog.scala --- @@ -147,7 +152,41 @@ private[spark] class HiveExternalCatalog(client

[GitHub] spark pull request: [SPARK-15269][SQL] Removes unexpected empty table direct...

2016-05-31 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/13270#discussion_r65263694 --- Diff: sql/hive/src/main/scala/org/apache/spark/sql/hive/client/HiveClientImpl.scala --- @@ -368,14 +371,27 @@ private[hive] class HiveClientImpl

[GitHub] spark pull request: [SQL] Prevent illegal NULL propagation when filtering ou...

2016-05-31 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/13290#discussion_r65263136 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala --- @@ -1448,6 +1450,38 @@ class Analyzer

[GitHub] spark pull request: [SPARK-14507] [SQL] EXTERNAL keyword in a CTAS statement...

2016-05-31 Thread yhuai
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/13395 chatted with @andrewor14 . Since https://github.com/apache/spark/pull/13386/files will fix the location handling when convertCTAS is true, probably it is not really needed to ban EXTERNAL

[GitHub] spark pull request: [SPARK-14507] [SQL] EXTERNAL keyword in a CTAS statement...

2016-05-31 Thread yhuai
Github user yhuai closed the pull request at: https://github.com/apache/spark/pull/13395

[GitHub] spark pull request: [SPARK-15269][SQL] Removes unexpected empty table direct...

2016-05-31 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/13270#discussion_r65253802 --- Diff: sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveExternalCatalog.scala --- @@ -68,12 +72,13 @@ private[spark] class HiveExternalCatalog(client

spark git commit: [SPARK-15622][SQL] Wrap the parent classloader of Janino's classloader in the ParentClassLoader.

2016-05-31 Thread yhuai
iff-bb538fda94224dd0af01d0fd7e1b4ea0R81) and `test-only *ReplSuite -- -z "SPARK-2576 importing implicits"` still passes the test (without the change in `CodeGenerator`, this test does not pass with the change in `ExecutorClassLoader `). Author: Yin Huai <yh...@databricks.com> Closes #13366 from yhuai/SPARK-156

spark git commit: [SPARK-15622][SQL] Wrap the parent classloader of Janino's classloader in the ParentClassLoader.

2016-05-31 Thread yhuai
R81) and `test-only *ReplSuite -- -z "SPARK-2576 importing implicits"` still passes the test (without the change in `CodeGenerator`, this test does not pass with the change in `ExecutorClassLoader `). Author: Yin Huai <yh...@databricks.com> Closes #13366 from yhuai/SPARK-15622.

[GitHub] spark pull request: [SPARK-15622] [SQL] Wrap the parent classloader of Janin...

2016-05-31 Thread yhuai
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/13366 Thanks. Merging to master and branch 2.0.

[GitHub] spark pull request: [SPARK-15622] [SQL] Wrap the parent classloader of Janin...

2016-05-31 Thread yhuai
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/13366 Yea. `SPARK-2576 importing implicits` in REPL suite is a good test. Without the fix, ExecutorClassLoader throws ClassNotFoundException with those weird class names.

[GitHub] spark pull request: [SPARK-15494][SQL] encoder code cleanup

2016-05-31 Thread yhuai
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/13269 Are we going to break this PR to multiple smaller PRs?

spark git commit: [SPARK-15658][SQL] UDT serializer should declare its data type as udt instead of udt.sqlType

2016-05-31 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-2.0 6347ff512 -> 29b94fdb3 [SPARK-15658][SQL] UDT serializer should declare its data type as udt instead of udt.sqlType ## What changes were proposed in this pull request? When we build serializer for UDT object, we should declare its

spark git commit: [SPARK-15658][SQL] UDT serializer should declare its data type as udt instead of udt.sqlType

2016-05-31 Thread yhuai
Repository: spark Updated Branches: refs/heads/master d67c82e4b -> 2bfed1a0c [SPARK-15658][SQL] UDT serializer should declare its data type as udt instead of udt.sqlType ## What changes were proposed in this pull request? When we build serializer for UDT object, we should declare its data

[GitHub] spark pull request: [SPARK-15658][SQL] UDT serializer should declare its dat...

2016-05-31 Thread yhuai
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/13402 LGTM. Merging to master and branch 2.0.

[GitHub] spark pull request: [SPARK-15646] [SQL] When spark.sql.hive.convertCTAS is t...

2016-05-31 Thread yhuai
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/13386 test this please

[GitHub] spark pull request: [SPARK-14507] [SQL] EXTERNAL keyword in a CTAS statement...

2016-05-31 Thread yhuai
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/13395 @gatorsmile Thanks. For partitioned by, it is in https://github.com/apache/spark/pull/13386. For clustered by, seems we do have test case in HiveDDLCommandSuite.

[GitHub] spark pull request: [SPARK-15622] [SQL] Wrap the parent classloader of Janin...

2016-05-31 Thread yhuai
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/13366 test this please

[GitHub] spark pull request: [SQL] Prevent illegal NULL propagation when filtering ou...

2016-05-31 Thread yhuai
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/13290 test this please

[GitHub] spark pull request: [SPARK-14543] [SQL] Improve InsertIntoTable co...

2016-05-30 Thread yhuai
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/12313#issuecomment-222566966 @rdblue How about we use a separate PR for the work of adding `insertByNameInto`? It will be easier to review and the discussion on the API name/semantic will not block

[GitHub] spark pull request: [SPARK-14507] [SQL] EXTERNAL keyword in a CTAS...

2016-05-29 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/13395#discussion_r65013574 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/SparkSqlParser.scala --- @@ -936,7 +936,47 @@ class SparkSqlAstBuilder(conf: SQLConf) extends

[GitHub] spark pull request: [SPARK-14507] [SQL] EXTERNAL keyword in a CTAS...

2016-05-29 Thread yhuai
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/13395#issuecomment-222376388 test this please

[GitHub] spark pull request: [SPARK-14507] [SQL] EXTERNAL keyword in a CTAS...

2016-05-29 Thread yhuai
GitHub user yhuai opened a pull request: https://github.com/apache/spark/pull/13395 [SPARK-14507] [SQL] EXTERNAL keyword in a CTAS statement is not allowed ## What changes were proposed in this pull request? This PR makes the parser to throw an exception if a hive style CTAS

[GitHub] spark pull request: [SPARK-15646] [SQL] When spark.sql.hive.conver...

2016-05-29 Thread yhuai
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/13386#issuecomment-222375080 OK. External related changes will be handled by another PR.

[GitHub] spark pull request: [SPARK-14507] [SPARK-15646] [SQL] When spark.s...

2016-05-28 Thread yhuai
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/13386#issuecomment-222343075 test this please

[GitHub] spark pull request: [SPARK-14507] [SPARK-15646] [SQL] When spark.s...

2016-05-28 Thread yhuai
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/13386#issuecomment-222334796 @ericl @andrewor14 @liancheng Can you review this PR?

[GitHub] spark pull request: [SPARK-14507] [SPARK-15646] [SQL] When spark.s...

2016-05-28 Thread yhuai
GitHub user yhuai opened a pull request: https://github.com/apache/spark/pull/13386 [SPARK-14507] [SPARK-15646] [SQL] When spark.sql.hive.convertCTAS is true, we should not convert the table stored as TEXTFILE/SEQUENCEFILE and we need respect the user-defined location ## What

spark git commit: [SPARK-15636][SQL] Make aggregate expressions more concise in explain

2016-05-28 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-2.0 a2f68ded2 -> f3570bcea [SPARK-15636][SQL] Make aggregate expressions more concise in explain ## What changes were proposed in this pull request? This patch reduces the verbosity of aggregate expressions in explain (but does not

spark git commit: [SPARK-15636][SQL] Make aggregate expressions more concise in explain

2016-05-28 Thread yhuai
Repository: spark Updated Branches: refs/heads/master 74c1b79f3 -> 472f16181 [SPARK-15636][SQL] Make aggregate expressions more concise in explain ## What changes were proposed in this pull request? This patch reduces the verbosity of aggregate expressions in explain (but does not actually

[GitHub] spark pull request: [SPARK-15636][SQL] Make aggregate expressions ...

2016-05-28 Thread yhuai
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/13367#issuecomment-222329931 lgtm. Merging to master and branch 2.0.

spark git commit: [SPARK-15594][SQL] ALTER TABLE SERDEPROPERTIES does not respect partition spec

2016-05-27 Thread yhuai
Repository: spark Updated Branches: refs/heads/master 776d183c8 -> 4a2fb8b87 [SPARK-15594][SQL] ALTER TABLE SERDEPROPERTIES does not respect partition spec ## What changes were proposed in this pull request? These commands ignore the partition spec and change the storage properties of the

spark git commit: [SPARK-15594][SQL] ALTER TABLE SERDEPROPERTIES does not respect partition spec

2016-05-27 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-2.0 dc6e94157 -> 80a40e8e2 [SPARK-15594][SQL] ALTER TABLE SERDEPROPERTIES does not respect partition spec ## What changes were proposed in this pull request? These commands ignore the partition spec and change the storage properties of

[GitHub] spark pull request: [SPARK-15594][SQL] ALTER TABLE SERDEPROPERTIES...

2016-05-27 Thread yhuai
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/13343#issuecomment-79262 lgtm. Merging to master and branch 2.0.

[GitHub] spark pull request: [SPARK-15622] [SQL] Wrap the parent classloade...

2016-05-27 Thread yhuai
GitHub user yhuai opened a pull request: https://github.com/apache/spark/pull/13366 [SPARK-15622] [SQL] Wrap the parent classloader of Janino's classloader in the ParentClassLoader. ## What changes were proposed in this pull request? At https://github.com/aunkrig/janino/blob

spark git commit: [SPARK-15431][SQL][BRANCH-2.0-TEST] rework the clisuite test cases

2016-05-27 Thread yhuai
Repository: spark Updated Branches: refs/heads/master 21b2605dc -> 019afd9c7 [SPARK-15431][SQL][BRANCH-2.0-TEST] rework the clisuite test cases ## What changes were proposed in this pull request? This PR reworks the CliSuite test cases for `LIST FILES/JARS` commands. CC yhuai Tha

spark git commit: [SPARK-15431][SQL][BRANCH-2.0-TEST] rework the clisuite test cases

2016-05-27 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-2.0 dcf498e8a -> 9c137b2e3 [SPARK-15431][SQL][BRANCH-2.0-TEST] rework the clisuite test cases ## What changes were proposed in this pull request? This PR reworks the CliSuite test cases for `LIST FILES/JARS` commands. CC yhuai Tha

[GitHub] spark pull request: [SPARK-15431][SQL][branch-2.0-test] rework the...

2016-05-27 Thread yhuai
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/13361#issuecomment-52079 LGTM. Merging to master and branch 2.0.

[GitHub] spark pull request: [SPARK-15431][SQL][branch-2.0-test] rework the...

2016-05-27 Thread yhuai
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/13361#issuecomment-40500 Thanks for the fix. What was the problem?

[GitHub] spark pull request: [SPARK-14543] [SQL] Improve InsertIntoTable co...

2016-05-27 Thread yhuai
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/12313#issuecomment-25637 @rdblue Thank you for your reply. For #2, yea, I feel it is better to be strict right now. I checked with yesterday's master and it seems we already require the data

[GitHub] spark pull request: [SPARK-15431][SQL][test-maven] rework the clis...

2016-05-27 Thread yhuai
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/13361#issuecomment-09770 add to whitelist

[GitHub] spark pull request: [SPARK-15494][SQL] encoder code cleanup

2016-05-27 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/13269#discussion_r64940498 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/encoders/ExpressionEncoder.scala --- @@ -191,6 +189,26 @@ case class ExpressionEncoder[T

spark git commit: [SPARK-15565][SQL] Add the File Scheme to the Default Value of WAREHOUSE_PATH

2016-05-27 Thread yhuai
user.dir")/spark-warehouse`. Since `System.getProperty("user.dir")` is a local dir, we should explicitly set the scheme to local filesystem. cc yhuai How was this patch tested? Added two test cases Author: gatorsmile <gatorsm...@gmail.com>

spark git commit: [SPARK-15565][SQL] Add the File Scheme to the Default Value of WAREHOUSE_PATH

2016-05-27 Thread yhuai
user.dir")/spark-warehouse`. Since `System.getProperty("user.dir")` is a local dir, we should explicitly set the scheme to local filesystem. cc yhuai How was this patch tested? Added two test cases Author: gatorsmile <gatorsm...@gmail.com> Closes #13348 from gatorsmile/ad

[GitHub] spark pull request: [SPARK-15565] [SQL] Add the File Scheme to the...

2016-05-27 Thread yhuai
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/13348#issuecomment-222197726 Thanks. Merging to master and branch 2.0.

[GitHub] spark pull request: [SPARK-15431][SQL][HOTFIX] ignore 'list' comma...

2016-05-27 Thread yhuai
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/13276#issuecomment-222193363 Let me know when you have the PR. I will add you to the whitelist.

[GitHub] spark pull request: [SPARK-15431][SQL][HOTFIX] ignore 'list' comma...

2016-05-27 Thread yhuai
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/13276#issuecomment-222185524 @xwu0226 Please open another PR to re-enable these tests and ask the Jenkins PR builder to test Maven.
