[GitHub] [spark] AmplabJenkins commented on pull request #30657: [WIP][SPARK-33696][SQL] Upgrade built-in Hive to 2.3.8

2020-12-15 Thread GitBox
AmplabJenkins commented on pull request #30657: URL: https://github.com/apache/spark/pull/30657#issuecomment-745245995 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/37425/ -

[GitHub] [spark] AmplabJenkins commented on pull request #30724: [SPARK-33752][SQL] Avoid the getSimpleMessage of AnalysisException adds semicolon repeatedly

2020-12-15 Thread GitBox
AmplabJenkins commented on pull request #30724: URL: https://github.com/apache/spark/pull/30724#issuecomment-745245998 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/132811/ -

[GitHub] [spark] AmplabJenkins commented on pull request #30772: [SPARK-33733][SQL][2.4] PullOutNondeterministic should check and collect deterministic field

2020-12-15 Thread GitBox
AmplabJenkins commented on pull request #30772: URL: https://github.com/apache/spark/pull/30772#issuecomment-745245994 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/132821/ -

[GitHub] [spark] AmplabJenkins commented on pull request #30775: [SPARK-33778][SQL] Allow typesafe join for LeftSemi and LeftAnti

2020-12-15 Thread GitBox
AmplabJenkins commented on pull request #30775: URL: https://github.com/apache/spark/pull/30775#issuecomment-745245996 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/132809/ -

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30724: [SPARK-33752][SQL] Avoid the getSimpleMessage of AnalysisException adds semicolon repeatedly

2020-12-15 Thread GitBox
AmplabJenkins removed a comment on pull request #30724: URL: https://github.com/apache/spark/pull/30724#issuecomment-745245998 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/132811/ -

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30657: [WIP][SPARK-33696][SQL] Upgrade built-in Hive to 2.3.8

2020-12-15 Thread GitBox
AmplabJenkins removed a comment on pull request #30657: URL: https://github.com/apache/spark/pull/30657#issuecomment-745245995 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/37425/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30775: [SPARK-33778][SQL] Allow typesafe join for LeftSemi and LeftAnti

2020-12-15 Thread GitBox
AmplabJenkins removed a comment on pull request #30775: URL: https://github.com/apache/spark/pull/30775#issuecomment-745245996 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/132809/ -

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30745: [SPARK-33678][SQL] Product aggregation function

2020-12-15 Thread GitBox
AmplabJenkins removed a comment on pull request #30745: URL: https://github.com/apache/spark/pull/30745#issuecomment-745245993 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/132825/ -

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30772: [SPARK-33733][SQL][2.4] PullOutNondeterministic should check and collect deterministic field

2020-12-15 Thread GitBox
AmplabJenkins removed a comment on pull request #30772: URL: https://github.com/apache/spark/pull/30772#issuecomment-745245994 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/132821/ -

[GitHub] [spark] aokolnychyi commented on a change in pull request #30558: [SPARK-33612][SQL] Add dataSourceRewriteRules batch to Optimizer

2020-12-15 Thread GitBox
aokolnychyi commented on a change in pull request #30558: URL: https://github.com/apache/spark/pull/30558#discussion_r543286779 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/Optimizer.scala ## @@ -185,6 +185,9 @@ abstract class Optimizer(cat

[GitHub] [spark] HyukjinKwon commented on a change in pull request #30558: [SPARK-33612][SQL] Add dataSourceRewriteRules batch to Optimizer

2020-12-15 Thread GitBox
HyukjinKwon commented on a change in pull request #30558: URL: https://github.com/apache/spark/pull/30558#discussion_r543287888 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/Optimizer.scala ## @@ -185,6 +185,9 @@ abstract class Optimizer(cat

[GitHub] [spark] jbampton commented on a change in pull request #30776: [SPARK-33787][SQL] Add the `purge` parameter to `dropPartition()` of `SupportsPartitionManagement`

2020-12-15 Thread GitBox
jbampton commented on a change in pull request #30776: URL: https://github.com/apache/spark/pull/30776#discussion_r543286896 ## File path: sql/catalyst/src/main/java/org/apache/spark/sql/connector/catalog/SupportsAtomicPartitionManagement.java ## @@ -82,4 +82,27 @@ void create

[GitHub] [spark] cxzl25 opened a new pull request #30780: [SPARK-33790][CORE] Reduce the rpc call of getFileStatus in SingleFileEventLogFileReader

2020-12-15 Thread GitBox
cxzl25 opened a new pull request #30780: URL: https://github.com/apache/spark/pull/30780 ### What changes were proposed in this pull request? `FsHistoryProvider#checkForLogs` already has `FileStatus` when constructing `SingleFileEventLogFileReader`, and there is no need to get the `FileS

[GitHub] [spark] AmplabJenkins commented on pull request #30780: [SPARK-33790][CORE] Reduce the rpc call of getFileStatus in SingleFileEventLogFileReader

2020-12-15 Thread GitBox
AmplabJenkins commented on pull request #30780: URL: https://github.com/apache/spark/pull/30780#issuecomment-745248077 Can one of the admins verify this patch? This is an automated message from the Apache Git Service. To resp

[GitHub] [spark] SparkQA commented on pull request #30779: [SPARK-33789][SQL][TESTS] Refactor unified V1 and V2 datasource tests

2020-12-15 Thread GitBox
SparkQA commented on pull request #30779: URL: https://github.com/apache/spark/pull/30779#issuecomment-745249019 **[Test build #132826 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/132826/testReport)** for PR 30779 at commit [`89e3898`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #30745: [SPARK-33678][SQL] Product aggregation function

2020-12-15 Thread GitBox
SparkQA commented on pull request #30745: URL: https://github.com/apache/spark/pull/30745#issuecomment-745250106 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/37427/ -

[GitHub] [spark] sqlwindspeaker commented on a change in pull request #30658: [SPARK-22769] Do not log rpc post message error when sparkEnv is already stopped

2020-12-15 Thread GitBox
sqlwindspeaker commented on a change in pull request #30658: URL: https://github.com/apache/spark/pull/30658#discussion_r543292377 ## File path: core/src/main/scala/org/apache/spark/rpc/netty/Dispatcher.scala ## @@ -147,13 +147,15 @@ private[netty] class Dispatcher(nettyEnv: Ne

[GitHub] [spark] SparkQA commented on pull request #30776: [SPARK-33787][SQL] Add the `purge` parameter to `dropPartition()` of `SupportsPartitionManagement`

2020-12-15 Thread GitBox
SparkQA commented on pull request #30776: URL: https://github.com/apache/spark/pull/30776#issuecomment-745255368 **[Test build #132813 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/132813/testReport)** for PR 30776 at commit [`2a54876`](https://github.co

[GitHub] [spark] SparkQA removed a comment on pull request #30776: [SPARK-33787][SQL] Add the `purge` parameter to `dropPartition()` of `SupportsPartitionManagement`

2020-12-15 Thread GitBox
SparkQA removed a comment on pull request #30776: URL: https://github.com/apache/spark/pull/30776#issuecomment-745124753 **[Test build #132813 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/132813/testReport)** for PR 30776 at commit [`2a54876`](https://gi

[GitHub] [spark] SparkQA commented on pull request #30778: [SPARK-33788][SQL] Throw NoSuchPartitionsException from HiveExternalCatalog.dropPartitions()

2020-12-15 Thread GitBox
SparkQA commented on pull request #30778: URL: https://github.com/apache/spark/pull/30778#issuecomment-745256046 **[Test build #132824 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/132824/testReport)** for PR 30778 at commit [`8df8fa7`](https://github.co

[GitHub] [spark] erikerlandson commented on pull request #30735: [SPARK-33748][K8S] Respect environment variables and configurations for Python executables

2020-12-15 Thread GitBox
erikerlandson commented on pull request #30735: URL: https://github.com/apache/spark/pull/30735#issuecomment-745256148 @HyukjinKwon thanks for standardizing the env-var support! This is an automated message from the Apache Gi

[GitHub] [spark] AmplabJenkins commented on pull request #30776: [SPARK-33787][SQL] Add the `purge` parameter to `dropPartition()` of `SupportsPartitionManagement`

2020-12-15 Thread GitBox
AmplabJenkins commented on pull request #30776: URL: https://github.com/apache/spark/pull/30776#issuecomment-745256337 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/132813/ -

[GitHub] [spark] SparkQA removed a comment on pull request #30778: [SPARK-33788][SQL] Throw NoSuchPartitionsException from HiveExternalCatalog.dropPartitions()

2020-12-15 Thread GitBox
SparkQA removed a comment on pull request #30778: URL: https://github.com/apache/spark/pull/30778#issuecomment-745219219 **[Test build #132824 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/132824/testReport)** for PR 30778 at commit [`8df8fa7`](https://gi

[GitHub] [spark] AmplabJenkins commented on pull request #30778: [SPARK-33788][SQL] Throw NoSuchPartitionsException from HiveExternalCatalog.dropPartitions()

2020-12-15 Thread GitBox
AmplabJenkins commented on pull request #30778: URL: https://github.com/apache/spark/pull/30778#issuecomment-745256300 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/132824/ -

[GitHub] [spark] AngersZhuuuu commented on pull request #29966: [SPARK-33084][CORE][SQL] Add jar support ivy path

2020-12-15 Thread GitBox
AngersZhuuuu commented on pull request #29966: URL: https://github.com/apache/spark/pull/29966#issuecomment-745256871 retest this please This is an automated message from the Apache Git Service. To respond to the message, ple

[GitHub] [spark] SparkQA commented on pull request #30761: [SPARK-33769][SQL] Improve the next-day function of the sql component to deal with Column type

2020-12-15 Thread GitBox
SparkQA commented on pull request #30761: URL: https://github.com/apache/spark/pull/30761#issuecomment-745256929 **[Test build #132815 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/132815/testReport)** for PR 30761 at commit [`7c32905`](https://github.co

[GitHub] [spark] seobandoe commented on pull request #30713: Update monitoring.md

2020-12-15 Thread GitBox
seobandoe commented on pull request #30713: URL: https://github.com/apache/spark/pull/30713#issuecomment-745257694 Is the configuration version inside the real program no an external file, but ok, in my case know this other way, safe my job. Get Outlook for Android

[GitHub] [spark] SparkQA removed a comment on pull request #30761: [SPARK-33769][SQL] Improve the next-day function of the sql component to deal with Column type

2020-12-15 Thread GitBox
SparkQA removed a comment on pull request #30761: URL: https://github.com/apache/spark/pull/30761#issuecomment-745125347 **[Test build #132815 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/132815/testReport)** for PR 30761 at commit [`7c32905`](https://gi

[GitHub] [spark] SparkQA commented on pull request #30778: [SPARK-33788][SQL] Throw NoSuchPartitionsException from HiveExternalCatalog.dropPartitions()

2020-12-15 Thread GitBox
SparkQA commented on pull request #30778: URL: https://github.com/apache/spark/pull/30778#issuecomment-745260575 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/37426/ ---

[GitHub] [spark] Ngone51 commented on pull request #30759: [SPARK-33774][UI][CORE]"Back to Master" returns 500 error in Standalone cluster

2020-12-15 Thread GitBox
Ngone51 commented on pull request #30759: URL: https://github.com/apache/spark/pull/30759#issuecomment-745260798 I didn't enable HTTPS. And I also tried with 3.0 release just now and the issue still exists. This is an automa

[GitHub] [spark] wangyum commented on pull request #30780: [SPARK-33790][CORE] Reduce the rpc call of getFileStatus in SingleFileEventLogFileReader

2020-12-15 Thread GitBox
wangyum commented on pull request #30780: URL: https://github.com/apache/spark/pull/30780#issuecomment-745261781 ok to test. This is an automated message from the Apache Git Service. To respond to the message, please log on t

[GitHub] [spark] MaxGekk commented on a change in pull request #30776: [SPARK-33787][SQL] Add the `purge` parameter to `dropPartition()` of `SupportsPartitionManagement`

2020-12-15 Thread GitBox
MaxGekk commented on a change in pull request #30776: URL: https://github.com/apache/spark/pull/30776#discussion_r543308672 ## File path: sql/catalyst/src/main/java/org/apache/spark/sql/connector/catalog/SupportsAtomicPartitionManagement.java ## @@ -82,4 +82,27 @@ void createP

[GitHub] [spark] SparkQA commented on pull request #30657: [WIP][SPARK-33696][SQL] Upgrade built-in Hive to 2.3.8

2020-12-15 Thread GitBox
SparkQA commented on pull request #30657: URL: https://github.com/apache/spark/pull/30657#issuecomment-745264414 **[Test build #132823 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/132823/testReport)** for PR 30657 at commit [`127cb2d`](https://github.co

[GitHub] [spark] SparkQA removed a comment on pull request #30657: [WIP][SPARK-33696][SQL] Upgrade built-in Hive to 2.3.8

2020-12-15 Thread GitBox
SparkQA removed a comment on pull request #30657: URL: https://github.com/apache/spark/pull/30657#issuecomment-745186570 **[Test build #132823 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/132823/testReport)** for PR 30657 at commit [`127cb2d`](https://gi

[GitHub] [spark] SparkQA commented on pull request #30779: [SPARK-33789][SQL][TESTS] Refactor unified V1 and V2 datasource tests

2020-12-15 Thread GitBox
SparkQA commented on pull request #30779: URL: https://github.com/apache/spark/pull/30779#issuecomment-745265170 **[Test build #132826 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/132826/testReport)** for PR 30779 at commit [`89e3898`](https://github.co

[GitHub] [spark] SparkQA removed a comment on pull request #30779: [SPARK-33789][SQL][TESTS] Refactor unified V1 and V2 datasource tests

2020-12-15 Thread GitBox
SparkQA removed a comment on pull request #30779: URL: https://github.com/apache/spark/pull/30779#issuecomment-745249019 **[Test build #132826 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/132826/testReport)** for PR 30779 at commit [`89e3898`](https://gi

[GitHub] [spark] SparkQA commented on pull request #30745: [SPARK-33678][SQL] Product aggregation function

2020-12-15 Thread GitBox
SparkQA commented on pull request #30745: URL: https://github.com/apache/spark/pull/30745#issuecomment-745265689 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/37427/ ---

[GitHub] [spark] SparkQA commented on pull request #30642: [SPARK-32991][SQL] [FOLLOWUP] Reset command relies on session initials first

2020-12-15 Thread GitBox
SparkQA commented on pull request #30642: URL: https://github.com/apache/spark/pull/30642#issuecomment-745266989 **[Test build #132818 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/132818/testReport)** for PR 30642 at commit [`814f5af`](https://github.co

[GitHub] [spark] SparkQA commented on pull request #30777: [SPARK-28863][SQL][FOLLOWUP] Make sure optimized plan will not be re-analyzed

2020-12-15 Thread GitBox
SparkQA commented on pull request #30777: URL: https://github.com/apache/spark/pull/30777#issuecomment-745271178 **[Test build #132812 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/132812/testReport)** for PR 30777 at commit [`bcca0d9`](https://github.co

[GitHub] [spark] AmplabJenkins commented on pull request #30745: [SPARK-33678][SQL] Product aggregation function

2020-12-15 Thread GitBox
AmplabJenkins commented on pull request #30745: URL: https://github.com/apache/spark/pull/30745#issuecomment-745274523 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/37427/ -

[GitHub] [spark] AmplabJenkins commented on pull request #30779: [SPARK-33789][SQL][TESTS] Refactor unified V1 and V2 datasource tests

2020-12-15 Thread GitBox
AmplabJenkins commented on pull request #30779: URL: https://github.com/apache/spark/pull/30779#issuecomment-745274518 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/132826/ -

[GitHub] [spark] AmplabJenkins commented on pull request #30657: [WIP][SPARK-33696][SQL] Upgrade built-in Hive to 2.3.8

2020-12-15 Thread GitBox
AmplabJenkins commented on pull request #30657: URL: https://github.com/apache/spark/pull/30657#issuecomment-745274515 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/132823/ -

[GitHub] [spark] AmplabJenkins commented on pull request #30642: [SPARK-32991][SQL] [FOLLOWUP] Reset command relies on session initials first

2020-12-15 Thread GitBox
AmplabJenkins commented on pull request #30642: URL: https://github.com/apache/spark/pull/30642#issuecomment-745274521 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/132818/ -

[GitHub] [spark] AmplabJenkins commented on pull request #30778: [SPARK-33788][SQL] Throw NoSuchPartitionsException from HiveExternalCatalog.dropPartitions()

2020-12-15 Thread GitBox
AmplabJenkins commented on pull request #30778: URL: https://github.com/apache/spark/pull/30778#issuecomment-745274517 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/37426/ -

[GitHub] [spark] AmplabJenkins commented on pull request #30777: [SPARK-28863][SQL][FOLLOWUP] Make sure optimized plan will not be re-analyzed

2020-12-15 Thread GitBox
AmplabJenkins commented on pull request #30777: URL: https://github.com/apache/spark/pull/30777#issuecomment-745274516 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/132812/ -

[GitHub] [spark] AmplabJenkins commented on pull request #30761: [SPARK-33769][SQL] Improve the next-day function of the sql component to deal with Column type

2020-12-15 Thread GitBox
AmplabJenkins commented on pull request #30761: URL: https://github.com/apache/spark/pull/30761#issuecomment-745274519 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/132815/ -

[GitHub] [spark] SparkQA commented on pull request #30780: [SPARK-33790][CORE] Reduce the rpc call of getFileStatus in SingleFileEventLogFileReader

2020-12-15 Thread GitBox
SparkQA commented on pull request #30780: URL: https://github.com/apache/spark/pull/30780#issuecomment-745280816 **[Test build #132827 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/132827/testReport)** for PR 30780 at commit [`7dae56f`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #30779: [SPARK-33789][SQL][TESTS] Refactor unified V1 and V2 datasource tests

2020-12-15 Thread GitBox
SparkQA commented on pull request #30779: URL: https://github.com/apache/spark/pull/30779#issuecomment-745280921 **[Test build #132828 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/132828/testReport)** for PR 30779 at commit [`5836e54`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #30642: [SPARK-32991][SQL] [FOLLOWUP] Reset command relies on session initials first

2020-12-15 Thread GitBox
SparkQA commented on pull request #30642: URL: https://github.com/apache/spark/pull/30642#issuecomment-745280955 **[Test build #132830 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/132830/testReport)** for PR 30642 at commit [`8702522`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #30776: [SPARK-33787][SQL] Add the `purge` parameter to `dropPartition()` of `SupportsPartitionManagement`

2020-12-15 Thread GitBox
SparkQA commented on pull request #30776: URL: https://github.com/apache/spark/pull/30776#issuecomment-745281011 **[Test build #132829 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/132829/testReport)** for PR 30776 at commit [`2a6cbb8`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #30780: [SPARK-33790][CORE] Reduce the rpc call of getFileStatus in SingleFileEventLogFileReader

2020-12-15 Thread GitBox
SparkQA commented on pull request #30780: URL: https://github.com/apache/spark/pull/30780#issuecomment-745281220 **[Test build #132827 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/132827/testReport)** for PR 30780 at commit [`7dae56f`](https://github.co

[GitHub] [spark] AmplabJenkins commented on pull request #30780: [SPARK-33790][CORE] Reduce the rpc call of getFileStatus in SingleFileEventLogFileReader

2020-12-15 Thread GitBox
AmplabJenkins commented on pull request #30780: URL: https://github.com/apache/spark/pull/30780#issuecomment-745281246 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/132827/ -

[GitHub] [spark] sqlwindspeaker opened a new pull request #30781: SPARK-22748: grouping_id() can only be used with GroupingSets/Cube/Rollup

2020-12-15 Thread GitBox
sqlwindspeaker opened a new pull request #30781: URL: https://github.com/apache/spark/pull/30781 ### What changes were proposed in this pull request? See SPARK-22748, sql like: ``` create table temp.test_grouping_replace(a int, b int); select grouping__id from (select a, b, coun

[GitHub] [spark] ammar1x opened a new pull request #30782: Fix typos in the comment for DataStreamWriter#foreachBatch

2020-12-15 Thread GitBox
ammar1x opened a new pull request #30782: URL: https://github.com/apache/spark/pull/30782 The title is pretty self-explanatory. ### What changes were proposed in this pull request? Fixing typos in the docs for `foreachBatch` functions. ### Why are the changes nee

[GitHub] [spark] SparkQA commented on pull request #29966: [SPARK-33084][CORE][SQL] Add jar support ivy path

2020-12-15 Thread GitBox
SparkQA commented on pull request #29966: URL: https://github.com/apache/spark/pull/29966#issuecomment-745282519 **[Test build #132831 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/132831/testReport)** for PR 29966 at commit [`57c351d`](https://github.com

[GitHub] [spark] AmplabJenkins commented on pull request #30782: Fix typos in the comment for DataStreamWriter#foreachBatch

2020-12-15 Thread GitBox
AmplabJenkins commented on pull request #30782: URL: https://github.com/apache/spark/pull/30782#issuecomment-745283053 Can one of the admins verify this patch? This is an automated message from the Apache Git Service. To resp

[GitHub] [spark] AmplabJenkins commented on pull request #30781: SPARK-22748: grouping_id() can only be used with GroupingSets/Cube/Rollup

2020-12-15 Thread GitBox
AmplabJenkins commented on pull request #30781: URL: https://github.com/apache/spark/pull/30781#issuecomment-745283104 Can one of the admins verify this patch? This is an automated message from the Apache Git Service. To resp

[GitHub] [spark] AmplabJenkins commented on pull request #30779: [SPARK-33789][SQL][TESTS] Refactor unified V1 and V2 datasource tests

2020-12-15 Thread GitBox
AmplabJenkins commented on pull request #30779: URL: https://github.com/apache/spark/pull/30779#issuecomment-745284868 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/37428/ -

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30745: [SPARK-33678][SQL] Product aggregation function

2020-12-15 Thread GitBox
AmplabJenkins removed a comment on pull request #30745: URL: https://github.com/apache/spark/pull/30745#issuecomment-745274523 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/37427/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30777: [SPARK-28863][SQL][FOLLOWUP] Make sure optimized plan will not be re-analyzed

2020-12-15 Thread GitBox
AmplabJenkins removed a comment on pull request #30777: URL: https://github.com/apache/spark/pull/30777#issuecomment-745274516 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/132812/ -

[GitHub] [spark] SparkQA removed a comment on pull request #30777: [SPARK-28863][SQL][FOLLOWUP] Make sure optimized plan will not be re-analyzed

2020-12-15 Thread GitBox
SparkQA removed a comment on pull request #30777: URL: https://github.com/apache/spark/pull/30777#issuecomment-745125541 **[Test build #132812 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/132812/testReport)** for PR 30777 at commit [`bcca0d9`](https://gi

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30657: [WIP][SPARK-33696][SQL] Upgrade built-in Hive to 2.3.8

2020-12-15 Thread GitBox
AmplabJenkins removed a comment on pull request #30657: URL: https://github.com/apache/spark/pull/30657#issuecomment-745274515 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/132823/ -

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30642: [SPARK-32991][SQL] [FOLLOWUP] Reset command relies on session initials first

2020-12-15 Thread GitBox
AmplabJenkins removed a comment on pull request #30642: URL: https://github.com/apache/spark/pull/30642#issuecomment-745274521 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/132818/ -

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30780: [SPARK-33790][CORE] Reduce the rpc call of getFileStatus in SingleFileEventLogFileReader

2020-12-15 Thread GitBox
AmplabJenkins removed a comment on pull request #30780: URL: https://github.com/apache/spark/pull/30780#issuecomment-745248077 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] SparkQA removed a comment on pull request #30780: [SPARK-33790][CORE] Reduce the rpc call of getFileStatus in SingleFileEventLogFileReader

2020-12-15 Thread GitBox
SparkQA removed a comment on pull request #30780: URL: https://github.com/apache/spark/pull/30780#issuecomment-745280816 **[Test build #132827 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/132827/testReport)** for PR 30780 at commit [`7dae56f`](https://gi

[GitHub] [spark] SparkQA removed a comment on pull request #30642: [SPARK-32991][SQL] [FOLLOWUP] Reset command relies on session initials first

2020-12-15 Thread GitBox
SparkQA removed a comment on pull request #30642: URL: https://github.com/apache/spark/pull/30642#issuecomment-745154474 **[Test build #132818 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/132818/testReport)** for PR 30642 at commit [`814f5af`](https://gi

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30779: [SPARK-33789][SQL][TESTS] Refactor unified V1 and V2 datasource tests

2020-12-15 Thread GitBox
AmplabJenkins removed a comment on pull request #30779: URL: https://github.com/apache/spark/pull/30779#issuecomment-745274518 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30761: [SPARK-33769][SQL] Improve the next-day function of the sql component to deal with Column type

2020-12-15 Thread GitBox
AmplabJenkins removed a comment on pull request #30761: URL: https://github.com/apache/spark/pull/30761#issuecomment-745274519 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/132815/ -

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30778: [SPARK-33788][SQL] Throw NoSuchPartitionsException from HiveExternalCatalog.dropPartitions()

2020-12-15 Thread GitBox
AmplabJenkins removed a comment on pull request #30778: URL: https://github.com/apache/spark/pull/30778#issuecomment-745274517 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30776: [SPARK-33787][SQL] Add the `purge` parameter to `dropPartition()` of `SupportsPartitionManagement`

2020-12-15 Thread GitBox
AmplabJenkins removed a comment on pull request #30776: URL: https://github.com/apache/spark/pull/30776#issuecomment-745256337 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/132813/ -

[GitHub] [spark] srowen commented on pull request #30759: [SPARK-33774][UI][CORE]"Back to Master" returns 500 error in Standalone cluster

2020-12-15 Thread GitBox
srowen commented on pull request #30759: URL: https://github.com/apache/spark/pull/30759#issuecomment-745294656 Jenkins retest this please This is an automated message from the Apache Git Service. To respond to the message, p

[GitHub] [spark] SparkQA commented on pull request #30780: [SPARK-33790][CORE] Reduce the rpc call of getFileStatus in SingleFileEventLogFileReader

2020-12-15 Thread GitBox
SparkQA commented on pull request #30780: URL: https://github.com/apache/spark/pull/30780#issuecomment-745295651 **[Test build #132832 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/132832/testReport)** for PR 30780 at commit [`4d1d5e4`](https://github.com

[GitHub] [spark] cloud-fan commented on pull request #30762: [SPARK-33758][SQL] Prune unrequired partitionings from AliasAwareOutputPartitionings when some columns are dropped from projection

2020-12-15 Thread GitBox
cloud-fan commented on pull request #30762: URL: https://github.com/apache/spark/pull/30762#issuecomment-745296160 thanks, merging to master! This is an automated message from the Apache Git Service. To respond to the message

[GitHub] [spark] cloud-fan closed pull request #30762: [SPARK-33758][SQL] Prune unrequired partitionings from AliasAwareOutputPartitionings when some columns are dropped from projection

2020-12-15 Thread GitBox
cloud-fan closed pull request #30762: URL: https://github.com/apache/spark/pull/30762

[GitHub] [spark] cloud-fan commented on pull request #30777: [SPARK-28863][SQL][FOLLOWUP] Make sure optimized plan will not be re-analyzed

2020-12-15 Thread GitBox
cloud-fan commented on pull request #30777: URL: https://github.com/apache/spark/pull/30777#issuecomment-745297113 cc @brkyvz

[GitHub] [spark] SparkQA commented on pull request #30780: [SPARK-33790][CORE] Reduce the rpc call of getFileStatus in SingleFileEventLogFileReader

2020-12-15 Thread GitBox
SparkQA commented on pull request #30780: URL: https://github.com/apache/spark/pull/30780#issuecomment-745297034 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/37429/ -

[GitHub] [spark] SparkQA commented on pull request #30779: [SPARK-33789][SQL][TESTS] Refactor unified V1 and V2 datasource tests

2020-12-15 Thread GitBox
SparkQA commented on pull request #30779: URL: https://github.com/apache/spark/pull/30779#issuecomment-745297654 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/37430/ -

[GitHub] [spark] cloud-fan commented on a change in pull request #30558: [SPARK-33612][SQL] Add dataSourceRewriteRules batch to Optimizer

2020-12-15 Thread GitBox
cloud-fan commented on a change in pull request #30558: URL: https://github.com/apache/spark/pull/30558#discussion_r543355931 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/Optimizer.scala ## @@ -185,6 +185,9 @@ abstract class Optimizer(catal

[GitHub] [spark] SparkQA commented on pull request #30642: [SPARK-32991][SQL] [FOLLOWUP] Reset command relies on session initials first

2020-12-15 Thread GitBox
SparkQA commented on pull request #30642: URL: https://github.com/apache/spark/pull/30642#issuecomment-745298332 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/37432/ -

[GitHub] [spark] cloud-fan commented on a change in pull request #30779: [SPARK-33789][SQL][TESTS] Refactor unified V1 and V2 datasource tests

2020-12-15 Thread GitBox
cloud-fan commented on a change in pull request #30779: URL: https://github.com/apache/spark/pull/30779#discussion_r543357240 ## File path: sql/core/src/test/scala/org/apache/spark/sql/execution/command/AlterTableAddPartitionSuiteBase.scala ## @@ -17,48 +17,17 @@ package or

[GitHub] [spark] cloud-fan commented on a change in pull request #30779: [SPARK-33789][SQL][TESTS] Refactor unified V1 and V2 datasource tests

2020-12-15 Thread GitBox
cloud-fan commented on a change in pull request #30779: URL: https://github.com/apache/spark/pull/30779#discussion_r543358195 ## File path: sql/core/src/test/scala/org/apache/spark/sql/execution/command/DDLCommandTestUtils.scala ## @@ -0,0 +1,60 @@ +/* + * Licensed to the Apac

[GitHub] [spark] SparkQA commented on pull request #30736: [SPARK-33617][SQL][FOLLOWUP] refine the default parallelism SQL config

2020-12-15 Thread GitBox
SparkQA commented on pull request #30736: URL: https://github.com/apache/spark/pull/30736#issuecomment-745299577 **[Test build #132817 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/132817/testReport)** for PR 30736 at commit [`7b05413`](https://github.co

[GitHub] [spark] SparkQA commented on pull request #29966: [SPARK-33084][CORE][SQL] Add jar support ivy path

2020-12-15 Thread GitBox
SparkQA commented on pull request #29966: URL: https://github.com/apache/spark/pull/29966#issuecomment-745299668 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/37433/ -

[GitHub] [spark] cloud-fan commented on a change in pull request #30779: [SPARK-33789][SQL][TESTS] Refactor unified V1 and V2 datasource tests

2020-12-15 Thread GitBox
cloud-fan commented on a change in pull request #30779: URL: https://github.com/apache/spark/pull/30779#discussion_r543358456 ## File path: sql/core/src/test/scala/org/apache/spark/sql/execution/command/ShowPartitionsSuiteBase.scala ## @@ -17,16 +17,12 @@ package org.apache

[GitHub] [spark] SparkQA removed a comment on pull request #30736: [SPARK-33617][SQL][FOLLOWUP] refine the default parallelism SQL config

2020-12-15 Thread GitBox
SparkQA removed a comment on pull request #30736: URL: https://github.com/apache/spark/pull/30736#issuecomment-745154950 **[Test build #132817 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/132817/testReport)** for PR 30736 at commit [`7b05413`](https://gi

[GitHub] [spark] AmplabJenkins commented on pull request #30736: [SPARK-33617][SQL][FOLLOWUP] refine the default parallelism SQL config

2020-12-15 Thread GitBox
AmplabJenkins commented on pull request #30736: URL: https://github.com/apache/spark/pull/30736#issuecomment-745300829 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/132817/ -

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30736: [SPARK-33617][SQL][FOLLOWUP] refine the default parallelism SQL config

2020-12-15 Thread GitBox
AmplabJenkins removed a comment on pull request #30736: URL: https://github.com/apache/spark/pull/30736#issuecomment-745300829 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/132817/ -

[GitHub] [spark] SparkQA commented on pull request #30759: [SPARK-33774][UI][CORE]"Back to Master" returns 500 error in Standalone cluster

2020-12-15 Thread GitBox
SparkQA commented on pull request #30759: URL: https://github.com/apache/spark/pull/30759#issuecomment-745303339 **[Test build #132833 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/132833/testReport)** for PR 30759 at commit [`8e27e06`](https://github.com

[GitHub] [spark] MaxGekk commented on a change in pull request #30779: [SPARK-33789][SQL][TESTS] Refactor unified V1 and V2 datasource tests

2020-12-15 Thread GitBox
MaxGekk commented on a change in pull request #30779: URL: https://github.com/apache/spark/pull/30779#discussion_r543365157 ## File path: sql/core/src/test/scala/org/apache/spark/sql/execution/command/AlterTableAddPartitionSuiteBase.scala ## @@ -17,48 +17,17 @@ package org.

[GitHub] [spark] SparkQA commented on pull request #30776: [SPARK-33787][SQL] Add the `purge` parameter to `dropPartition()` of `SupportsPartitionManagement`

2020-12-15 Thread GitBox
SparkQA commented on pull request #30776: URL: https://github.com/apache/spark/pull/30776#issuecomment-745304338 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/37431/ -

[GitHub] [spark] cxzl25 commented on pull request #30725: [SPARK-33753][CORE] Reduce the memory footprint and gc of the cache (hadoopJobMetadata)

2020-12-15 Thread GitBox
cxzl25 commented on pull request #30725: URL: https://github.com/apache/spark/pull/30725#issuecomment-745305483 > I see, thanks for the reference. So IIUC this patch is primarily targeting the `spark.hadoop.cloneConf = true` use case? No. When `spark.hadoop.cloneConf=false`, `Hado

[GitHub] [spark] cloud-fan commented on a change in pull request #30706: [SPARK-33779][SQL] DataSource V2: API to request distribution and ordering on write

2020-12-15 Thread GitBox
cloud-fan commented on a change in pull request #30706: URL: https://github.com/apache/spark/pull/30706#discussion_r543366743 ## File path: sql/catalyst/src/main/java/org/apache/spark/sql/connector/distributions/ClusteredDistribution.java ## @@ -0,0 +1,35 @@ +/* + * Licensed t

[GitHub] [spark] cloud-fan commented on a change in pull request #30774: [SPARK-33786][SQL] The storage level for a cache should be respected when a table name is altered.

2020-12-15 Thread GitBox
cloud-fan commented on a change in pull request #30774: URL: https://github.com/apache/spark/pull/30774#discussion_r543368461 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala ## @@ -195,16 +195,21 @@ case class AlterTableRenameCommand(

[GitHub] [spark] MaxGekk commented on a change in pull request #30779: [SPARK-33789][SQL][TESTS] Refactor unified V1 and V2 datasource tests

2020-12-15 Thread GitBox
MaxGekk commented on a change in pull request #30779: URL: https://github.com/apache/spark/pull/30779#discussion_r543370516 ## File path: sql/core/src/test/scala/org/apache/spark/sql/execution/command/DDLCommandTestUtils.scala ## @@ -0,0 +1,60 @@ +/* + * Licensed to the Apache

[GitHub] [spark] cloud-fan commented on a change in pull request #30779: [SPARK-33789][SQL][TESTS] Refactor unified V1 and V2 datasource tests

2020-12-15 Thread GitBox
cloud-fan commented on a change in pull request #30779: URL: https://github.com/apache/spark/pull/30779#discussion_r543372910 ## File path: sql/core/src/test/scala/org/apache/spark/sql/execution/command/DDLCommandTestUtils.scala ## @@ -0,0 +1,60 @@ +/* + * Licensed to the Apac

[GitHub] [spark] SparkQA commented on pull request #30663: [SPARK-33700][SQL] Avoid file meta reading when enableFilterPushDown is true and filters is empty for Parquet and Orc

2020-12-15 Thread GitBox
SparkQA commented on pull request #30663: URL: https://github.com/apache/spark/pull/30663#issuecomment-745309648 **[Test build #132822 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/132822/testReport)** for PR 30663 at commit [`eb04bdf`](https://github.co

[GitHub] [spark] SparkQA removed a comment on pull request #30663: [SPARK-33700][SQL] Avoid file meta reading when enableFilterPushDown is true and filters is empty for Parquet and Orc

2020-12-15 Thread GitBox
SparkQA removed a comment on pull request #30663: URL: https://github.com/apache/spark/pull/30663#issuecomment-745186524 **[Test build #132822 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/132822/testReport)** for PR 30663 at commit [`eb04bdf`](https://gi

[GitHub] [spark] AmplabJenkins commented on pull request #30663: [SPARK-33700][SQL] Avoid file meta reading when enableFilterPushDown is true and filters is empty for Parquet and Orc

2020-12-15 Thread GitBox
AmplabJenkins commented on pull request #30663: URL: https://github.com/apache/spark/pull/30663#issuecomment-745310818 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/132822/ -

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30663: [SPARK-33700][SQL] Avoid file meta reading when enableFilterPushDown is true and filters is empty for Parquet and Orc

2020-12-15 Thread GitBox
AmplabJenkins removed a comment on pull request #30663: URL: https://github.com/apache/spark/pull/30663#issuecomment-745310818 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/132822/ -

[GitHub] [spark] AmplabJenkins commented on pull request #30776: [SPARK-33787][SQL] Add the `purge` parameter to `dropPartition()` of `SupportsPartitionManagement`

2020-12-15 Thread GitBox
AmplabJenkins commented on pull request #30776: URL: https://github.com/apache/spark/pull/30776#issuecomment-745311352 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/37431/ -
