[GitHub] [spark] cloud-fan closed pull request #31933: [SPARK-34701][SQL] Remove analyzing temp view again in CreateViewCommand

2021-03-25 Thread GitBox
cloud-fan closed pull request #31933: URL: https://github.com/apache/spark/pull/31933 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, plea

[GitHub] [spark] cloud-fan commented on pull request #31933: [SPARK-34701][SQL] Remove analyzing temp view again in CreateViewCommand

2021-03-25 Thread GitBox
cloud-fan commented on pull request #31933: URL: https://github.com/apache/spark/pull/31933#issuecomment-806409795 thanks, merging to master! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [spark] maropu commented on a change in pull request #31735: [SPARK-34799][PYTHON][SQL] Return User-defined types from Pandas UDF

2021-03-25 Thread GitBox
maropu commented on a change in pull request #31735: URL: https://github.com/apache/spark/pull/31735#discussion_r601120852 ## File path: python/pyspark/sql/pandas/serializers.py ## @@ -153,14 +157,15 @@ def _create_batch(self, series): from pyspark.sql.pandas.types imp

[GitHub] [spark] maropu commented on a change in pull request #31735: [SPARK-34799][PYTHON][SQL] Return User-defined types from Pandas UDF

2021-03-25 Thread GitBox
maropu commented on a change in pull request #31735: URL: https://github.com/apache/spark/pull/31735#discussion_r601122354 ## File path: python/pyspark/sql/pandas/serializers.py ## @@ -153,14 +157,15 @@ def _create_batch(self, series): from pyspark.sql.pandas.types imp

[GitHub] [spark] viirya commented on pull request #31747: [SPARK-34607][SQL][2.4] Add `Utils.isMemberClass` to fix a malformed class name error on jdk8u

2021-03-25 Thread GitBox
viirya commented on pull request #31747: URL: https://github.com/apache/spark/pull/31747#issuecomment-806410734 Note that this is backport PR. I will merge this to branch-2.4 tomorrow if no more comments. cc @dongjoon-hyun @rednaxelafx @cloud-fan -- This is an automated mes

[GitHub] [spark] SparkQA commented on pull request #31517: [WIP][SPARK-34309][BUILD][CORE][SQL][K8S]Use Caffeine instead of Guava Cache

2021-03-25 Thread GitBox
SparkQA commented on pull request #31517: URL: https://github.com/apache/spark/pull/31517#issuecomment-806416158 **[Test build #136498 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/136498/testReport)** for PR 31517 at commit [`554b5a5`](https://github.co

[GitHub] [spark] xuanyuanking commented on pull request #31944: [SPARK-34854][SQL][SS] Expose source metrics via progress report and add Kafka use-case to report delay.

2021-03-25 Thread GitBox
xuanyuanking commented on pull request #31944: URL: https://github.com/apache/spark/pull/31944#issuecomment-806420508 Besides the flexibility, I can see the new API here can help with the expansibility for the progress reporter. If new customized metrics needed in the future, we don't need

[GitHub] [spark] SparkQA commented on pull request #31921: [SPARK-34817][SQL] Read parquet unsigned types that stored as int32 physical type in parquet

2021-03-25 Thread GitBox
SparkQA commented on pull request #31921: URL: https://github.com/apache/spark/pull/31921#issuecomment-806420725 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/41081/ -- This is an automated message from the A

[GitHub] [spark] SparkQA commented on pull request #30212: [SPARK-33308][SQL] Refactor current grouping analytics

2021-03-25 Thread GitBox
SparkQA commented on pull request #30212: URL: https://github.com/apache/spark/pull/30212#issuecomment-806420914 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/41083/ -- This is an automated message from the Apache

[GitHub] [spark] AmplabJenkins commented on pull request #31921: [SPARK-34817][SQL] Read parquet unsigned types that stored as int32 physical type in parquet

2021-03-25 Thread GitBox
AmplabJenkins commented on pull request #31921: URL: https://github.com/apache/spark/pull/31921#issuecomment-806421935 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/41081/ -- T

[GitHub] [spark] AmplabJenkins commented on pull request #31517: [WIP][SPARK-34309][BUILD][CORE][SQL][K8S]Use Caffeine instead of Guava Cache

2021-03-25 Thread GitBox
AmplabJenkins commented on pull request #31517: URL: https://github.com/apache/spark/pull/31517#issuecomment-806421936 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/136498/ -- This

[GitHub] [spark] SparkQA commented on pull request #31957: [SPARK-34822][SQL] Update the plan stability golden files even if only the explain.txt changes

2021-03-25 Thread GitBox
SparkQA commented on pull request #31957: URL: https://github.com/apache/spark/pull/31957#issuecomment-806422881 **[Test build #136506 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/136506/testReport)** for PR 31957 at commit [`9d187b4`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #31517: [WIP][SPARK-34309][BUILD][CORE][SQL][K8S]Use Caffeine instead of Guava Cache

2021-03-25 Thread GitBox
SparkQA commented on pull request #31517: URL: https://github.com/apache/spark/pull/31517#issuecomment-806423187 **[Test build #136507 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/136507/testReport)** for PR 31517 at commit [`89abb61`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #31355: [SPARK-34255][SQL] Support partitioning with static number on required distribution and ordering on V2 write

2021-03-25 Thread GitBox
SparkQA commented on pull request #31355: URL: https://github.com/apache/spark/pull/31355#issuecomment-806423266 **[Test build #136508 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/136508/testReport)** for PR 31355 at commit [`e8fb6f1`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #31951: [SPARK-34850][SQL] Support multiply a day-time interval by a numeric

2021-03-25 Thread GitBox
SparkQA commented on pull request #31951: URL: https://github.com/apache/spark/pull/31951#issuecomment-806424337 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/41085/ -- This is an automated message from the Apache

[GitHub] [spark] AmplabJenkins commented on pull request #31956: [SPARK-34857][SQL] Correct AtLeastNNonNulls's explain output

2021-03-25 Thread GitBox
AmplabJenkins commented on pull request #31956: URL: https://github.com/apache/spark/pull/31956#issuecomment-806424462 Can one of the admins verify this patch? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL a

[GitHub] [spark] xuanyuanking edited a comment on pull request #31944: [SPARK-34854][SQL][SS] Expose source metrics via progress report and add Kafka use-case to report delay.

2021-03-25 Thread GitBox
xuanyuanking edited a comment on pull request #31944: URL: https://github.com/apache/spark/pull/31944#issuecomment-806420508 Besides the flexibility, I can see the new API here can help with the expansibility for the progress reporter. If new customized metrics needed in the future, we don

[GitHub] [spark] AmplabJenkins commented on pull request #30212: [SPARK-33308][SQL] Refactor current grouping analytics

2021-03-25 Thread GitBox
AmplabJenkins commented on pull request #30212: URL: https://github.com/apache/spark/pull/30212#issuecomment-806425530 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/41083/ -- T

[GitHub] [spark] SparkQA commented on pull request #30212: [SPARK-33308][SQL] Refactor current grouping analytics

2021-03-25 Thread GitBox
SparkQA commented on pull request #30212: URL: https://github.com/apache/spark/pull/30212#issuecomment-806425496 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/41083/ -- This is an automated message from the A

[GitHub] [spark] maropu commented on pull request #31957: [SPARK-34822][SQL] Update the plan stability golden files even if only the explain.txt changes

2021-03-25 Thread GitBox
maropu commented on pull request #31957: URL: https://github.com/apache/spark/pull/31957#issuecomment-806425746 Ah, I see. My two PRs (#31943 and #31940 ) that have recently been merged was updating the plans. -- This is an automated message from the Apache Git Service. To respond to the

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31517: [WIP][SPARK-34309][BUILD][CORE][SQL][K8S]Use Caffeine instead of Guava Cache

2021-03-25 Thread GitBox
AmplabJenkins removed a comment on pull request #31517: URL: https://github.com/apache/spark/pull/31517#issuecomment-806421936 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/136498/ -

[GitHub] [spark] SparkQA removed a comment on pull request #31517: [WIP][SPARK-34309][BUILD][CORE][SQL][K8S]Use Caffeine instead of Guava Cache

2021-03-25 Thread GitBox
SparkQA removed a comment on pull request #31517: URL: https://github.com/apache/spark/pull/31517#issuecomment-806352279 **[Test build #136498 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/136498/testReport)** for PR 31517 at commit [`554b5a5`](https://gi

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31921: [SPARK-34817][SQL] Read parquet unsigned types that stored as int32 physical type in parquet

2021-03-25 Thread GitBox
AmplabJenkins removed a comment on pull request #31921: URL: https://github.com/apache/spark/pull/31921#issuecomment-806421935 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/41081/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30212: [SPARK-33308][SQL] Refactor current grouping analytics

2021-03-25 Thread GitBox
AmplabJenkins removed a comment on pull request #30212: URL: https://github.com/apache/spark/pull/30212#issuecomment-806425530 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/41083/

[GitHub] [spark] cloud-fan commented on a change in pull request #30212: [SPARK-33308][SQL] Refactor current grouping analytics

2021-03-25 Thread GitBox
cloud-fan commented on a change in pull request #30212: URL: https://github.com/apache/spark/pull/30212#discussion_r601143106 ## File path: docs/sql-ref-syntax-qry-select-groupby.md ## @@ -42,6 +42,21 @@ aggregate_name ( [ DISTINCT ] expression [ , ... ] ) [ FILTER ( WHERE boo

[GitHub] [spark] cloud-fan commented on a change in pull request #30212: [SPARK-33308][SQL] Refactor current grouping analytics

2021-03-25 Thread GitBox
cloud-fan commented on a change in pull request #30212: URL: https://github.com/apache/spark/pull/30212#discussion_r601143106 ## File path: docs/sql-ref-syntax-qry-select-groupby.md ## @@ -42,6 +42,21 @@ aggregate_name ( [ DISTINCT ] expression [ , ... ] ) [ FILTER ( WHERE boo

[GitHub] [spark] maropu edited a comment on pull request #31957: [SPARK-34822][SQL] Update the plan stability golden files even if only the explain.txt changes

2021-03-25 Thread GitBox
maropu edited a comment on pull request #31957: URL: https://github.com/apache/spark/pull/31957#issuecomment-806425746 Ah, I see. My two PRs (#31943 and #31940 ) that have recently been merged was updating the plans -- This is an automated message from the Apache Git Service. To resp

[GitHub] [spark] SparkQA commented on pull request #31951: [SPARK-34850][SQL] Support multiply a day-time interval by a numeric

2021-03-25 Thread GitBox
SparkQA commented on pull request #31951: URL: https://github.com/apache/spark/pull/31951#issuecomment-806429366 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/41085/ -- This is an automated message from the A

[GitHub] [spark] AmplabJenkins commented on pull request #31951: [SPARK-34850][SQL] Support multiply a day-time interval by a numeric

2021-03-25 Thread GitBox
AmplabJenkins commented on pull request #31951: URL: https://github.com/apache/spark/pull/31951#issuecomment-806429385 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/41085/ -- T

[GitHub] [spark] cloud-fan commented on a change in pull request #30212: [SPARK-33308][SQL] Refactor current grouping analytics

2021-03-25 Thread GitBox
cloud-fan commented on a change in pull request #30212: URL: https://github.com/apache/spark/pull/30212#discussion_r601146756 ## File path: docs/sql-ref-syntax-qry-select-groupby.md ## @@ -32,7 +32,7 @@ When a FILTER clause is attached to an aggregate function, only the matchi

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31951: [SPARK-34850][SQL] Support multiply a day-time interval by a numeric

2021-03-25 Thread GitBox
AmplabJenkins removed a comment on pull request #31951: URL: https://github.com/apache/spark/pull/31951#issuecomment-806429385 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/41085/

[GitHub] [spark] cloud-fan commented on a change in pull request #30212: [SPARK-33308][SQL] Refactor current grouping analytics

2021-03-25 Thread GitBox
cloud-fan commented on a change in pull request #30212: URL: https://github.com/apache/spark/pull/30212#discussion_r601147150 ## File path: docs/sql-ref-syntax-qry-select-groupby.md ## @@ -42,6 +42,21 @@ aggregate_name ( [ DISTINCT ] expression [ , ... ] ) [ FILTER ( WHERE boo

[GitHub] [spark] SparkQA commented on pull request #31165: [SPARK-34092][SQL] Support Stage level restful api filter task details by task status

2021-03-25 Thread GitBox
SparkQA commented on pull request #31165: URL: https://github.com/apache/spark/pull/31165#issuecomment-806431593 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/41086/ -- This is an automated message from the Apache

[GitHub] [spark] SparkQA commented on pull request #31951: [SPARK-34850][SQL] Support multiply a day-time interval by a numeric

2021-03-25 Thread GitBox
SparkQA commented on pull request #31951: URL: https://github.com/apache/spark/pull/31951#issuecomment-806431699 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/41084/ -- This is an automated message from the Apache

[GitHub] [spark] cloud-fan commented on a change in pull request #30212: [SPARK-33308][SQL] Refactor current grouping analytics

2021-03-25 Thread GitBox
cloud-fan commented on a change in pull request #30212: URL: https://github.com/apache/spark/pull/30212#discussion_r601148662 ## File path: docs/sql-ref-syntax-qry-select-groupby.md ## @@ -50,34 +65,28 @@ aggregate_name ( [ DISTINCT ] expression [ , ... ] ) [ FILTER ( WHERE bo

[GitHub] [spark] cloud-fan commented on a change in pull request #30212: [SPARK-33308][SQL] Refactor current grouping analytics

2021-03-25 Thread GitBox
cloud-fan commented on a change in pull request #30212: URL: https://github.com/apache/spark/pull/30212#discussion_r601150341 ## File path: docs/sql-ref-syntax-qry-select-groupby.md ## @@ -50,34 +65,28 @@ aggregate_name ( [ DISTINCT ] expression [ , ... ] ) [ FILTER ( WHERE bo

[GitHub] [spark] cloud-fan commented on a change in pull request #30212: [SPARK-33308][SQL] Refactor current grouping analytics

2021-03-25 Thread GitBox
cloud-fan commented on a change in pull request #30212: URL: https://github.com/apache/spark/pull/30212#discussion_r601154251 ## File path: docs/sql-ref-syntax-qry-select-groupby.md ## @@ -50,34 +65,28 @@ aggregate_name ( [ DISTINCT ] expression [ , ... ] ) [ FILTER ( WHERE bo

[GitHub] [spark] MaxGekk closed pull request #31951: [SPARK-34850][SQL] Support multiply a day-time interval by a numeric

2021-03-25 Thread GitBox
MaxGekk closed pull request #31951: URL: https://github.com/apache/spark/pull/31951 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please

[GitHub] [spark] AngersZhuuuu commented on a change in pull request #30212: [SPARK-33308][SQL] Refactor current grouping analytics

2021-03-25 Thread GitBox
AngersZh commented on a change in pull request #30212: URL: https://github.com/apache/spark/pull/30212#discussion_r601161941 ## File path: docs/sql-ref-syntax-qry-select-groupby.md ## @@ -50,34 +65,28 @@ aggregate_name ( [ DISTINCT ] expression [ , ... ] ) [ FILTER ( WHERE

[GitHub] [spark] AngersZhuuuu commented on a change in pull request #30212: [SPARK-33308][SQL] Refactor current grouping analytics

2021-03-25 Thread GitBox
AngersZh commented on a change in pull request #30212: URL: https://github.com/apache/spark/pull/30212#discussion_r601171306 ## File path: docs/sql-ref-syntax-qry-select-groupby.md ## @@ -50,34 +65,28 @@ aggregate_name ( [ DISTINCT ] expression [ , ... ] ) [ FILTER ( WHERE

[GitHub] [spark] cloud-fan commented on a change in pull request #30212: [SPARK-33308][SQL] Refactor current grouping analytics

2021-03-25 Thread GitBox
cloud-fan commented on a change in pull request #30212: URL: https://github.com/apache/spark/pull/30212#discussion_r601151630 ## File path: docs/sql-ref-syntax-qry-select-groupby.md ## @@ -50,34 +65,28 @@ aggregate_name ( [ DISTINCT ] expression [ , ... ] ) [ FILTER ( WHERE bo

[GitHub] [spark] AngersZhuuuu commented on a change in pull request #30212: [SPARK-33308][SQL] Refactor current grouping analytics

2021-03-25 Thread GitBox
AngersZh commented on a change in pull request #30212: URL: https://github.com/apache/spark/pull/30212#discussion_r601172474 ## File path: docs/sql-ref-syntax-qry-select-groupby.md ## @@ -42,6 +42,21 @@ aggregate_name ( [ DISTINCT ] expression [ , ... ] ) [ FILTER ( WHERE

[GitHub] [spark] AngersZhuuuu commented on a change in pull request #30212: [SPARK-33308][SQL] Refactor current grouping analytics

2021-03-25 Thread GitBox
AngersZh commented on a change in pull request #30212: URL: https://github.com/apache/spark/pull/30212#discussion_r601173492 ## File path: docs/sql-ref-syntax-qry-select-groupby.md ## @@ -50,34 +65,28 @@ aggregate_name ( [ DISTINCT ] expression [ , ... ] ) [ FILTER ( WHERE

[GitHub] [spark] SparkQA commented on pull request #31165: [SPARK-34092][SQL] Support Stage level restful api filter task details by task status

2021-03-25 Thread GitBox
SparkQA commented on pull request #31165: URL: https://github.com/apache/spark/pull/31165#issuecomment-806453051 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/41086/ -- This is an automated message from the A

[GitHub] [spark] SparkQA commented on pull request #30212: [SPARK-33308][SQL] Refactor current grouping analytics

2021-03-25 Thread GitBox
SparkQA commented on pull request #30212: URL: https://github.com/apache/spark/pull/30212#issuecomment-806455462 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/41088/ -- This is an automated message from the Apache

[GitHub] [spark] AmplabJenkins commented on pull request #31165: [SPARK-34092][SQL] Support Stage level restful api filter task details by task status

2021-03-25 Thread GitBox
AmplabJenkins commented on pull request #31165: URL: https://github.com/apache/spark/pull/31165#issuecomment-806456185 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/41086/ -- T

[GitHub] [spark] HyukjinKwon commented on pull request #31956: [SPARK-34857][SQL] Correct AtLeastNNonNulls's explain output

2021-03-25 Thread GitBox
HyukjinKwon commented on pull request #31956: URL: https://github.com/apache/spark/pull/31956#issuecomment-806457444 the test failure is from https://github.com/apache/spark/commit/7838f55ca795ca222541de7bc3cb065205718957. Merged to master. -- This is an automated message from the Apach

[GitHub] [spark] AngersZhuuuu commented on pull request #29087: [SPARK-28227][SQL] Support projection, aggregate/window functions, and lateral view in the TRANSFORM clause

2021-03-25 Thread GitBox
AngersZh commented on pull request #29087: URL: https://github.com/apache/spark/pull/29087#issuecomment-806458740 Any more concern on this PR? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31165: [SPARK-34092][SQL] Support Stage level restful api filter task details by task status

2021-03-25 Thread GitBox
AmplabJenkins removed a comment on pull request #31165: URL: https://github.com/apache/spark/pull/31165#issuecomment-806456185 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/41086/

[GitHub] [spark] HyukjinKwon closed pull request #31956: [SPARK-34857][SQL] Correct AtLeastNNonNulls's explain output

2021-03-25 Thread GitBox
HyukjinKwon closed pull request #31956: URL: https://github.com/apache/spark/pull/31956 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, pl

[GitHub] [spark] MaxGekk commented on pull request #31951: [SPARK-34850][SQL] Support multiply a day-time interval by a numeric

2021-03-25 Thread GitBox
MaxGekk commented on pull request #31951: URL: https://github.com/apache/spark/pull/31951#issuecomment-806438970 GA passed. Merging to master. Thank you, @yaooqinn @cloud-fan for your review. -- This is an automated message from the Apache Git Service. To respond to the message, please

[GitHub] [spark] AngersZhuuuu commented on pull request #30957: [SPARK-31937][SQL] Support processing ArrayType/MapType/StructType data using no-serde mode script transform

2021-03-25 Thread GitBox
AngersZh commented on pull request #30957: URL: https://github.com/apache/spark/pull/30957#issuecomment-806456347 retest this please -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the speci

[GitHub] [spark] SparkQA commented on pull request #29087: [SPARK-28227][SQL] Support projection, aggregate/window functions, and lateral view in the TRANSFORM clause

2021-03-25 Thread GitBox
SparkQA commented on pull request #29087: URL: https://github.com/apache/spark/pull/29087#issuecomment-806460947 **[Test build #136513 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/136513/testReport)** for PR 29087 at commit [`1278705`](https://github.com

[GitHub] [spark] AmplabJenkins commented on pull request #30212: [SPARK-33308][SQL] Refactor current grouping analytics

2021-03-25 Thread GitBox
AmplabJenkins commented on pull request #30212: URL: https://github.com/apache/spark/pull/30212#issuecomment-806460952 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/41088/ -- T

[GitHub] [spark] SparkQA commented on pull request #30212: [SPARK-33308][SQL] Refactor current grouping analytics

2021-03-25 Thread GitBox
SparkQA commented on pull request #30212: URL: https://github.com/apache/spark/pull/30212#issuecomment-806460928 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/41088/ -- This is an automated message from the A

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30212: [SPARK-33308][SQL] Refactor current grouping analytics

2021-03-25 Thread GitBox
AmplabJenkins removed a comment on pull request #30212: URL: https://github.com/apache/spark/pull/30212#issuecomment-806460952 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/41088/

[GitHub] [spark] cloud-fan commented on a change in pull request #30212: [SPARK-33308][SQL] Refactor current grouping analytics

2021-03-25 Thread GitBox
cloud-fan commented on a change in pull request #30212: URL: https://github.com/apache/spark/pull/30212#discussion_r601152883 ## File path: docs/sql-ref-syntax-qry-select-groupby.md ## @@ -50,34 +65,28 @@ aggregate_name ( [ DISTINCT ] expression [ , ... ] ) [ FILTER ( WHERE bo

[GitHub] [spark] SparkQA commented on pull request #31470: [SPARK-34354][SQL] Fix failure when apply CostBasedJoinReorder on self-join

2021-03-25 Thread GitBox
SparkQA commented on pull request #31470: URL: https://github.com/apache/spark/pull/31470#issuecomment-806458275 **[Test build #136510 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/136510/testReport)** for PR 31470 at commit [`39f1615`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #30212: [SPARK-33308][SQL] Refactor current grouping analytics

2021-03-25 Thread GitBox
SparkQA commented on pull request #30212: URL: https://github.com/apache/spark/pull/30212#issuecomment-806458716 **[Test build #136512 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/136512/testReport)** for PR 30212 at commit [`2b7f645`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #31886: [WIP][SPARK-34795][SQL][TEST] Adds a new job in GitHub Actions to check the output of TPC-DS queries

2021-03-25 Thread GitBox
SparkQA commented on pull request #31886: URL: https://github.com/apache/spark/pull/31886#issuecomment-806457975 **[Test build #136509 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/136509/testReport)** for PR 31886 at commit [`4eab893`](https://github.com

[GitHub] [spark] AngersZhuuuu edited a comment on pull request #29087: [SPARK-28227][SQL] Support projection, aggregate/window functions, and lateral view in the TRANSFORM clause

2021-03-25 Thread GitBox
AngersZh edited a comment on pull request #29087: URL: https://github.com/apache/spark/pull/29087#issuecomment-806458740 Any more concern on this PR? Gentle ping @cloud-fan @maropu -- This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] SparkQA commented on pull request #30957: [SPARK-31937][SQL] Support processing ArrayType/MapType/StructType data using no-serde mode script transform

2021-03-25 Thread GitBox
SparkQA commented on pull request #30957: URL: https://github.com/apache/spark/pull/30957#issuecomment-806458501 **[Test build #136511 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/136511/testReport)** for PR 30957 at commit [`ed8c54c`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #31951: [SPARK-34850][SQL] Support multiply a day-time interval by a numeric

2021-03-25 Thread GitBox
SparkQA commented on pull request #31951: URL: https://github.com/apache/spark/pull/31951#issuecomment-806461721 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/41084/ -- This is an automated message from the A

[GitHub] [spark] SparkQA commented on pull request #31165: [SPARK-34092][SQL] Support Stage level restful api filter task details by task status

2021-03-25 Thread GitBox
SparkQA commented on pull request #31165: URL: https://github.com/apache/spark/pull/31165#issuecomment-806461643 **[Test build #136501 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/136501/testReport)** for PR 31165 at commit [`41e96a5`](https://github.co

[GitHub] [spark] AmplabJenkins commented on pull request #31951: [SPARK-34850][SQL] Support multiply a day-time interval by a numeric

2021-03-25 Thread GitBox
AmplabJenkins commented on pull request #31951: URL: https://github.com/apache/spark/pull/31951#issuecomment-806461752 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/41084/ -- T

[GitHub] [spark] SparkQA removed a comment on pull request #31165: [SPARK-34092][SQL] Support Stage level restful api filter task details by task status

2021-03-25 Thread GitBox
SparkQA removed a comment on pull request #31165: URL: https://github.com/apache/spark/pull/31165#issuecomment-806371106 **[Test build #136501 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/136501/testReport)** for PR 31165 at commit [`41e96a5`](https://gi

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31951: [SPARK-34850][SQL] Support multiply a day-time interval by a numeric

2021-03-25 Thread GitBox
AmplabJenkins removed a comment on pull request #31951: URL: https://github.com/apache/spark/pull/31951#issuecomment-806461752 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/41084/

[GitHub] [spark] SparkQA commented on pull request #31517: [WIP][SPARK-34309][BUILD][CORE][SQL][K8S]Use Caffeine instead of Guava Cache

2021-03-25 Thread GitBox
SparkQA commented on pull request #31517: URL: https://github.com/apache/spark/pull/31517#issuecomment-806462871 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/41089/ -- This is an automated message from the Apache

[GitHub] [spark] AmplabJenkins commented on pull request #31165: [SPARK-34092][SQL] Support Stage level restful api filter task details by task status

2021-03-25 Thread GitBox
AmplabJenkins commented on pull request #31165: URL: https://github.com/apache/spark/pull/31165#issuecomment-806463335 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/136501/ -- This

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31165: [SPARK-34092][SQL] Support Stage level restful api filter task details by task status

2021-03-25 Thread GitBox
AmplabJenkins removed a comment on pull request #31165: URL: https://github.com/apache/spark/pull/31165#issuecomment-806463335 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/136501/ -

[GitHub] [spark] SparkQA commented on pull request #31951: [SPARK-34850][SQL] Support multiply a day-time interval by a numeric

2021-03-25 Thread GitBox
SparkQA commented on pull request #31951: URL: https://github.com/apache/spark/pull/31951#issuecomment-806465714 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/41087/ -- This is an automated message from the Apache

[GitHub] [spark] peter-toth commented on a change in pull request #31955: [SPARK-34829][SQL] Fix typed ScalaUDF result conversion

2021-03-25 Thread GitBox
peter-toth commented on a change in pull request #31955: URL: https://github.com/apache/spark/pull/31955#discussion_r601199084 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/ScalaUDF.scala ## @@ -124,10 +124,10 @@ case class ScalaUDF(

[GitHub] [spark] peter-toth commented on a change in pull request #31955: [SPARK-34829][SQL] Fix typed ScalaUDF result conversion

2021-03-25 Thread GitBox
peter-toth commented on a change in pull request #31955: URL: https://github.com/apache/spark/pull/31955#discussion_r601199228 ## File path: sql/core/src/test/scala/org/apache/spark/sql/DataFrameSuite.scala ## @@ -2755,6 +2755,20 @@ class DataFrameSuite extends QueryTest )

[GitHub] [spark] peter-toth commented on a change in pull request #31955: [SPARK-34829][SQL] Fix typed ScalaUDF result conversion

2021-03-25 Thread GitBox
peter-toth commented on a change in pull request #31955: URL: https://github.com/apache/spark/pull/31955#discussion_r601199084 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/ScalaUDF.scala ## @@ -124,10 +124,10 @@ case class ScalaUDF(

[GitHub] [spark] SparkQA commented on pull request #31517: [WIP][SPARK-34309][BUILD][CORE][SQL][K8S]Use Caffeine instead of Guava Cache

2021-03-25 Thread GitBox
SparkQA commented on pull request #31517: URL: https://github.com/apache/spark/pull/31517#issuecomment-806468142 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/41089/ -- This is an automated message from the A

[GitHub] [spark] AngersZhuuuu commented on pull request #30145: [SPARK-33233][SQL]CUBE/ROLLUP/GROUPING SETS support GROUP BY ordinal

2021-03-25 Thread GitBox
AngersZh commented on pull request #30145: URL: https://github.com/apache/spark/pull/30145#issuecomment-806471174 @maropu Can you reopen this since we have restart reviewing https://github.com/apache/spark/pull/30212 -- This is an automated message from the Apache Git Service. To res

[GitHub] [spark] SparkQA commented on pull request #31933: [SPARK-34701][SQL] Remove analyzing temp view again in CreateViewCommand

2021-03-25 Thread GitBox
SparkQA commented on pull request #31933: URL: https://github.com/apache/spark/pull/31933#issuecomment-806482755 **[Test build #136496 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/136496/testReport)** for PR 31933 at commit [`507b00c`](https://github.co

[GitHub] [spark] cloud-fan commented on a change in pull request #30212: [SPARK-33308][SQL] Refactor current grouping analytics

2021-03-25 Thread GitBox
cloud-fan commented on a change in pull request #30212: URL: https://github.com/apache/spark/pull/30212#discussion_r601229471 ## File path: docs/sql-ref-syntax-qry-select-groupby.md ## @@ -32,7 +32,7 @@ When a FILTER clause is attached to an aggregate function, only the matchi

[GitHub] [spark] cloud-fan commented on a change in pull request #30212: [SPARK-33308][SQL] Refactor current grouping analytics

2021-03-25 Thread GitBox
cloud-fan commented on a change in pull request #30212: URL: https://github.com/apache/spark/pull/30212#discussion_r601230361 ## File path: docs/sql-ref-syntax-qry-select-groupby.md ## @@ -42,42 +42,44 @@ aggregate_name ( [ DISTINCT ] expression [ , ... ] ) [ FILTER ( WHERE bo

[GitHub] [spark] cloud-fan commented on a change in pull request #30212: [SPARK-33308][SQL] Refactor current grouping analytics

2021-03-25 Thread GitBox
cloud-fan commented on a change in pull request #30212: URL: https://github.com/apache/spark/pull/30212#discussion_r601232220 ## File path: docs/sql-ref-syntax-qry-select-groupby.md ## @@ -50,34 +65,28 @@ aggregate_name ( [ DISTINCT ] expression [ , ... ] ) [ FILTER ( WHERE bo

[GitHub] [spark] cloud-fan commented on a change in pull request #30212: [SPARK-33308][SQL] Refactor current grouping analytics

2021-03-25 Thread GitBox
cloud-fan commented on a change in pull request #30212: URL: https://github.com/apache/spark/pull/30212#discussion_r601234435 ## File path: docs/sql-ref-syntax-qry-select-groupby.md ## @@ -42,42 +42,44 @@ aggregate_name ( [ DISTINCT ] expression [ , ... ] ) [ FILTER ( WHERE bo

[GitHub] [spark] c21 opened a new pull request #31958: [SPARK-34862][SQL] Support nested column in ORC vectorized reader

2021-03-25 Thread GitBox
c21 opened a new pull request #31958: URL: https://github.com/apache/spark/pull/31958 ### What changes were proposed in this pull request? This PR is to support nested column type in Spark ORC vectorized reader. Currently ORC vectorized reader [does not support nested column

[GitHub] [spark] cloud-fan commented on a change in pull request #30212: [SPARK-33308][SQL] Refactor current grouping analytics

2021-03-25 Thread GitBox
cloud-fan commented on a change in pull request #30212: URL: https://github.com/apache/spark/pull/30212#discussion_r601237679 ## File path: docs/sql-ref-syntax-qry-select-groupby.md ## @@ -42,42 +42,44 @@ aggregate_name ( [ DISTINCT ] expression [ , ... ] ) [ FILTER ( WHERE bo

[GitHub] [spark] c21 commented on pull request #31958: [SPARK-34862][SQL] Support nested column in ORC vectorized reader

2021-03-25 Thread GitBox
c21 commented on pull request #31958: URL: https://github.com/apache/spark/pull/31958#issuecomment-806488959 cc @cloud-fan, @maropu and @dongjoon-hyun could you help take a look when you have time, thanks. -- This is an automated message from the Apache Git Service. To respond to the mes

[GitHub] [spark] cloud-fan commented on a change in pull request #30212: [SPARK-33308][SQL] Refactor current grouping analytics

2021-03-25 Thread GitBox
cloud-fan commented on a change in pull request #30212: URL: https://github.com/apache/spark/pull/30212#discussion_r601240887 ## File path: docs/sql-ref-syntax-qry-select-groupby.md ## @@ -32,7 +32,7 @@ When a FILTER clause is attached to an aggregate function, only the matchi

[GitHub] [spark] SparkQA commented on pull request #31951: [SPARK-34850][SQL] Support multiply a day-time interval by a numeric

2021-03-25 Thread GitBox
SparkQA commented on pull request #31951: URL: https://github.com/apache/spark/pull/31951#issuecomment-806490909 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/41087/ -- This is an automated message from the A

[GitHub] [spark] AmplabJenkins commented on pull request #31933: [SPARK-34701][SQL] Remove analyzing temp view again in CreateViewCommand

2021-03-25 Thread GitBox
AmplabJenkins commented on pull request #31933: URL: https://github.com/apache/spark/pull/31933#issuecomment-806491362 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/136496/ -- This

[GitHub] [spark] AmplabJenkins commented on pull request #31517: [WIP][SPARK-34309][BUILD][CORE][SQL][K8S]Use Caffeine instead of Guava Cache

2021-03-25 Thread GitBox
AmplabJenkins commented on pull request #31517: URL: https://github.com/apache/spark/pull/31517#issuecomment-806491358 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/41089/ -- T

[GitHub] [spark] AmplabJenkins commented on pull request #31951: [SPARK-34850][SQL] Support multiply a day-time interval by a numeric

2021-03-25 Thread GitBox
AmplabJenkins commented on pull request #31951: URL: https://github.com/apache/spark/pull/31951#issuecomment-806491359 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/41087/ -- T

[GitHub] [spark] SparkQA commented on pull request #31958: [SPARK-34862][SQL] Support nested column in ORC vectorized reader

2021-03-25 Thread GitBox
SparkQA commented on pull request #31958: URL: https://github.com/apache/spark/pull/31958#issuecomment-806491825 **[Test build #136514 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/136514/testReport)** for PR 31958 at commit [`7037893`](https://github.com

[GitHub] [spark] AngersZhuuuu commented on a change in pull request #30212: [SPARK-33308][SQL] Refactor current grouping analytics

2021-03-25 Thread GitBox
AngersZh commented on a change in pull request #30212: URL: https://github.com/apache/spark/pull/30212#discussion_r601244661 ## File path: docs/sql-ref-syntax-qry-select-groupby.md ## @@ -32,7 +32,7 @@ When a FILTER clause is attached to an aggregate function, only the mat

[GitHub] [spark] AmplabJenkins commented on pull request #31958: [SPARK-34862][SQL] Support nested column in ORC vectorized reader

2021-03-25 Thread GitBox
AmplabJenkins commented on pull request #31958: URL: https://github.com/apache/spark/pull/31958#issuecomment-806492296 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/136514/ -- This

[GitHub] [spark] SparkQA commented on pull request #31958: [SPARK-34862][SQL] Support nested column in ORC vectorized reader

2021-03-25 Thread GitBox
SparkQA commented on pull request #31958: URL: https://github.com/apache/spark/pull/31958#issuecomment-806492275 **[Test build #136514 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/136514/testReport)** for PR 31958 at commit [`7037893`](https://github.co

[GitHub] [spark] SparkQA commented on pull request #31355: [SPARK-34255][SQL] Support partitioning with static number on required distribution and ordering on V2 write

2021-03-25 Thread GitBox
SparkQA commented on pull request #31355: URL: https://github.com/apache/spark/pull/31355#issuecomment-806492370 **[Test build #136515 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/136515/testReport)** for PR 31355 at commit [`3e75641`](https://github.com

[GitHub] [spark] AngersZhuuuu commented on a change in pull request #30212: [SPARK-33308][SQL] Refactor current grouping analytics

2021-03-25 Thread GitBox
AngersZh commented on a change in pull request #30212: URL: https://github.com/apache/spark/pull/30212#discussion_r601250092 ## File path: docs/sql-ref-syntax-qry-select-groupby.md ## @@ -32,7 +32,7 @@ When a FILTER clause is attached to an aggregate function, only the mat

[GitHub] [spark] SparkQA removed a comment on pull request #31933: [SPARK-34701][SQL] Remove analyzing temp view again in CreateViewCommand

2021-03-25 Thread GitBox
SparkQA removed a comment on pull request #31933: URL: https://github.com/apache/spark/pull/31933#issuecomment-806352100 **[Test build #136496 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/136496/testReport)** for PR 31933 at commit [`507b00c`](https://gi

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31951: [SPARK-34850][SQL] Support multiply a day-time interval by a numeric

2021-03-25 Thread GitBox
AmplabJenkins removed a comment on pull request #31951: URL: https://github.com/apache/spark/pull/31951#issuecomment-806491359 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/41087/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31933: [SPARK-34701][SQL] Remove analyzing temp view again in CreateViewCommand

2021-03-25 Thread GitBox
AmplabJenkins removed a comment on pull request #31933: URL: https://github.com/apache/spark/pull/31933#issuecomment-806491362 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/136496/ -

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31517: [WIP][SPARK-34309][BUILD][CORE][SQL][K8S]Use Caffeine instead of Guava Cache

2021-03-25 Thread GitBox
AmplabJenkins removed a comment on pull request #31517: URL: https://github.com/apache/spark/pull/31517#issuecomment-806491358 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/41089/

[GitHub] [spark] cloud-fan commented on a change in pull request #30212: [SPARK-33308][SQL] Refactor current grouping analytics

2021-03-25 Thread GitBox
cloud-fan commented on a change in pull request #30212: URL: https://github.com/apache/spark/pull/30212#discussion_r601254568 ## File path: docs/sql-ref-syntax-qry-select-groupby.md ## @@ -32,7 +32,7 @@ When a FILTER clause is attached to an aggregate function, only the matchi

  1   2   3   4   5   6   7   >