[GitHub] [spark] AmplabJenkins removed a comment on pull request #33637: [SPARK-36384][CORE][DOC] Add doc for shuffle checksum

2021-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #33637: URL: https://github.com/apache/spark/pull/33637#issuecomment-892773032 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/46550/

[GitHub] [spark] SparkQA commented on pull request #33638: [SPARK-36415][SQL][DOCS] Add docs for try_cast/try_add/try_divide

2021-08-04 Thread GitBox
SparkQA commented on pull request #33638: URL: https://github.com/apache/spark/pull/33638#issuecomment-892775972 **[Test build #142040 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142040/testReport)** for PR 33638 at commit [`226c1d5`](https://github.com

[GitHub] [spark] andygrove commented on a change in pull request #33624: [SPARK-35881][SQL][FOLLOWUP] Add a boolean flag in AdaptiveSparkPlanExec to ask for columnar output

2021-08-04 Thread GitBox
andygrove commented on a change in pull request #33624: URL: https://github.com/apache/spark/pull/33624#discussion_r682751729 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/AdaptiveSparkPlanExec.scala ## @@ -65,7 +65,8 @@ case class AdaptiveSpark

[GitHub] [spark] andygrove commented on a change in pull request #33624: [SPARK-35881][SQL][FOLLOWUP] Add a boolean flag in AdaptiveSparkPlanExec to ask for columnar output

2021-08-04 Thread GitBox
andygrove commented on a change in pull request #33624: URL: https://github.com/apache/spark/pull/33624#discussion_r682758152 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/AdaptiveSparkPlanExec.scala ## @@ -536,7 +523,7 @@ case class AdaptiveSpa

[GitHub] [spark] SparkQA commented on pull request #33638: [SPARK-36415][SQL][DOCS] Add docs for try_cast/try_add/try_divide

2021-08-04 Thread GitBox
SparkQA commented on pull request #33638: URL: https://github.com/apache/spark/pull/33638#issuecomment-892786290 **[Test build #142040 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142040/testReport)** for PR 33638 at commit [`226c1d5`](https://github.co

[GitHub] [spark] SparkQA removed a comment on pull request #33638: [SPARK-36415][SQL][DOCS] Add docs for try_cast/try_add/try_divide

2021-08-04 Thread GitBox
SparkQA removed a comment on pull request #33638: URL: https://github.com/apache/spark/pull/33638#issuecomment-892775972 **[Test build #142040 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142040/testReport)** for PR 33638 at commit [`226c1d5`](https://gi

[GitHub] [spark] AmplabJenkins commented on pull request #33638: [SPARK-36415][SQL][DOCS] Add docs for try_cast/try_add/try_divide

2021-08-04 Thread GitBox
AmplabJenkins commented on pull request #33638: URL: https://github.com/apache/spark/pull/33638#issuecomment-892786618 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/142040/ -- This

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33638: [SPARK-36415][SQL][DOCS] Add docs for try_cast/try_add/try_divide

2021-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #33638: URL: https://github.com/apache/spark/pull/33638#issuecomment-892786618 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/142040/ -

[GitHub] [spark] SparkQA commented on pull request #33633: [DO NOT MERGE][SPARK-36409][SQL][TESTS] Splitting test cases from datetime.sql

2021-08-04 Thread GitBox
SparkQA commented on pull request #33633: URL: https://github.com/apache/spark/pull/33633#issuecomment-892790952 **[Test build #142026 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142026/testReport)** for PR 33633 at commit [`66d5006`](https://github.co

[GitHub] [spark] SparkQA removed a comment on pull request #33633: [DO NOT MERGE][SPARK-36409][SQL][TESTS] Splitting test cases from datetime.sql

2021-08-04 Thread GitBox
SparkQA removed a comment on pull request #33633: URL: https://github.com/apache/spark/pull/33633#issuecomment-892448656 **[Test build #142026 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142026/testReport)** for PR 33633 at commit [`66d5006`](https://gi

[GitHub] [spark] AmplabJenkins commented on pull request #33633: [DO NOT MERGE][SPARK-36409][SQL][TESTS] Splitting test cases from datetime.sql

2021-08-04 Thread GitBox
AmplabJenkins commented on pull request #33633: URL: https://github.com/apache/spark/pull/33633#issuecomment-892791478 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/142026/ -- This

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33633: [DO NOT MERGE][SPARK-36409][SQL][TESTS] Splitting test cases from datetime.sql

2021-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #33633: URL: https://github.com/apache/spark/pull/33633#issuecomment-892791478 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/142026/ -

[GitHub] [spark] SparkQA commented on pull request #33587: [SPARK-36353][SPARK-36355][SQL] RemoveNoopOperators should keep output schema and NamedExpression add method `withName(newName: String)

2021-08-04 Thread GitBox
SparkQA commented on pull request #33587: URL: https://github.com/apache/spark/pull/33587#issuecomment-892792078 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/46551/ -- This is an automated message from the A

[GitHub] [spark] AmplabJenkins commented on pull request #33587: [SPARK-36353][SPARK-36355][SQL] RemoveNoopOperators should keep output schema and NamedExpression add method `withName(newName: String

2021-08-04 Thread GitBox
AmplabJenkins commented on pull request #33587: URL: https://github.com/apache/spark/pull/33587#issuecomment-892792115 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/46551/ -- T

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33587: [SPARK-36353][SPARK-36355][SQL] RemoveNoopOperators should keep output schema and NamedExpression add method `withName(newName

2021-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #33587: URL: https://github.com/apache/spark/pull/33587#issuecomment-892792115 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/46551/

[GitHub] [spark] huaxingao opened a new pull request #33639: [SPARK-34952][SQL] Aggregate (Min/Max/Count) push down for Parquet

2021-08-04 Thread GitBox
huaxingao opened a new pull request #33639: URL: https://github.com/apache/spark/pull/33639 ### What changes were proposed in this pull request? Push down Min/Max/Count to Parquet ### Why are the changes needed? Since parquet has the statistics information for min, max a

[GitHub] [spark] SparkQA commented on pull request #33637: [SPARK-36384][CORE][DOC] Add doc for shuffle checksum

2021-08-04 Thread GitBox
SparkQA commented on pull request #33637: URL: https://github.com/apache/spark/pull/33637#issuecomment-892808604 **[Test build #142038 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142038/testReport)** for PR 33637 at commit [`55bcbf9`](https://github.co

[GitHub] [spark] SparkQA commented on pull request #33638: [SPARK-36415][SQL][DOCS] Add docs for try_cast/try_add/try_divide

2021-08-04 Thread GitBox
SparkQA commented on pull request #33638: URL: https://github.com/apache/spark/pull/33638#issuecomment-892808798 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/46552/ -- This is an automated message from the Apache

[GitHub] [spark] SparkQA removed a comment on pull request #33637: [SPARK-36384][CORE][DOC] Add doc for shuffle checksum

2021-08-04 Thread GitBox
SparkQA removed a comment on pull request #33637: URL: https://github.com/apache/spark/pull/33637#issuecomment-892682137 **[Test build #142038 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142038/testReport)** for PR 33637 at commit [`55bcbf9`](https://gi

[GitHub] [spark] dongjoon-hyun commented on pull request #33586: [SPARK-36354][CORE] EventLogFileReader should skip rolling event log directories with no logs

2021-08-04 Thread GitBox
dongjoon-hyun commented on pull request #33586: URL: https://github.com/apache/spark/pull/33586#issuecomment-892814945 Thank you, @HeartSaVioR ! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to t

[GitHub] [spark] dongjoon-hyun edited a comment on pull request #33586: [SPARK-36354][CORE] EventLogFileReader should skip rolling event log directories with no logs

2021-08-04 Thread GitBox
dongjoon-hyun edited a comment on pull request #33586: URL: https://github.com/apache/spark/pull/33586#issuecomment-892814945 Thank you again, @HeartSaVioR ! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL abo

[GitHub] [spark] AmplabJenkins commented on pull request #33637: [SPARK-36384][CORE][DOC] Add doc for shuffle checksum

2021-08-04 Thread GitBox
AmplabJenkins commented on pull request #33637: URL: https://github.com/apache/spark/pull/33637#issuecomment-892815286 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/142038/ -- This

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33637: [SPARK-36384][CORE][DOC] Add doc for shuffle checksum

2021-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #33637: URL: https://github.com/apache/spark/pull/33637#issuecomment-892815286 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/142038/ -

[GitHub] [spark] SparkQA commented on pull request #33639: [SPARK-34952][SQL] Aggregate (Min/Max/Count) push down for Parquet

2021-08-04 Thread GitBox
SparkQA commented on pull request #33639: URL: https://github.com/apache/spark/pull/33639#issuecomment-892817410 **[Test build #142041 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142041/testReport)** for PR 33639 at commit [`3ea5fdd`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #33625: [SPARK-36397][PYTHON] Implement DataFrame.mode

2021-08-04 Thread GitBox
SparkQA commented on pull request #33625: URL: https://github.com/apache/spark/pull/33625#issuecomment-892817499 **[Test build #142042 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142042/testReport)** for PR 33625 at commit [`1eee115`](https://github.com

[GitHub] [spark] cloud-fan opened a new pull request #33640: [SPARK-36409][SQL][TESTS] Splitting test cases from datetime.sql

2021-08-04 Thread GitBox
cloud-fan opened a new pull request #33640: URL: https://github.com/apache/spark/pull/33640 ### What changes were proposed in this pull request? Currently `datetime.sql` contains a lot of tests and will be run 3 times: default mode, ansi mode, ntz mode. It wastes the test tim

[GitHub] [spark] gengliangwang commented on pull request #33584: [SPARK-36351][SQL] Separate partition filters and data filters in PushDownUtils

2021-08-04 Thread GitBox
gengliangwang commented on pull request #33584: URL: https://github.com/apache/spark/pull/33584#issuecomment-892826981 > In order to lift the above restriction, at the time of checking whether to push down the aggregate, we should have already separated the partition filters and data filte

[GitHub] [spark] gengliangwang edited a comment on pull request #33584: [SPARK-36351][SQL] Separate partition filters and data filters in PushDownUtils

2021-08-04 Thread GitBox
gengliangwang edited a comment on pull request #33584: URL: https://github.com/apache/spark/pull/33584#issuecomment-892826981 > In order to lift the above restriction, at the time of checking whether to push down the aggregate, we should have already separated the partition filters and dat

[GitHub] [spark] gengliangwang edited a comment on pull request #33584: [SPARK-36351][SQL] Separate partition filters and data filters in PushDownUtils

2021-08-04 Thread GitBox
gengliangwang edited a comment on pull request #33584: URL: https://github.com/apache/spark/pull/33584#issuecomment-892826981 > In order to lift the above restriction, at the time of checking whether to push down the aggregate, we should have already separated the partition filters and dat

[GitHub] [spark] SparkQA commented on pull request #33640: [SPARK-36409][SQL][TESTS] Splitting test cases from datetime.sql

2021-08-04 Thread GitBox
SparkQA commented on pull request #33640: URL: https://github.com/apache/spark/pull/33640#issuecomment-892829484 **[Test build #142043 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142043/testReport)** for PR 33640 at commit [`12b18f6`](https://github.com

[GitHub] [spark] AmplabJenkins commented on pull request #33638: [SPARK-36415][SQL][DOCS] Add docs for try_cast/try_add/try_divide

2021-08-04 Thread GitBox
AmplabJenkins commented on pull request #33638: URL: https://github.com/apache/spark/pull/33638#issuecomment-892833453 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/46552/ -- T

[GitHub] [spark] SparkQA commented on pull request #33638: [SPARK-36415][SQL][DOCS] Add docs for try_cast/try_add/try_divide

2021-08-04 Thread GitBox
SparkQA commented on pull request #33638: URL: https://github.com/apache/spark/pull/33638#issuecomment-892833424 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/46552/ -- This is an automated message from the A

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33638: [SPARK-36415][SQL][DOCS] Add docs for try_cast/try_add/try_divide

2021-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #33638: URL: https://github.com/apache/spark/pull/33638#issuecomment-892833453 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/46552/

[GitHub] [spark] SparkQA commented on pull request #33633: [DO NOT MERGE][SPARK-36409][SQL][TESTS] Splitting test cases from datetime.sql

2021-08-04 Thread GitBox
SparkQA commented on pull request #33633: URL: https://github.com/apache/spark/pull/33633#issuecomment-892837072 **[Test build #142036 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142036/testReport)** for PR 33633 at commit [`a5b74fd`](https://github.co

[GitHub] [spark] SparkQA removed a comment on pull request #33633: [DO NOT MERGE][SPARK-36409][SQL][TESTS] Splitting test cases from datetime.sql

2021-08-04 Thread GitBox
SparkQA removed a comment on pull request #33633: URL: https://github.com/apache/spark/pull/33633#issuecomment-892633315 **[Test build #142036 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142036/testReport)** for PR 33633 at commit [`a5b74fd`](https://gi

[GitHub] [spark] SparkQA commented on pull request #33625: [SPARK-36397][PYTHON] Implement DataFrame.mode

2021-08-04 Thread GitBox
SparkQA commented on pull request #33625: URL: https://github.com/apache/spark/pull/33625#issuecomment-892838548 **[Test build #142042 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142042/testReport)** for PR 33625 at commit [`1eee115`](https://github.co

[GitHub] [spark] SparkQA removed a comment on pull request #33625: [SPARK-36397][PYTHON] Implement DataFrame.mode

2021-08-04 Thread GitBox
SparkQA removed a comment on pull request #33625: URL: https://github.com/apache/spark/pull/33625#issuecomment-892817499 **[Test build #142042 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142042/testReport)** for PR 33625 at commit [`1eee115`](https://gi

[GitHub] [spark] SparkQA commented on pull request #33639: [SPARK-34952][SQL] Aggregate (Min/Max/Count) push down for Parquet

2021-08-04 Thread GitBox
SparkQA commented on pull request #33639: URL: https://github.com/apache/spark/pull/33639#issuecomment-892851806 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/46553/ -- This is an automated message from the Apache

[GitHub] [spark] SparkQA commented on pull request #33625: [SPARK-36397][PYTHON] Implement DataFrame.mode

2021-08-04 Thread GitBox
SparkQA commented on pull request #33625: URL: https://github.com/apache/spark/pull/33625#issuecomment-892852422 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/46554/ -- This is an automated message from the Apache

[GitHub] [spark] AmplabJenkins commented on pull request #33625: [SPARK-36397][PYTHON] Implement DataFrame.mode

2021-08-04 Thread GitBox
AmplabJenkins commented on pull request #33625: URL: https://github.com/apache/spark/pull/33625#issuecomment-892853978 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/142042/ -- This

[GitHub] [spark] AmplabJenkins commented on pull request #33633: [DO NOT MERGE][SPARK-36409][SQL][TESTS] Splitting test cases from datetime.sql

2021-08-04 Thread GitBox
AmplabJenkins commented on pull request #33633: URL: https://github.com/apache/spark/pull/33633#issuecomment-892853979 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/142036/ -- This

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33633: [DO NOT MERGE][SPARK-36409][SQL][TESTS] Splitting test cases from datetime.sql

2021-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #33633: URL: https://github.com/apache/spark/pull/33633#issuecomment-892853979 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/142036/ -

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33625: [SPARK-36397][PYTHON] Implement DataFrame.mode

2021-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #33625: URL: https://github.com/apache/spark/pull/33625#issuecomment-892853978 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/142042/ -

[GitHub] [spark] SparkQA commented on pull request #33627: [SPARK-36405] Check that SQLSTATEs are valid

2021-08-04 Thread GitBox
SparkQA commented on pull request #33627: URL: https://github.com/apache/spark/pull/33627#issuecomment-892855065 **[Test build #142044 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142044/testReport)** for PR 33627 at commit [`8546448`](https://github.com

[GitHub] [spark] venkata91 commented on a change in pull request #33613: [SPARK-36378][SHUFFLE] Switch to using RPCResponse to communicate common block push failures to the client.

2021-08-04 Thread GitBox
venkata91 commented on a change in pull request #33613: URL: https://github.com/apache/spark/pull/33613#discussion_r682840984 ## File path: common/network-shuffle/src/main/java/org/apache/spark/network/shuffle/RemoteBlockPushResolver.java ## @@ -471,9 +488,10 @@ public void on

[GitHub] [spark] holdenk commented on a change in pull request #33508: [SPARK-36058][K8S] Add support for statefulset APIs in K8s

2021-08-04 Thread GitBox
holdenk commented on a change in pull request #33508: URL: https://github.com/apache/spark/pull/33508#discussion_r682842336 ## File path: resource-managers/kubernetes/core/src/main/scala/org/apache/spark/deploy/k8s/features/BasicExecutorFeatureStep.scala ## @@ -260,16 +267,22

[GitHub] [spark] holdenk commented on a change in pull request #33508: [SPARK-36058][K8S] Add support for statefulset APIs in K8s

2021-08-04 Thread GitBox
holdenk commented on a change in pull request #33508: URL: https://github.com/apache/spark/pull/33508#discussion_r682843690 ## File path: resource-managers/kubernetes/core/src/main/scala/org/apache/spark/deploy/k8s/Config.scala ## @@ -323,6 +323,15 @@ private[spark] object Con

[GitHub] [spark] SparkQA commented on pull request #33640: [SPARK-36409][SQL][TESTS] Splitting test cases from datetime.sql

2021-08-04 Thread GitBox
SparkQA commented on pull request #33640: URL: https://github.com/apache/spark/pull/33640#issuecomment-892865980 **[Test build #142045 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142045/testReport)** for PR 33640 at commit [`7778951`](https://github.com

[GitHub] [spark] AmplabJenkins commented on pull request #33640: [SPARK-36409][SQL][TESTS] Splitting test cases from datetime.sql

2021-08-04 Thread GitBox
AmplabJenkins commented on pull request #33640: URL: https://github.com/apache/spark/pull/33640#issuecomment-892866396 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/46555/ -- T

[GitHub] [spark] SparkQA commented on pull request #33640: [SPARK-36409][SQL][TESTS] Splitting test cases from datetime.sql

2021-08-04 Thread GitBox
SparkQA commented on pull request #33640: URL: https://github.com/apache/spark/pull/33640#issuecomment-892866359 Kubernetes integration test unable to build dist. exiting with code: 1 URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/46555/ -- This

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33640: [SPARK-36409][SQL][TESTS] Splitting test cases from datetime.sql

2021-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #33640: URL: https://github.com/apache/spark/pull/33640#issuecomment-892866396 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/46555/

[GitHub] [spark] cloud-fan commented on a change in pull request #33640: [SPARK-36409][SQL][TESTS] Splitting test cases from datetime.sql

2021-08-04 Thread GitBox
cloud-fan commented on a change in pull request #33640: URL: https://github.com/apache/spark/pull/33640#discussion_r682854999 ## File path: sql/core/src/test/resources/sql-tests/results/ansi/date.sql.out ## @@ -0,0 +1,525 @@ +-- Automatically generated by SQLQueryTestSuite +--

[GitHub] [spark] SparkQA commented on pull request #33640: [SPARK-36409][SQL][TESTS] Splitting test cases from datetime.sql

2021-08-04 Thread GitBox
SparkQA commented on pull request #33640: URL: https://github.com/apache/spark/pull/33640#issuecomment-892873360 **[Test build #142046 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142046/testReport)** for PR 33640 at commit [`2ecae96`](https://github.com

[GitHub] [spark] cloud-fan commented on a change in pull request #33640: [SPARK-36409][SQL][TESTS] Splitting test cases from datetime.sql

2021-08-04 Thread GitBox
cloud-fan commented on a change in pull request #33640: URL: https://github.com/apache/spark/pull/33640#discussion_r682856406 ## File path: sql/core/src/test/resources/sql-tests/results/date.sql.out ## @@ -0,0 +1,515 @@ +-- Automatically generated by SQLQueryTestSuite +-- Numbe

[GitHub] [spark] SparkQA commented on pull request #33639: [SPARK-34952][SQL] Aggregate (Min/Max/Count) push down for Parquet

2021-08-04 Thread GitBox
SparkQA commented on pull request #33639: URL: https://github.com/apache/spark/pull/33639#issuecomment-892875360 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/46553/ -- This is an automated message from the A

[GitHub] [spark] cloud-fan commented on a change in pull request #33640: [SPARK-36409][SQL][TESTS] Splitting test cases from datetime.sql

2021-08-04 Thread GitBox
cloud-fan commented on a change in pull request #33640: URL: https://github.com/apache/spark/pull/33640#discussion_r682862523 ## File path: sql/core/src/test/resources/sql-tests/results/timestampNTZ/timestamp.sql.out ## @@ -0,0 +1,579 @@ +-- Automatically generated by SQLQuery

[GitHub] [spark] venkata91 commented on a change in pull request #33615: [SPARK-36374][SHUFFLE][DOC] Push-based shuffle high level user documentation

2021-08-04 Thread GitBox
venkata91 commented on a change in pull request #33615: URL: https://github.com/apache/spark/pull/33615#discussion_r682863944 ## File path: docs/configuration.md ## @@ -3134,3 +3134,119 @@ The stage level scheduling feature allows users to specify task and executor res This i

[GitHub] [spark] venkata91 commented on a change in pull request #33615: [SPARK-36374][SHUFFLE][DOC] Push-based shuffle high level user documentation

2021-08-04 Thread GitBox
venkata91 commented on a change in pull request #33615: URL: https://github.com/apache/spark/pull/33615#discussion_r682863944 ## File path: docs/configuration.md ## @@ -3134,3 +3134,119 @@ The stage level scheduling feature allows users to specify task and executor res This i

[GitHub] [spark] ueshin commented on a change in pull request #33634: [SPARK-36369] Fix Index.union to follow pandas 1.3

2021-08-04 Thread GitBox
ueshin commented on a change in pull request #33634: URL: https://github.com/apache/spark/pull/33634#discussion_r682859453 ## File path: python/pyspark/pandas/indexes/base.py ## @@ -2293,8 +2311,6 @@ def union( sdf_self = self._internal.spark_frame.select(self._intern

[GitHub] [spark] cloud-fan commented on a change in pull request #33640: [SPARK-36409][SQL][TESTS] Splitting test cases from datetime.sql

2021-08-04 Thread GitBox
cloud-fan commented on a change in pull request #33640: URL: https://github.com/apache/spark/pull/33640#discussion_r682867086 ## File path: sql/core/src/test/resources/sql-tests/inputs/datetime-legacy.sql ## @@ -1,2 +1,3 @@ --SET spark.sql.legacy.timeParserPolicy=LEGACY ---IMP

[GitHub] [spark] cloud-fan commented on pull request #33640: [SPARK-36409][SQL][TESTS] Splitting test cases from datetime.sql

2021-08-04 Thread GitBox
cloud-fan commented on pull request #33640: URL: https://github.com/apache/spark/pull/33640#issuecomment-892884661 cc @MaxGekk -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comme

[GitHub] [spark] SparkQA commented on pull request #33627: [SPARK-36405] Check that SQLSTATEs are valid

2021-08-04 Thread GitBox
SparkQA commented on pull request #33627: URL: https://github.com/apache/spark/pull/33627#issuecomment-892885494 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/46556/ -- This is an automated message from the Apache

[GitHub] [spark] ueshin commented on a change in pull request #33634: [SPARK-36369] Fix Index.union to follow pandas 1.3

2021-08-04 Thread GitBox
ueshin commented on a change in pull request #33634: URL: https://github.com/apache/spark/pull/33634#discussion_r682859453 ## File path: python/pyspark/pandas/indexes/base.py ## @@ -2293,8 +2311,6 @@ def union( sdf_self = self._internal.spark_frame.select(self._intern

[GitHub] [spark] allisonwang-db commented on a change in pull request #33530: [SPARK-36098][CORE] Grouping exception in core/storage

2021-08-04 Thread GitBox
allisonwang-db commented on a change in pull request #33530: URL: https://github.com/apache/spark/pull/33530#discussion_r682868967 ## File path: core/src/main/scala/org/apache/spark/storage/BlockManager.scala ## @@ -270,7 +271,7 @@ private[spark] class BlockManager( // Don

[GitHub] [spark] ekoifman opened a new pull request #33641: [SPARK-36416][SQL] Add SQL metrics to AdaptiveSparkPlanExec for BHJs …

2021-08-04 Thread GitBox
ekoifman opened a new pull request #33641: URL: https://github.com/apache/spark/pull/33641 …and Skew joins ### What changes were proposed in this pull request? Add "num broadcast joins conversions" and "num skew join conversions" metrics to AdaptiveSparkPlanExec to r

[GitHub] [spark] SparkQA commented on pull request #33625: [SPARK-36397][PYTHON] Implement DataFrame.mode

2021-08-04 Thread GitBox
SparkQA commented on pull request #33625: URL: https://github.com/apache/spark/pull/33625#issuecomment-892888561 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/46554/ -- This is an automated message from the A

[GitHub] [spark] MaxGekk commented on pull request #33640: [SPARK-36409][SQL][TESTS] Splitting test cases from datetime.sql

2021-08-04 Thread GitBox
MaxGekk commented on pull request #33640: URL: https://github.com/apache/spark/pull/33640#issuecomment-892888796 @cloud-fan Just for my understanding, you just move tests around w/o adding or deleting tests, correct? And the test runs with the same settings/SQL configs. -- This is an au

[GitHub] [spark] MaxGekk edited a comment on pull request #33640: [SPARK-36409][SQL][TESTS] Splitting test cases from datetime.sql

2021-08-04 Thread GitBox
MaxGekk edited a comment on pull request #33640: URL: https://github.com/apache/spark/pull/33640#issuecomment-892888796 @cloud-fan Just for my understanding, you just moved tests around w/o adding or deleting tests, correct? And the tests run with the same settings/SQL configs. -- This

[GitHub] [spark] MaxGekk edited a comment on pull request #33640: [SPARK-36409][SQL][TESTS] Splitting test cases from datetime.sql

2021-08-04 Thread GitBox
MaxGekk edited a comment on pull request #33640: URL: https://github.com/apache/spark/pull/33640#issuecomment-892888796 @cloud-fan Just for my understanding, you moved tests around w/o adding or deleting tests, correct? And the tests run with the same settings/SQL configs. -- This is an

[GitHub] [spark] AmplabJenkins commented on pull request #33625: [SPARK-36397][PYTHON] Implement DataFrame.mode

2021-08-04 Thread GitBox
AmplabJenkins commented on pull request #33625: URL: https://github.com/apache/spark/pull/33625#issuecomment-892892687 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/46554/ -- T

[GitHub] [spark] AmplabJenkins commented on pull request #33639: [SPARK-34952][SQL] Aggregate (Min/Max/Count) push down for Parquet

2021-08-04 Thread GitBox
AmplabJenkins commented on pull request #33639: URL: https://github.com/apache/spark/pull/33639#issuecomment-892892692 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/46553/ -- T

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33625: [SPARK-36397][PYTHON] Implement DataFrame.mode

2021-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #33625: URL: https://github.com/apache/spark/pull/33625#issuecomment-892892687 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/46554/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33639: [SPARK-34952][SQL] Aggregate (Min/Max/Count) push down for Parquet

2021-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #33639: URL: https://github.com/apache/spark/pull/33639#issuecomment-892892692 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/46553/

[GitHub] [spark] otterc commented on a change in pull request #33613: [SPARK-36378][SHUFFLE] Switch to using RPCResponse to communicate common block push failures to the client.

2021-08-04 Thread GitBox
otterc commented on a change in pull request #33613: URL: https://github.com/apache/spark/pull/33613#discussion_r682876739 ## File path: common/network-shuffle/src/main/java/org/apache/spark/network/shuffle/RemoteBlockPushResolver.java ## @@ -471,9 +488,10 @@ public void onDat

[GitHub] [spark] MaxGekk edited a comment on pull request #33640: [SPARK-36409][SQL][TESTS] Splitting test cases from datetime.sql

2021-08-04 Thread GitBox
MaxGekk edited a comment on pull request #33640: URL: https://github.com/apache/spark/pull/33640#issuecomment-892888796 @cloud-fan Just for my understanding, you moved tests around w/o adding or deleting tests, correct? So, test coverage should be the same. -- This is an automated messag

[GitHub] [spark] AmplabJenkins commented on pull request #33641: [SPARK-36416][SQL] Add SQL metrics to AdaptiveSparkPlanExec for BHJs …

2021-08-04 Thread GitBox
AmplabJenkins commented on pull request #33641: URL: https://github.com/apache/spark/pull/33641#issuecomment-892894773 Can one of the admins verify this patch? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL a

[GitHub] [spark] otterc commented on a change in pull request #33613: [SPARK-36378][SHUFFLE] Switch to using RPCResponse to communicate common block push failures to the client.

2021-08-04 Thread GitBox
otterc commented on a change in pull request #33613: URL: https://github.com/apache/spark/pull/33613#discussion_r682876739 ## File path: common/network-shuffle/src/main/java/org/apache/spark/network/shuffle/RemoteBlockPushResolver.java ## @@ -471,9 +488,10 @@ public void onDat

[GitHub] [spark] ueshin commented on a change in pull request #33634: [SPARK-36369] Fix Index.union to follow pandas 1.3

2021-08-04 Thread GitBox
ueshin commented on a change in pull request #33634: URL: https://github.com/apache/spark/pull/33634#discussion_r682859453 ## File path: python/pyspark/pandas/indexes/base.py ## @@ -2293,8 +2311,6 @@ def union( sdf_self = self._internal.spark_frame.select(self._intern

[GitHub] [spark] SparkQA commented on pull request #33640: [SPARK-36409][SQL][TESTS] Splitting test cases from datetime.sql

2021-08-04 Thread GitBox
SparkQA commented on pull request #33640: URL: https://github.com/apache/spark/pull/33640#issuecomment-892897438 **[Test build #142047 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142047/testReport)** for PR 33640 at commit [`0a007d2`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #33625: [SPARK-36397][PYTHON] Implement DataFrame.mode

2021-08-04 Thread GitBox
SparkQA commented on pull request #33625: URL: https://github.com/apache/spark/pull/33625#issuecomment-892897460 **[Test build #142048 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142048/testReport)** for PR 33625 at commit [`97de65b`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #33508: [SPARK-36058][K8S] Add support for statefulset APIs in K8s

2021-08-04 Thread GitBox
SparkQA commented on pull request #33508: URL: https://github.com/apache/spark/pull/33508#issuecomment-892897952 **[Test build #142049 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142049/testReport)** for PR 33508 at commit [`e8eece5`](https://github.com

[GitHub] [spark] MaxGekk removed a comment on pull request #33640: [SPARK-36409][SQL][TESTS] Splitting test cases from datetime.sql

2021-08-04 Thread GitBox
MaxGekk removed a comment on pull request #33640: URL: https://github.com/apache/spark/pull/33640#issuecomment-892888796 @cloud-fan Just for my understanding, you moved tests around w/o adding or deleting tests, correct? So, test coverage should be the same. -- This is an automated messa

[GitHub] [spark] asfgit closed pull request #31517: [SPARK-34309][BUILD][CORE][SQL][K8S]Use Caffeine instead of Guava Cache

2021-08-04 Thread GitBox
asfgit closed pull request #31517: URL: https://github.com/apache/spark/pull/31517 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: reviews-unsubsc

[GitHub] [spark] holdenk commented on pull request #31517: [SPARK-34309][BUILD][CORE][SQL][K8S]Use Caffeine instead of Guava Cache

2021-08-04 Thread GitBox
holdenk commented on pull request #31517: URL: https://github.com/apache/spark/pull/31517#issuecomment-892899674 Merged to the current branch targeting 3.3.0. Thanks everyone for taking the time to review the PR & special thanks to @LuciferYang for sticking with this for several months.

[GitHub] [spark] SparkQA commented on pull request #33640: [SPARK-36409][SQL][TESTS] Splitting test cases from datetime.sql

2021-08-04 Thread GitBox
SparkQA commented on pull request #33640: URL: https://github.com/apache/spark/pull/33640#issuecomment-892900687 Kubernetes integration test unable to build dist. exiting with code: 1 URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/46557/ -- This

[GitHub] [spark] AmplabJenkins commented on pull request #33640: [SPARK-36409][SQL][TESTS] Splitting test cases from datetime.sql

2021-08-04 Thread GitBox
AmplabJenkins commented on pull request #33640: URL: https://github.com/apache/spark/pull/33640#issuecomment-892900729 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/46557/ -- T

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33640: [SPARK-36409][SQL][TESTS] Splitting test cases from datetime.sql

2021-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #33640: URL: https://github.com/apache/spark/pull/33640#issuecomment-892900729 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/46557/

[GitHub] [spark] SparkQA commented on pull request #33627: [SPARK-36405] Check that SQLSTATEs are valid

2021-08-04 Thread GitBox
SparkQA commented on pull request #33627: URL: https://github.com/apache/spark/pull/33627#issuecomment-892909274 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/46556/ -- This is an automated message from the A

[GitHub] [spark] AmplabJenkins commented on pull request #33627: [SPARK-36405] Check that SQLSTATEs are valid

2021-08-04 Thread GitBox
AmplabJenkins commented on pull request #33627: URL: https://github.com/apache/spark/pull/33627#issuecomment-892909298 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/46556/ -- T

[GitHub] [spark] SparkQA commented on pull request #33639: [SPARK-34952][SQL] Aggregate (Min/Max/Count) push down for Parquet

2021-08-04 Thread GitBox
SparkQA commented on pull request #33639: URL: https://github.com/apache/spark/pull/33639#issuecomment-892909839 **[Test build #142041 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142041/testReport)** for PR 33639 at commit [`3ea5fdd`](https://github.co

[GitHub] [spark] SparkQA removed a comment on pull request #33639: [SPARK-34952][SQL] Aggregate (Min/Max/Count) push down for Parquet

2021-08-04 Thread GitBox
SparkQA removed a comment on pull request #33639: URL: https://github.com/apache/spark/pull/33639#issuecomment-892817410 **[Test build #142041 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142041/testReport)** for PR 33639 at commit [`3ea5fdd`](https://gi

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33627: [SPARK-36405] Check that SQLSTATEs are valid

2021-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #33627: URL: https://github.com/apache/spark/pull/33627#issuecomment-892909298 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/46556/

[GitHub] [spark] AmplabJenkins commented on pull request #33639: [SPARK-34952][SQL] Aggregate (Min/Max/Count) push down for Parquet

2021-08-04 Thread GitBox
AmplabJenkins commented on pull request #33639: URL: https://github.com/apache/spark/pull/33639#issuecomment-892910682 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/142041/ -- This

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33639: [SPARK-34952][SQL] Aggregate (Min/Max/Count) push down for Parquet

2021-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #33639: URL: https://github.com/apache/spark/pull/33639#issuecomment-892910682 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/142041/ -

[GitHub] [spark] SparkQA commented on pull request #33625: [SPARK-36397][PYTHON] Implement DataFrame.mode

2021-08-04 Thread GitBox
SparkQA commented on pull request #33625: URL: https://github.com/apache/spark/pull/33625#issuecomment-892915939 **[Test build #142048 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142048/testReport)** for PR 33625 at commit [`97de65b`](https://github.co

[GitHub] [spark] SparkQA removed a comment on pull request #33625: [SPARK-36397][PYTHON] Implement DataFrame.mode

2021-08-04 Thread GitBox
SparkQA removed a comment on pull request #33625: URL: https://github.com/apache/spark/pull/33625#issuecomment-892897460 **[Test build #142048 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142048/testReport)** for PR 33625 at commit [`97de65b`](https://gi

[GitHub] [spark] SparkQA commented on pull request #33640: [SPARK-36409][SQL][TESTS] Splitting test cases from datetime.sql

2021-08-04 Thread GitBox
SparkQA commented on pull request #33640: URL: https://github.com/apache/spark/pull/33640#issuecomment-892922709 **[Test build #142043 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142043/testReport)** for PR 33640 at commit [`12b18f6`](https://github.co

[GitHub] [spark] SparkQA removed a comment on pull request #33640: [SPARK-36409][SQL][TESTS] Splitting test cases from datetime.sql

2021-08-04 Thread GitBox
SparkQA removed a comment on pull request #33640: URL: https://github.com/apache/spark/pull/33640#issuecomment-892829484 **[Test build #142043 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142043/testReport)** for PR 33640 at commit [`12b18f6`](https://gi

[GitHub] [spark] SparkQA commented on pull request #33625: [SPARK-36397][PYTHON] Implement DataFrame.mode

2021-08-04 Thread GitBox
SparkQA commented on pull request #33625: URL: https://github.com/apache/spark/pull/33625#issuecomment-892925550 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/46559/ -- This is an automated message from the Apache

[GitHub] [spark] viirya opened a new pull request #33642: [NOT-MERGE] Only for testing GA

2021-08-04 Thread GitBox
viirya opened a new pull request #33642: URL: https://github.com/apache/spark/pull/33642 ### What changes were proposed in this pull request? ### Why are the changes needed? ### Does this PR introduce _any_ user-facing change? ### How was

<    1   2   3   4   5   6   >