[GitHub] [spark] SparkQA removed a comment on pull request #34248: [SPARK-36647][SQL][TESTS] Push down Aggregate (Min/Max/Count) for Parquet if filter is on partition col

2021-10-27 Thread GitBox
SparkQA removed a comment on pull request #34248: URL: https://github.com/apache/spark/pull/34248#issuecomment-952475127 **[Test build #144636 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144636/testReport)** for PR 34248 at commit [`1293ae0`](https://gi

[GitHub] [spark] cloud-fan commented on a change in pull request #34291: [SPARK-37020][SQL] DS V2 LIMIT push down

2021-10-27 Thread GitBox
cloud-fan commented on a change in pull request #34291: URL: https://github.com/apache/spark/pull/34291#discussion_r737162835 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/V2ScanRelationPushDown.scala ## @@ -245,13 +258,18 @@ object V2Scan

[GitHub] [spark] AmplabJenkins commented on pull request #34389: [SPARK-37036][PYTHON] Add util function to raise advice warning for pandas API on Spark.

2021-10-27 Thread GitBox
AmplabJenkins commented on pull request #34389: URL: https://github.com/apache/spark/pull/34389#issuecomment-952603770 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To un

[GitHub] [spark] AmplabJenkins commented on pull request #34248: [SPARK-36647][SQL][TESTS] Push down Aggregate (Min/Max/Count) for Parquet if filter is on partition col

2021-10-27 Thread GitBox
AmplabJenkins commented on pull request #34248: URL: https://github.com/apache/spark/pull/34248#issuecomment-952603764 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144636/ -- This

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34248: [SPARK-36647][SQL][TESTS] Push down Aggregate (Min/Max/Count) for Parquet if filter is on partition col

2021-10-27 Thread GitBox
AmplabJenkins removed a comment on pull request #34248: URL: https://github.com/apache/spark/pull/34248#issuecomment-952603764 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144636/ -

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34389: [SPARK-37036][PYTHON] Add util function to raise advice warning for pandas API on Spark.

2021-10-27 Thread GitBox
AmplabJenkins removed a comment on pull request #34389: URL: https://github.com/apache/spark/pull/34389#issuecomment-952603765 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment

[GitHub] [spark] SparkQA commented on pull request #34389: [SPARK-37036][PYTHON] Add util function to raise advice warning for pandas API on Spark.

2021-10-27 Thread GitBox
SparkQA commented on pull request #34389: URL: https://github.com/apache/spark/pull/34389#issuecomment-952605287 **[Test build #144649 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144649/testReport)** for PR 34389 at commit [`b9f40b2`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #34380: [SPARK-37082][SQL] Implements histogram_numeric aggregation function which supports partial aggregation.

2021-10-27 Thread GitBox
SparkQA commented on pull request #34380: URL: https://github.com/apache/spark/pull/34380#issuecomment-952605355 **[Test build #144650 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144650/testReport)** for PR 34380 at commit [`5ec9afb`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #34291: [SPARK-37020][SQL] DS V2 LIMIT push down

2021-10-27 Thread GitBox
SparkQA commented on pull request #34291: URL: https://github.com/apache/spark/pull/34291#issuecomment-952605459 **[Test build #144651 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144651/testReport)** for PR 34291 at commit [`008aadb`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #34380: [SPARK-37082][SQL] Implements histogram_numeric aggregation function which supports partial aggregation.

2021-10-27 Thread GitBox
SparkQA commented on pull request #34380: URL: https://github.com/apache/spark/pull/34380#issuecomment-952607864 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/49118/ -- This is an automated message from the Apache

[GitHub] [spark] viirya commented on pull request #34248: [SPARK-36647][SQL][TESTS] Push down Aggregate (Min/Max/Count) for Parquet if filter is on partition col

2021-10-27 Thread GitBox
viirya commented on pull request #34248: URL: https://github.com/apache/spark/pull/34248#issuecomment-952608157 Thanks! Merging to master. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the spe

[GitHub] [spark] viirya closed pull request #34248: [SPARK-36647][SQL][TESTS] Push down Aggregate (Min/Max/Count) for Parquet if filter is on partition col

2021-10-27 Thread GitBox
viirya closed pull request #34248: URL: https://github.com/apache/spark/pull/34248 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: reviews-unsubsc

[GitHub] [spark] SparkQA commented on pull request #34397: [SPARK-36348][PYTHON][FOLLOWUP] Complete test_astype for index

2021-10-27 Thread GitBox
SparkQA commented on pull request #34397: URL: https://github.com/apache/spark/pull/34397#issuecomment-952612387 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/49115/ -- This is an automated message from the A

[GitHub] [spark] AmplabJenkins commented on pull request #34397: [SPARK-36348][PYTHON][FOLLOWUP] Complete test_astype for index

2021-10-27 Thread GitBox
AmplabJenkins commented on pull request #34397: URL: https://github.com/apache/spark/pull/34397#issuecomment-952612434 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/49115/ -- T

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34397: [SPARK-36348][PYTHON][FOLLOWUP] Complete test_astype for index

2021-10-27 Thread GitBox
AmplabJenkins removed a comment on pull request #34397: URL: https://github.com/apache/spark/pull/34397#issuecomment-952612434 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/49115/

[GitHub] [spark] huaxingao commented on pull request #34248: [SPARK-36647][SQL][TESTS] Push down Aggregate (Min/Max/Count) for Parquet if filter is on partition col

2021-10-27 Thread GitBox
huaxingao commented on pull request #34248: URL: https://github.com/apache/spark/pull/34248#issuecomment-952612514 Thanks @c21 @viirya -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specif

[GitHub] [spark] SparkQA commented on pull request #34291: [SPARK-37020][SQL] DS V2 LIMIT push down

2021-10-27 Thread GitBox
SparkQA commented on pull request #34291: URL: https://github.com/apache/spark/pull/34291#issuecomment-952613356 **[Test build #144652 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144652/testReport)** for PR 34291 at commit [`a36007e`](https://github.com

[GitHub] [spark] cloud-fan commented on pull request #34399: [SPARK-37031][SQL][TESTS][FOLLOWUP] Add a missing test to DescribeNamespaceSuite

2021-10-27 Thread GitBox
cloud-fan commented on pull request #34399: URL: https://github.com/apache/spark/pull/34399#issuecomment-952617947 thanks, merging to master! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [spark] cloud-fan closed pull request #34399: [SPARK-37031][SQL][TESTS][FOLLOWUP] Add a missing test to DescribeNamespaceSuite

2021-10-27 Thread GitBox
cloud-fan closed pull request #34399: URL: https://github.com/apache/spark/pull/34399 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: reviews-unsu

[GitHub] [spark] cloud-fan commented on pull request #34398: [SPARK-37125][SQL] Support AnsiInterval radix sort

2021-10-27 Thread GitBox
cloud-fan commented on pull request #34398: URL: https://github.com/apache/spark/pull/34398#issuecomment-952620111 thanks, merging to master! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [spark] cloud-fan closed pull request #34398: [SPARK-37125][SQL] Support AnsiInterval radix sort

2021-10-27 Thread GitBox
cloud-fan closed pull request #34398: URL: https://github.com/apache/spark/pull/34398 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: reviews-unsu

[GitHub] [spark] cloud-fan commented on pull request #34388: [SPARK-37115][SQL] HiveClientImpl should use shim to wrap all hive client calls

2021-10-27 Thread GitBox
cloud-fan commented on pull request #34388: URL: https://github.com/apache/spark/pull/34388#issuecomment-952620977 thanks, merging to master! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [spark] cloud-fan closed pull request #34388: [SPARK-37115][SQL] HiveClientImpl should use shim to wrap all hive client calls

2021-10-27 Thread GitBox
cloud-fan closed pull request #34388: URL: https://github.com/apache/spark/pull/34388 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: reviews-unsu

[GitHub] [spark] cloud-fan commented on a change in pull request #34396: [SPARK-37124][SQL] Add ArrowWritableColumnVector

2021-10-27 Thread GitBox
cloud-fan commented on a change in pull request #34396: URL: https://github.com/apache/spark/pull/34396#discussion_r737188529 ## File path: sql/core/src/main/java/org/apache/spark/sql/execution/vectorized/ArrowWritableColumnVector.java ## @@ -0,0 +1,1322 @@ +/* + * Licensed to

[GitHub] [spark] cloud-fan commented on a change in pull request #34041: [SPARK-36799][SQL] Pass queryExecution name in CLI when only select query

2021-10-27 Thread GitBox
cloud-fan commented on a change in pull request #34041: URL: https://github.com/apache/spark/pull/34041#discussion_r737190347 ## File path: sql/hive-thriftserver/src/main/scala/org/apache/spark/sql/hive/thriftserver/SparkSQLDriver.scala ## @@ -65,7 +65,11 @@ private[hive] clas

[GitHub] [spark] SparkQA commented on pull request #34389: [SPARK-37036][PYTHON] Add util function to raise advice warning for pandas API on Spark.

2021-10-27 Thread GitBox
SparkQA commented on pull request #34389: URL: https://github.com/apache/spark/pull/34389#issuecomment-952625830 **[Test build #144649 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144649/testReport)** for PR 34389 at commit [`b9f40b2`](https://github.co

[GitHub] [spark] xuechendi commented on a change in pull request #34396: [SPARK-37124][SQL] Add ArrowWritableColumnVector

2021-10-27 Thread GitBox
xuechendi commented on a change in pull request #34396: URL: https://github.com/apache/spark/pull/34396#discussion_r737197861 ## File path: sql/core/src/main/java/org/apache/spark/sql/execution/vectorized/ArrowWritableColumnVector.java ## @@ -0,0 +1,1322 @@ +/* + * Licensed to

[GitHub] [spark] SparkQA commented on pull request #34380: [SPARK-37082][SQL] Implements histogram_numeric aggregation function which supports partial aggregation.

2021-10-27 Thread GitBox
SparkQA commented on pull request #34380: URL: https://github.com/apache/spark/pull/34380#issuecomment-952633764 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/49118/ -- This is an automated message from the A

[GitHub] [spark] SparkQA commented on pull request #34399: [SPARK-37031][SQL][TESTS][FOLLOWUP] Add a missing test to DescribeNamespaceSuite

2021-10-27 Thread GitBox
SparkQA commented on pull request #34399: URL: https://github.com/apache/spark/pull/34399#issuecomment-952634255 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/49117/ -- This is an automated message from the A

[GitHub] [spark] SparkQA commented on pull request #34389: [SPARK-37036][PYTHON] Add util function to raise advice warning for pandas API on Spark.

2021-10-27 Thread GitBox
SparkQA commented on pull request #34389: URL: https://github.com/apache/spark/pull/34389#issuecomment-952638296 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/49119/ -- This is an automated message from the Apache

[GitHub] [spark] SparkQA commented on pull request #34380: [SPARK-37082][SQL] Implements histogram_numeric aggregation function which supports partial aggregation.

2021-10-27 Thread GitBox
SparkQA commented on pull request #34380: URL: https://github.com/apache/spark/pull/34380#issuecomment-952638858 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/49120/ -- This is an automated message from the Apache

[GitHub] [spark] SparkQA commented on pull request #34291: [SPARK-37020][SQL] DS V2 LIMIT push down

2021-10-27 Thread GitBox
SparkQA commented on pull request #34291: URL: https://github.com/apache/spark/pull/34291#issuecomment-952638813 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/49121/ -- This is an automated message from the Apache

[GitHub] [spark] cxzl25 commented on a change in pull request #34041: [SPARK-36799][SQL] Pass queryExecution name in CLI when only select query

2021-10-27 Thread GitBox
cxzl25 commented on a change in pull request #34041: URL: https://github.com/apache/spark/pull/34041#discussion_r737205210 ## File path: sql/hive-thriftserver/src/main/scala/org/apache/spark/sql/hive/thriftserver/SparkSQLDriver.scala ## @@ -65,7 +65,11 @@ private[hive] class S

[GitHub] [spark] SparkQA removed a comment on pull request #34389: [SPARK-37036][PYTHON] Add util function to raise advice warning for pandas API on Spark.

2021-10-27 Thread GitBox
SparkQA removed a comment on pull request #34389: URL: https://github.com/apache/spark/pull/34389#issuecomment-952605287 **[Test build #144649 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144649/testReport)** for PR 34389 at commit [`b9f40b2`](https://gi

[GitHub] [spark] AmplabJenkins commented on pull request #34380: [SPARK-37082][SQL] Implements histogram_numeric aggregation function which supports partial aggregation.

2021-10-27 Thread GitBox
AmplabJenkins commented on pull request #34380: URL: https://github.com/apache/spark/pull/34380#issuecomment-952645961 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/49118/ -- T

[GitHub] [spark] AmplabJenkins commented on pull request #34399: [SPARK-37031][SQL][TESTS][FOLLOWUP] Add a missing test to DescribeNamespaceSuite

2021-10-27 Thread GitBox
AmplabJenkins commented on pull request #34399: URL: https://github.com/apache/spark/pull/34399#issuecomment-952645951 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/49117/ -- T

[GitHub] [spark] AmplabJenkins commented on pull request #34389: [SPARK-37036][PYTHON] Add util function to raise advice warning for pandas API on Spark.

2021-10-27 Thread GitBox
AmplabJenkins commented on pull request #34389: URL: https://github.com/apache/spark/pull/34389#issuecomment-952645956 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144649/ -- This

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34389: [SPARK-37036][PYTHON] Add util function to raise advice warning for pandas API on Spark.

2021-10-27 Thread GitBox
AmplabJenkins removed a comment on pull request #34389: URL: https://github.com/apache/spark/pull/34389#issuecomment-952645956 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144649/ -

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34380: [SPARK-37082][SQL] Implements histogram_numeric aggregation function which supports partial aggregation.

2021-10-27 Thread GitBox
AmplabJenkins removed a comment on pull request #34380: URL: https://github.com/apache/spark/pull/34380#issuecomment-952645961 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/49118/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34399: [SPARK-37031][SQL][TESTS][FOLLOWUP] Add a missing test to DescribeNamespaceSuite

2021-10-27 Thread GitBox
AmplabJenkins removed a comment on pull request #34399: URL: https://github.com/apache/spark/pull/34399#issuecomment-952645951 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/49117/

[GitHub] [spark] SparkQA commented on pull request #34291: [SPARK-37020][SQL] DS V2 LIMIT push down

2021-10-27 Thread GitBox
SparkQA commented on pull request #34291: URL: https://github.com/apache/spark/pull/34291#issuecomment-952651727 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/49122/ -- This is an automated message from the Apache

[GitHub] [spark] dongjoon-hyun commented on pull request #32583: [SPARK-35437][SQL] Use expressions to filter Hive partitions at client side

2021-10-27 Thread GitBox
dongjoon-hyun commented on pull request #32583: URL: https://github.com/apache/spark/pull/32583#issuecomment-952660943 Oops. Sorry. I didn't catch up the flakiness after merging. Thanks, @HyukjinKwon . -- This is an automated message from the Apache Git Service. To respond to the messa

[GitHub] [spark] careyhay commented on pull request #28743: [SPARK-31920][PYTHON] Fix pandas conversion using Arrow with __arrow_array__ columns

2021-10-27 Thread GitBox
careyhay commented on pull request #28743: URL: https://github.com/apache/spark/pull/28743#issuecomment-952668366 Any way this can be revived and pulled?! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above

[GitHub] [spark] cloud-fan commented on pull request #34302: [SPARK-37028][UI] Add a 'kill' executor link in the Web UI.

2021-10-27 Thread GitBox
cloud-fan commented on pull request #34302: URL: https://github.com/apache/spark/pull/34302#issuecomment-952671644 I don't think it's a good idea to let users identify bad executors and kill them, can we do it automatically? -- This is an automated message from the Apache Git Service. To

[GitHub] [spark] cloud-fan commented on a change in pull request #34396: [SPARK-37124][SQL] Add ArrowWritableColumnVector

2021-10-27 Thread GitBox
cloud-fan commented on a change in pull request #34396: URL: https://github.com/apache/spark/pull/34396#discussion_r737237402 ## File path: sql/core/src/main/java/org/apache/spark/sql/execution/vectorized/ArrowWritableColumnVector.java ## @@ -0,0 +1,1322 @@ +/* + * Licensed to

[GitHub] [spark] cloud-fan commented on a change in pull request #34396: [SPARK-37124][SQL] Add ArrowWritableColumnVector

2021-10-27 Thread GitBox
cloud-fan commented on a change in pull request #34396: URL: https://github.com/apache/spark/pull/34396#discussion_r737237402 ## File path: sql/core/src/main/java/org/apache/spark/sql/execution/vectorized/ArrowWritableColumnVector.java ## @@ -0,0 +1,1322 @@ +/* + * Licensed to

[GitHub] [spark] cloud-fan commented on a change in pull request #34041: [SPARK-36799][SQL] Pass queryExecution name in CLI when only select query

2021-10-27 Thread GitBox
cloud-fan commented on a change in pull request #34041: URL: https://github.com/apache/spark/pull/34041#discussion_r737238067 ## File path: sql/hive-thriftserver/src/main/scala/org/apache/spark/sql/hive/thriftserver/SparkSQLDriver.scala ## @@ -65,7 +65,11 @@ private[hive] clas

[GitHub] [spark] SparkQA commented on pull request #34389: [SPARK-37036][PYTHON] Add util function to raise advice warning for pandas API on Spark.

2021-10-27 Thread GitBox
SparkQA commented on pull request #34389: URL: https://github.com/apache/spark/pull/34389#issuecomment-952678550 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/49119/ -- This is an automated message from the A

[GitHub] [spark] SparkQA commented on pull request #34380: [SPARK-37082][SQL] Implements histogram_numeric aggregation function which supports partial aggregation.

2021-10-27 Thread GitBox
SparkQA commented on pull request #34380: URL: https://github.com/apache/spark/pull/34380#issuecomment-952678769 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/49120/ -- This is an automated message from the A

[GitHub] [spark] LuciferYang edited a comment on pull request #34376: [SPARK-37105][TEST] Pass all UTs in `sql/hive` with Java 17

2021-10-27 Thread GitBox
LuciferYang edited a comment on pull request #34376: URL: https://github.com/apache/spark/pull/34376#issuecomment-952510648 > This seems specific to M1? E.g. cl-plus-ssl/cl-plus-ssl#114 too. Seems like it's coming from the Azul jdk itself. Maybe something has to get updated on that end

[GitHub] [spark] cxzl25 commented on a change in pull request #34041: [SPARK-36799][SQL] Pass queryExecution name in CLI when only select query

2021-10-27 Thread GitBox
cxzl25 commented on a change in pull request #34041: URL: https://github.com/apache/spark/pull/34041#discussion_r737243549 ## File path: sql/hive-thriftserver/src/main/scala/org/apache/spark/sql/hive/thriftserver/SparkSQLDriver.scala ## @@ -65,7 +65,11 @@ private[hive] class S

[GitHub] [spark] dongjoon-hyun commented on pull request #34199: [SPARK-36935][SQL] Extend ParquetSchemaConverter to compute Parquet repetition & definition level

2021-10-27 Thread GitBox
dongjoon-hyun commented on pull request #34199: URL: https://github.com/apache/spark/pull/34199#issuecomment-952684798 Retest this please. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the spe

[GitHub] [spark] SparkQA commented on pull request #34398: [SPARK-37125][SQL] Support AnsiInterval radix sort

2021-10-27 Thread GitBox
SparkQA commented on pull request #34398: URL: https://github.com/apache/spark/pull/34398#issuecomment-952689872 **[Test build #144642 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144642/testReport)** for PR 34398 at commit [`d472055`](https://github.co

[GitHub] [spark] SparkQA commented on pull request #34291: [SPARK-37020][SQL] DS V2 LIMIT push down

2021-10-27 Thread GitBox
SparkQA commented on pull request #34291: URL: https://github.com/apache/spark/pull/34291#issuecomment-952692760 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/49122/ -- This is an automated message from the A

[GitHub] [spark] SparkQA commented on pull request #34291: [SPARK-37020][SQL] DS V2 LIMIT push down

2021-10-27 Thread GitBox
SparkQA commented on pull request #34291: URL: https://github.com/apache/spark/pull/34291#issuecomment-952694220 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/49121/ -- This is an automated message from the A

[GitHub] [spark] SparkQA removed a comment on pull request #34398: [SPARK-37125][SQL] Support AnsiInterval radix sort

2021-10-27 Thread GitBox
SparkQA removed a comment on pull request #34398: URL: https://github.com/apache/spark/pull/34398#issuecomment-952520197 **[Test build #144642 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144642/testReport)** for PR 34398 at commit [`d472055`](https://gi

[GitHub] [spark] xuechendi commented on a change in pull request #34396: [SPARK-37124][SQL] Add ArrowWritableColumnVector

2021-10-27 Thread GitBox
xuechendi commented on a change in pull request #34396: URL: https://github.com/apache/spark/pull/34396#discussion_r737258784 ## File path: sql/core/src/main/java/org/apache/spark/sql/execution/vectorized/ArrowWritableColumnVector.java ## @@ -0,0 +1,1322 @@ +/* + * Licensed to

[GitHub] [spark] sadikovi commented on pull request #34199: [SPARK-36935][SQL] Extend ParquetSchemaConverter to compute Parquet repetition & definition level

2021-10-27 Thread GitBox
sadikovi commented on pull request #34199: URL: https://github.com/apache/spark/pull/34199#issuecomment-952698814 My apologies for the delay, I will take a look shortly. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use

[GitHub] [spark] AmplabJenkins commented on pull request #34380: [SPARK-37082][SQL] Implements histogram_numeric aggregation function which supports partial aggregation.

2021-10-27 Thread GitBox
AmplabJenkins commented on pull request #34380: URL: https://github.com/apache/spark/pull/34380#issuecomment-952702317 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/49120/ -- T

[GitHub] [spark] AmplabJenkins commented on pull request #34291: [SPARK-37020][SQL] DS V2 LIMIT push down

2021-10-27 Thread GitBox
AmplabJenkins commented on pull request #34291: URL: https://github.com/apache/spark/pull/34291#issuecomment-952702320 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To un

[GitHub] [spark] AmplabJenkins commented on pull request #34398: [SPARK-37125][SQL] Support AnsiInterval radix sort

2021-10-27 Thread GitBox
AmplabJenkins commented on pull request #34398: URL: https://github.com/apache/spark/pull/34398#issuecomment-952702316 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144642/ -- This

[GitHub] [spark] xuechendi commented on a change in pull request #34396: [SPARK-37124][SQL] Add ArrowWritableColumnVector

2021-10-27 Thread GitBox
xuechendi commented on a change in pull request #34396: URL: https://github.com/apache/spark/pull/34396#discussion_r737262703 ## File path: sql/core/src/main/java/org/apache/spark/sql/execution/vectorized/ArrowWritableColumnVector.java ## @@ -0,0 +1,1322 @@ +/* + * Licensed to

[GitHub] [spark] AmplabJenkins commented on pull request #34389: [SPARK-37036][PYTHON] Add util function to raise advice warning for pandas API on Spark.

2021-10-27 Thread GitBox
AmplabJenkins commented on pull request #34389: URL: https://github.com/apache/spark/pull/34389#issuecomment-952702319 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/49119/ -- T

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34380: [SPARK-37082][SQL] Implements histogram_numeric aggregation function which supports partial aggregation.

2021-10-27 Thread GitBox
AmplabJenkins removed a comment on pull request #34380: URL: https://github.com/apache/spark/pull/34380#issuecomment-952702317 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/49120/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34291: [SPARK-37020][SQL] DS V2 LIMIT push down

2021-10-27 Thread GitBox
AmplabJenkins removed a comment on pull request #34291: URL: https://github.com/apache/spark/pull/34291#issuecomment-952702320 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34389: [SPARK-37036][PYTHON] Add util function to raise advice warning for pandas API on Spark.

2021-10-27 Thread GitBox
AmplabJenkins removed a comment on pull request #34389: URL: https://github.com/apache/spark/pull/34389#issuecomment-952702319 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/49119/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34398: [SPARK-37125][SQL] Support AnsiInterval radix sort

2021-10-27 Thread GitBox
AmplabJenkins removed a comment on pull request #34398: URL: https://github.com/apache/spark/pull/34398#issuecomment-952702316 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144642/ -

[GitHub] [spark] SparkQA commented on pull request #34241: [SPARK-36975][SQL] Correct the hive client calls‘s metrics in HiveClientImpl

2021-10-27 Thread GitBox
SparkQA commented on pull request #34241: URL: https://github.com/apache/spark/pull/34241#issuecomment-952703667 **[Test build #144653 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144653/testReport)** for PR 34241 at commit [`80ffe42`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #34199: [SPARK-36935][SQL] Extend ParquetSchemaConverter to compute Parquet repetition & definition level

2021-10-27 Thread GitBox
SparkQA commented on pull request #34199: URL: https://github.com/apache/spark/pull/34199#issuecomment-952703809 **[Test build #144654 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144654/testReport)** for PR 34199 at commit [`96fe294`](https://github.com

[GitHub] [spark] LuciferYang commented on pull request #32044: [SPARK-34950][TESTS] Update benchmark results to the ones created by GitHub Actions machines

2021-10-27 Thread GitBox
LuciferYang commented on pull request #32044: URL: https://github.com/apache/spark/pull/32044#issuecomment-952708322 @HyukjinKwon Can we use this way to generate the benchmarks results with Java 17? On the other hand, I found some benchmarks do not have corresponding Java 11 result

[GitHub] [spark] LuciferYang commented on pull request #32044: [SPARK-34950][TESTS] Update benchmark results to the ones created by GitHub Actions machines

2021-10-27 Thread GitBox
LuciferYang commented on pull request #32044: URL: https://github.com/apache/spark/pull/32044#issuecomment-952709194 > @HyukjinKwon Can we use this way to generate the benchmarks results with Java 17? Let me study #32015 first -- This is an automated message from the Apach

[GitHub] [spark] LuciferYang edited a comment on pull request #32044: [SPARK-34950][TESTS] Update benchmark results to the ones created by GitHub Actions machines

2021-10-27 Thread GitBox
LuciferYang edited a comment on pull request #32044: URL: https://github.com/apache/spark/pull/32044#issuecomment-952709194 > @HyukjinKwon Can we use this way to generate the benchmarks results with Java 17? Let me study #32015 first. Should all new benchmarks results need generate

[GitHub] [spark] cloud-fan commented on a change in pull request #34396: [SPARK-37124][SQL] Add ArrowWritableColumnVector

2021-10-27 Thread GitBox
cloud-fan commented on a change in pull request #34396: URL: https://github.com/apache/spark/pull/34396#discussion_r737271557 ## File path: sql/core/src/main/java/org/apache/spark/sql/execution/vectorized/ArrowWritableColumnVector.java ## @@ -0,0 +1,1322 @@ +/* + * Licensed to

[GitHub] [spark] ulysses-you commented on pull request #34398: [SPARK-37125][SQL] Support AnsiInterval radix sort

2021-10-27 Thread GitBox
ulysses-you commented on pull request #34398: URL: https://github.com/apache/spark/pull/34398#issuecomment-952716226 thank you @cloud-fan -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the spe

[GitHub] [spark] SparkQA commented on pull request #34323: [SPARK-37042][PYTHON] Inline type hints for kinesis.py and listener.py in python/pyspark/streaming

2021-10-27 Thread GitBox
SparkQA commented on pull request #34323: URL: https://github.com/apache/spark/pull/34323#issuecomment-952719129 **[Test build #144655 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144655/testReport)** for PR 34323 at commit [`21206fe`](https://github.com

[GitHub] [spark] weixiuli commented on pull request #34302: [SPARK-37028][UI] Add a 'kill' executor link in the Web UI.

2021-10-27 Thread GitBox
weixiuli commented on pull request #34302: URL: https://github.com/apache/spark/pull/34302#issuecomment-952726471 Thanks @cloud-fan . Auto-killing bad executors is a good idea, but it may be difficult to do that, because there are so many factors to consider, such as GC overhead, deadlock

[GitHub] [spark] zero323 commented on a change in pull request #34323: [SPARK-37042][PYTHON] Inline type hints for kinesis.py and listener.py in python/pyspark/streaming

2021-10-27 Thread GitBox
zero323 commented on a change in pull request #34323: URL: https://github.com/apache/spark/pull/34323#discussion_r737292998 ## File path: python/pyspark/streaming/kinesis.py ## @@ -34,11 +38,65 @@ def utf8_decoder(s): class KinesisUtils(object): @staticmethod -def c

[GitHub] [spark] SparkQA commented on pull request #34323: [SPARK-37042][PYTHON] Inline type hints for kinesis.py and listener.py in python/pyspark/streaming

2021-10-27 Thread GitBox
SparkQA commented on pull request #34323: URL: https://github.com/apache/spark/pull/34323#issuecomment-952738824 **[Test build #144655 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144655/testReport)** for PR 34323 at commit [`21206fe`](https://github.co

[GitHub] [spark] SparkQA removed a comment on pull request #34323: [SPARK-37042][PYTHON] Inline type hints for kinesis.py and listener.py in python/pyspark/streaming

2021-10-27 Thread GitBox
SparkQA removed a comment on pull request #34323: URL: https://github.com/apache/spark/pull/34323#issuecomment-952719129 **[Test build #144655 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144655/testReport)** for PR 34323 at commit [`21206fe`](https://gi

[GitHub] [spark] SparkQA commented on pull request #34199: [SPARK-36935][SQL] Extend ParquetSchemaConverter to compute Parquet repetition & definition level

2021-10-27 Thread GitBox
SparkQA commented on pull request #34199: URL: https://github.com/apache/spark/pull/34199#issuecomment-952751143 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/49124/ -- This is an automated message from the Apache

[GitHub] [spark] SparkQA commented on pull request #34241: [SPARK-36975][SQL] Correct the hive client calls‘s metrics in HiveClientImpl

2021-10-27 Thread GitBox
SparkQA commented on pull request #34241: URL: https://github.com/apache/spark/pull/34241#issuecomment-952751690 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/49123/ -- This is an automated message from the Apache

[GitHub] [spark] SparkQA commented on pull request #34323: [SPARK-37042][PYTHON] Inline type hints for kinesis.py and listener.py in python/pyspark/streaming

2021-10-27 Thread GitBox
SparkQA commented on pull request #34323: URL: https://github.com/apache/spark/pull/34323#issuecomment-952757321 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/49125/ -- This is an automated message from the Apache

[GitHub] [spark] AmplabJenkins commented on pull request #34323: [SPARK-37042][PYTHON] Inline type hints for kinesis.py and listener.py in python/pyspark/streaming

2021-10-27 Thread GitBox
AmplabJenkins commented on pull request #34323: URL: https://github.com/apache/spark/pull/34323#issuecomment-952757436 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144655/ -- This

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34323: [SPARK-37042][PYTHON] Inline type hints for kinesis.py and listener.py in python/pyspark/streaming

2021-10-27 Thread GitBox
AmplabJenkins removed a comment on pull request #34323: URL: https://github.com/apache/spark/pull/34323#issuecomment-952757436 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144655/ -

[GitHub] [spark] HyukjinKwon commented on pull request #32044: [SPARK-34950][TESTS] Update benchmark results to the ones created by GitHub Actions machines

2021-10-27 Thread GitBox
HyukjinKwon commented on pull request #32044: URL: https://github.com/apache/spark/pull/32044#issuecomment-952776553 Yes, they all should generate the files for JDK 11. If they don't, it's a bug. Yes, we should have another set of these benchmark result files for JDK 17 separately

[GitHub] [spark] HyukjinKwon commented on pull request #32583: [SPARK-35437][SQL] Use expressions to filter Hive partitions at client side

2021-10-27 Thread GitBox
HyukjinKwon commented on pull request #32583: URL: https://github.com/apache/spark/pull/32583#issuecomment-952777556 No problem. Thanks for bearing with my quick reverting 👍 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and

[GitHub] [spark] bjornjorgensen commented on a change in pull request #34389: [SPARK-37036][PYTHON] Add util function to raise advice warning for pandas API on Spark.

2021-10-27 Thread GitBox
bjornjorgensen commented on a change in pull request #34389: URL: https://github.com/apache/spark/pull/34389#discussion_r737336464 ## File path: python/pyspark/pandas/groupby.py ## @@ -1199,6 +1200,10 @@ def pandas_apply(pdf: pd.DataFrame, *a: Any, **k: Any) -> Any:

[GitHub] [spark] bjornjorgensen commented on a change in pull request #34389: [SPARK-37036][PYTHON] Add util function to raise advice warning for pandas API on Spark.

2021-10-27 Thread GitBox
bjornjorgensen commented on a change in pull request #34389: URL: https://github.com/apache/spark/pull/34389#discussion_r737338014 ## File path: python/pyspark/pandas/indexes/base.py ## @@ -1553,6 +1558,9 @@ def sort_values(self, ascending: bool = True) -> "Index":

[GitHub] [spark] SparkQA commented on pull request #34199: [SPARK-36935][SQL] Extend ParquetSchemaConverter to compute Parquet repetition & definition level

2021-10-27 Thread GitBox
SparkQA commented on pull request #34199: URL: https://github.com/apache/spark/pull/34199#issuecomment-952787317 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/49124/ -- This is an automated message from the A

[GitHub] [spark] SparkQA commented on pull request #34241: [SPARK-36975][SQL] Correct the hive client calls‘s metrics in HiveClientImpl

2021-10-27 Thread GitBox
SparkQA commented on pull request #34241: URL: https://github.com/apache/spark/pull/34241#issuecomment-952788425 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/49123/ -- This is an automated message from the A

[GitHub] [spark] SparkQA commented on pull request #34399: [SPARK-37031][SQL][TESTS][FOLLOWUP] Add a missing test to DescribeNamespaceSuite

2021-10-27 Thread GitBox
SparkQA commented on pull request #34399: URL: https://github.com/apache/spark/pull/34399#issuecomment-952789802 **[Test build #144645 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144645/testReport)** for PR 34399 at commit [`4110c95`](https://github.co

[GitHub] [spark] SparkQA commented on pull request #34323: [SPARK-37042][PYTHON] Inline type hints for kinesis.py and listener.py in python/pyspark/streaming

2021-10-27 Thread GitBox
SparkQA commented on pull request #34323: URL: https://github.com/apache/spark/pull/34323#issuecomment-952792927 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/49125/ -- This is an automated message from the A

[GitHub] [spark] codecov-commenter commented on pull request #34241: [SPARK-36975][SQL] Correct the hive client calls‘s metrics in HiveClientImpl

2021-10-27 Thread GitBox
codecov-commenter commented on pull request #34241: URL: https://github.com/apache/spark/pull/34241#issuecomment-952799438 # [Codecov](https://codecov.io/gh/apache/spark/pull/34241?src=pr&el=h1&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=The+A

[GitHub] [spark] codecov-commenter edited a comment on pull request #34241: [SPARK-36975][SQL] Correct the hive client calls‘s metrics in HiveClientImpl

2021-10-27 Thread GitBox
codecov-commenter edited a comment on pull request #34241: URL: https://github.com/apache/spark/pull/34241#issuecomment-952799438 # [Codecov](https://codecov.io/gh/apache/spark/pull/34241?src=pr&el=h1&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_ter

[GitHub] [spark] SparkQA removed a comment on pull request #34399: [SPARK-37031][SQL][TESTS][FOLLOWUP] Add a missing test to DescribeNamespaceSuite

2021-10-27 Thread GitBox
SparkQA removed a comment on pull request #34399: URL: https://github.com/apache/spark/pull/34399#issuecomment-952569275 **[Test build #144645 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144645/testReport)** for PR 34399 at commit [`4110c95`](https://gi

[GitHub] [spark] codecov-commenter edited a comment on pull request #34241: [SPARK-36975][SQL] Correct the hive client calls‘s metrics in HiveClientImpl

2021-10-27 Thread GitBox
codecov-commenter edited a comment on pull request #34241: URL: https://github.com/apache/spark/pull/34241#issuecomment-952799438 # [Codecov](https://codecov.io/gh/apache/spark/pull/34241?src=pr&el=h1&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_ter

[GitHub] [spark] AmplabJenkins commented on pull request #34399: [SPARK-37031][SQL][TESTS][FOLLOWUP] Add a missing test to DescribeNamespaceSuite

2021-10-27 Thread GitBox
AmplabJenkins commented on pull request #34399: URL: https://github.com/apache/spark/pull/34399#issuecomment-952807013 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144645/ -- This

[GitHub] [spark] AmplabJenkins commented on pull request #34241: [SPARK-36975][SQL] Correct the hive client calls‘s metrics in HiveClientImpl

2021-10-27 Thread GitBox
AmplabJenkins commented on pull request #34241: URL: https://github.com/apache/spark/pull/34241#issuecomment-952807015 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/49123/ -- T

[GitHub] [spark] AmplabJenkins commented on pull request #34199: [SPARK-36935][SQL] Extend ParquetSchemaConverter to compute Parquet repetition & definition level

2021-10-27 Thread GitBox
AmplabJenkins commented on pull request #34199: URL: https://github.com/apache/spark/pull/34199#issuecomment-952807014 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/49124/ -- T

[GitHub] [spark] AmplabJenkins commented on pull request #34323: [SPARK-37042][PYTHON] Inline type hints for kinesis.py and listener.py in python/pyspark/streaming

2021-10-27 Thread GitBox
AmplabJenkins commented on pull request #34323: URL: https://github.com/apache/spark/pull/34323#issuecomment-952807016 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/49125/ -- T

  1   2   3   4   5   6   7   >