[GitHub] [spark] AmplabJenkins commented on pull request #34338: [SPARK-37067][SQL] Use ZoneId.of() to handle timezone string in DatetimeUtils

2021-10-21 Thread GitBox
AmplabJenkins commented on pull request #34338: URL: https://github.com/apache/spark/pull/34338#issuecomment-948578408 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/48974/ --

[GitHub] [spark] SparkQA removed a comment on pull request #34241: [SPARK-36975][SQL] Correct the hive client calls‘s metrics in HiveClientImpl

2021-10-21 Thread GitBox
SparkQA removed a comment on pull request #34241: URL: https://github.com/apache/spark/pull/34241#issuecomment-948485880 **[Test build #144505 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144505/testReport)** for PR 34241 at commit

[GitHub] [spark] SparkQA commented on pull request #34338: [SPARK-37067][SQL] Use ZoneId.of() to handle timezone string in DatetimeUtils

2021-10-21 Thread GitBox
SparkQA commented on pull request #34338: URL: https://github.com/apache/spark/pull/34338#issuecomment-948564087 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/48974/ -- This is an automated message from the

[GitHub] [spark] SparkQA commented on pull request #34296: [SPARK-36989][TESTS][PYTHON] Add type hints data tests

2021-10-21 Thread GitBox
SparkQA commented on pull request #34296: URL: https://github.com/apache/spark/pull/34296#issuecomment-948562372 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/48976/ -- This is an automated message from the

[GitHub] [spark] SparkQA commented on pull request #34241: [SPARK-36975][SQL] Correct the hive client calls‘s metrics in HiveClientImpl

2021-10-21 Thread GitBox
SparkQA commented on pull request #34241: URL: https://github.com/apache/spark/pull/34241#issuecomment-948561622 **[Test build #144505 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144505/testReport)** for PR 34241 at commit

[GitHub] [spark] SparkQA commented on pull request #34241: [SPARK-36975][SQL] Correct the hive client calls‘s metrics in HiveClientImpl

2021-10-21 Thread GitBox
SparkQA commented on pull request #34241: URL: https://github.com/apache/spark/pull/34241#issuecomment-948560996 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/48977/ -- This is an automated message from the

[GitHub] [spark] SparkQA commented on pull request #34337: [SPARK-37066][SQL] Improve error message to show file path when failed to read next file

2021-10-21 Thread GitBox
SparkQA commented on pull request #34337: URL: https://github.com/apache/spark/pull/34337#issuecomment-948557823 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/48975/ -- This is an automated message from the

[GitHub] [spark] wankunde commented on a change in pull request #34234: [SPARK-36967][CORE] Report accurate shuffle block size if its skewed

2021-10-21 Thread GitBox
wankunde commented on a change in pull request #34234: URL: https://github.com/apache/spark/pull/34234#discussion_r733611640 ## File path: core/src/main/scala/org/apache/spark/scheduler/MapStatus.scala ## @@ -255,9 +255,24 @@ private[spark] object HighlyCompressedMapStatus {

[GitHub] [spark] LuciferYang opened a new pull request #34355: [SPARK-37070][TEST] Pass all UTs in `mllib-local` and `mllib` with Java 17

2021-10-21 Thread GitBox
LuciferYang opened a new pull request #34355: URL: https://github.com/apache/spark/pull/34355 ### What changes were proposed in this pull request? `Mockito` can't mock `j.u.Random` with Java 17 due to `module java.base does not export jdk.internal.util.random to unnamed module` and

[GitHub] [spark] wankunde commented on pull request #34234: [SPARK-36967][CORE] Report accurate shuffle block size if its skewed

2021-10-21 Thread GitBox
wankunde commented on pull request #34234: URL: https://github.com/apache/spark/pull/34234#issuecomment-948549219 Hi, @Ngone51 @JoeyValentine @mridulm I add a parameter to limit the number of reported shuffle blocks if there are too many huge skewed blocks. I think this is also

[GitHub] [spark] AmplabJenkins commented on pull request #34291: [SPARK-37020][SQL] DS V2 LIMIT push down

2021-10-21 Thread GitBox
AmplabJenkins commented on pull request #34291: URL: https://github.com/apache/spark/pull/34291#issuecomment-948539187 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144493/ -- This

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34291: [SPARK-37020][SQL] DS V2 LIMIT push down

2021-10-21 Thread GitBox
AmplabJenkins removed a comment on pull request #34291: URL: https://github.com/apache/spark/pull/34291#issuecomment-948539187 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144493/

[GitHub] [spark] SparkQA removed a comment on pull request #34291: [SPARK-37020][SQL] DS V2 LIMIT push down

2021-10-21 Thread GitBox
SparkQA removed a comment on pull request #34291: URL: https://github.com/apache/spark/pull/34291#issuecomment-948306406 **[Test build #144493 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144493/testReport)** for PR 34291 at commit

[GitHub] [spark] SparkQA commented on pull request #34291: [SPARK-37020][SQL] DS V2 LIMIT push down

2021-10-21 Thread GitBox
SparkQA commented on pull request #34291: URL: https://github.com/apache/spark/pull/34291#issuecomment-948537362 **[Test build #144493 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144493/testReport)** for PR 34291 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34313: [SPARK-37013][SQL] Forbid `%0$` usage explicitly to ensure `format_string` has same behavior when using Java 8 and Java 17

2021-10-21 Thread GitBox
AmplabJenkins removed a comment on pull request #34313: URL: https://github.com/apache/spark/pull/34313#issuecomment-948529538 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144491/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34324: [SPARK-37015][PYTHON] Inline type hints for python/pyspark/streaming/dstream.py

2021-10-21 Thread GitBox
AmplabJenkins removed a comment on pull request #34324: URL: https://github.com/apache/spark/pull/34324#issuecomment-948529535 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/48973/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34346: [SPARK-36645][SQL][FOLLOWUP] Disable min/max push down for Parquet Binary

2021-10-21 Thread GitBox
AmplabJenkins removed a comment on pull request #34346: URL: https://github.com/apache/spark/pull/34346#issuecomment-948529537 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144495/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34354: [WIP][SPARK-37085][PYTHON][SQL] Add list/tuple overloads to array, struct, create_map, map_concat

2021-10-21 Thread GitBox
AmplabJenkins removed a comment on pull request #34354: URL: https://github.com/apache/spark/pull/34354#issuecomment-948529534 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/48972/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34308: [SPARK-37035][SQL] Improve error message when use parquet vectorize reader

2021-10-21 Thread GitBox
AmplabJenkins removed a comment on pull request #34308: URL: https://github.com/apache/spark/pull/34308#issuecomment-948529536 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144492/

[GitHub] [spark] AmplabJenkins commented on pull request #34354: [WIP][SPARK-37085][PYTHON][SQL] Add list/tuple overloads to array, struct, create_map, map_concat

2021-10-21 Thread GitBox
AmplabJenkins commented on pull request #34354: URL: https://github.com/apache/spark/pull/34354#issuecomment-948529534 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/48972/ --

[GitHub] [spark] AmplabJenkins commented on pull request #34346: [SPARK-36645][SQL][FOLLOWUP] Disable min/max push down for Parquet Binary

2021-10-21 Thread GitBox
AmplabJenkins commented on pull request #34346: URL: https://github.com/apache/spark/pull/34346#issuecomment-948529537 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144495/ -- This

[GitHub] [spark] AmplabJenkins commented on pull request #34308: [SPARK-37035][SQL] Improve error message when use parquet vectorize reader

2021-10-21 Thread GitBox
AmplabJenkins commented on pull request #34308: URL: https://github.com/apache/spark/pull/34308#issuecomment-948529536 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144492/ -- This

[GitHub] [spark] AmplabJenkins commented on pull request #34313: [SPARK-37013][SQL] Forbid `%0$` usage explicitly to ensure `format_string` has same behavior when using Java 8 and Java 17

2021-10-21 Thread GitBox
AmplabJenkins commented on pull request #34313: URL: https://github.com/apache/spark/pull/34313#issuecomment-948529538 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144491/ -- This

[GitHub] [spark] AmplabJenkins commented on pull request #34324: [SPARK-37015][PYTHON] Inline type hints for python/pyspark/streaming/dstream.py

2021-10-21 Thread GitBox
AmplabJenkins commented on pull request #34324: URL: https://github.com/apache/spark/pull/34324#issuecomment-948529535 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/48973/ --

[GitHub] [spark] SparkQA commented on pull request #34296: [SPARK-36989][TESTS][PYTHON] Add type hints data tests

2021-10-21 Thread GitBox
SparkQA commented on pull request #34296: URL: https://github.com/apache/spark/pull/34296#issuecomment-948526591 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/48976/ -- This is an automated message from the Apache

[GitHub] [spark] SparkQA removed a comment on pull request #34346: [SPARK-36645][SQL][FOLLOWUP] Disable min/max push down for Parquet Binary

2021-10-21 Thread GitBox
SparkQA removed a comment on pull request #34346: URL: https://github.com/apache/spark/pull/34346#issuecomment-948317076 **[Test build #144495 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144495/testReport)** for PR 34346 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #34308: [SPARK-37035][SQL] Improve error message when use parquet vectorize reader

2021-10-21 Thread GitBox
SparkQA removed a comment on pull request #34308: URL: https://github.com/apache/spark/pull/34308#issuecomment-948306363 **[Test build #144492 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144492/testReport)** for PR 34308 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #34313: [SPARK-37013][SQL] Forbid `%0$` usage explicitly to ensure `format_string` has same behavior when using Java 8 and Java 17

2021-10-21 Thread GitBox
SparkQA removed a comment on pull request #34313: URL: https://github.com/apache/spark/pull/34313#issuecomment-948280989 **[Test build #144491 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144491/testReport)** for PR 34313 at commit

[GitHub] [spark] SparkQA commented on pull request #34241: [SPARK-36975][SQL] Correct the hive client calls‘s metrics in HiveClientImpl

2021-10-21 Thread GitBox
SparkQA commented on pull request #34241: URL: https://github.com/apache/spark/pull/34241#issuecomment-948525412 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/48977/ -- This is an automated message from the Apache

[GitHub] [spark] SparkQA commented on pull request #34308: [SPARK-37035][SQL] Improve error message when use parquet vectorize reader

2021-10-21 Thread GitBox
SparkQA commented on pull request #34308: URL: https://github.com/apache/spark/pull/34308#issuecomment-948523701 **[Test build #144492 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144492/testReport)** for PR 34308 at commit

[GitHub] [spark] SparkQA commented on pull request #34346: [SPARK-36645][SQL][FOLLOWUP] Disable min/max push down for Parquet Binary

2021-10-21 Thread GitBox
SparkQA commented on pull request #34346: URL: https://github.com/apache/spark/pull/34346#issuecomment-948523473 **[Test build #144495 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144495/testReport)** for PR 34346 at commit

[GitHub] [spark] SparkQA commented on pull request #34337: [SPARK-37066][SQL] Improve error message to show file path when failed to read next file

2021-10-21 Thread GitBox
SparkQA commented on pull request #34337: URL: https://github.com/apache/spark/pull/34337#issuecomment-948522794 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/48975/ -- This is an automated message from the Apache

[GitHub] [spark] SparkQA commented on pull request #34354: [WIP][SPARK-37085][PYTHON][SQL] Add list/tuple overloads to array, struct, create_map, map_concat

2021-10-21 Thread GitBox
SparkQA commented on pull request #34354: URL: https://github.com/apache/spark/pull/34354#issuecomment-948520742 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/48972/ -- This is an automated message from the

[GitHub] [spark] SparkQA commented on pull request #34338: [SPARK-37067][SQL] Use ZoneId.of() to handle timezone string in DatetimeUtils

2021-10-21 Thread GitBox
SparkQA commented on pull request #34338: URL: https://github.com/apache/spark/pull/34338#issuecomment-948518592 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/48974/ -- This is an automated message from the Apache

[GitHub] [spark] SparkQA commented on pull request #34324: [SPARK-37015][PYTHON] Inline type hints for python/pyspark/streaming/dstream.py

2021-10-21 Thread GitBox
SparkQA commented on pull request #34324: URL: https://github.com/apache/spark/pull/34324#issuecomment-948516075 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/48973/ -- This is an automated message from the

[GitHub] [spark] SparkQA commented on pull request #34313: [SPARK-37013][SQL] Forbid `%0$` usage explicitly to ensure `format_string` has same behavior when using Java 8 and Java 17

2021-10-21 Thread GitBox
SparkQA commented on pull request #34313: URL: https://github.com/apache/spark/pull/34313#issuecomment-948500775 **[Test build #144491 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144491/testReport)** for PR 34313 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34352: [SPARK-37018][SQL] Spark SQL should support create function with Aggregator

2021-10-21 Thread GitBox
AmplabJenkins removed a comment on pull request #34352: URL: https://github.com/apache/spark/pull/34352#issuecomment-948486976 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144497/

[GitHub] [spark] SparkQA commented on pull request #34352: [SPARK-37018][SQL] Spark SQL should support create function with Aggregator

2021-10-21 Thread GitBox
SparkQA commented on pull request #34352: URL: https://github.com/apache/spark/pull/34352#issuecomment-948486712 **[Test build #144497 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144497/testReport)** for PR 34352 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #34352: [SPARK-37018][SQL] Spark SQL should support create function with Aggregator

2021-10-21 Thread GitBox
SparkQA removed a comment on pull request #34352: URL: https://github.com/apache/spark/pull/34352#issuecomment-948392347 **[Test build #144497 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144497/testReport)** for PR 34352 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #34352: [SPARK-37018][SQL] Spark SQL should support create function with Aggregator

2021-10-21 Thread GitBox
AmplabJenkins commented on pull request #34352: URL: https://github.com/apache/spark/pull/34352#issuecomment-948486976 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144497/ -- This

[GitHub] [spark] SparkQA commented on pull request #34296: [SPARK-36989][TESTS][PYTHON] Add type hints data tests

2021-10-21 Thread GitBox
SparkQA commented on pull request #34296: URL: https://github.com/apache/spark/pull/34296#issuecomment-948485764 **[Test build #144504 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144504/testReport)** for PR 34296 at commit

[GitHub] [spark] SparkQA commented on pull request #34241: [SPARK-36975][SQL] Correct the hive client calls‘s metrics in HiveClientImpl

2021-10-21 Thread GitBox
SparkQA commented on pull request #34241: URL: https://github.com/apache/spark/pull/34241#issuecomment-948485880 **[Test build #144505 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144505/testReport)** for PR 34241 at commit

[GitHub] [spark] SparkQA commented on pull request #34337: [SPARK-37066][SQL] Improve error message to show file path when failed to read next file

2021-10-21 Thread GitBox
SparkQA commented on pull request #34337: URL: https://github.com/apache/spark/pull/34337#issuecomment-948485642 **[Test build #144503 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144503/testReport)** for PR 34337 at commit

[GitHub] [spark] SparkQA commented on pull request #34338: [SPARK-37067][SQL] Use ZoneId.of() to handle timezone string in DatetimeUtils

2021-10-21 Thread GitBox
SparkQA commented on pull request #34338: URL: https://github.com/apache/spark/pull/34338#issuecomment-948485617 **[Test build #144502 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144502/testReport)** for PR 34338 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34354: [WIP][SPARK-37085][PYTHON][SQL] Add list/tuple overloads to array, struct, create_map, map_concat

2021-10-21 Thread GitBox
AmplabJenkins removed a comment on pull request #34354: URL: https://github.com/apache/spark/pull/34354#issuecomment-948485092 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144500/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34338: [SPARK-37067][SQL] Use ZoneId.of() to handle timezone string in DatetimeUtils

2021-10-21 Thread GitBox
AmplabJenkins removed a comment on pull request #34338: URL: https://github.com/apache/spark/pull/34338#issuecomment-948485095 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144490/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34352: [SPARK-37018][SQL] Spark SQL should support create function with Aggregator

2021-10-21 Thread GitBox
AmplabJenkins removed a comment on pull request #34352: URL: https://github.com/apache/spark/pull/34352#issuecomment-948485088 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/48969/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34350: [SPARK-37081][SQL][TESTS] Upgrade the version of RDBMS and corresponding JDBC drivers used by docker-integration-tests

2021-10-21 Thread GitBox
AmplabJenkins removed a comment on pull request #34350: URL: https://github.com/apache/spark/pull/34350#issuecomment-948485089 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144496/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34241: [SPARK-36975][SQL] Correct the hive client calls‘s metrics in HiveClientImpl

2021-10-21 Thread GitBox
AmplabJenkins removed a comment on pull request #34241: URL: https://github.com/apache/spark/pull/34241#issuecomment-948485085 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34324: [SPARK-37015][PYTHON] Inline type hints for python/pyspark/streaming/dstream.py

2021-10-21 Thread GitBox
AmplabJenkins removed a comment on pull request #34324: URL: https://github.com/apache/spark/pull/34324#issuecomment-948485083 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [spark] AmplabJenkins commented on pull request #34352: [SPARK-37018][SQL] Spark SQL should support create function with Aggregator

2021-10-21 Thread GitBox
AmplabJenkins commented on pull request #34352: URL: https://github.com/apache/spark/pull/34352#issuecomment-948485088 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/48969/ --

[GitHub] [spark] AmplabJenkins commented on pull request #34350: [SPARK-37081][SQL][TESTS] Upgrade the version of RDBMS and corresponding JDBC drivers used by docker-integration-tests

2021-10-21 Thread GitBox
AmplabJenkins commented on pull request #34350: URL: https://github.com/apache/spark/pull/34350#issuecomment-948485089 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144496/ -- This

[GitHub] [spark] AmplabJenkins commented on pull request #34354: [WIP][SPARK-37085][PYTHON][SQL] Add list/tuple overloads to array, struct, create_map, map_concat

2021-10-21 Thread GitBox
AmplabJenkins commented on pull request #34354: URL: https://github.com/apache/spark/pull/34354#issuecomment-948485092 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144500/ -- This

[GitHub] [spark] AmplabJenkins commented on pull request #34338: [SPARK-37067][SQL] Use ZoneId.of() to handle timezone string in DatetimeUtils

2021-10-21 Thread GitBox
AmplabJenkins commented on pull request #34338: URL: https://github.com/apache/spark/pull/34338#issuecomment-948485095 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144490/ -- This

[GitHub] [spark] AmplabJenkins commented on pull request #34324: [SPARK-37015][PYTHON] Inline type hints for python/pyspark/streaming/dstream.py

2021-10-21 Thread GitBox
AmplabJenkins commented on pull request #34324: URL: https://github.com/apache/spark/pull/34324#issuecomment-948485083 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To

[GitHub] [spark] AmplabJenkins commented on pull request #34241: [SPARK-36975][SQL] Correct the hive client calls‘s metrics in HiveClientImpl

2021-10-21 Thread GitBox
AmplabJenkins commented on pull request #34241: URL: https://github.com/apache/spark/pull/34241#issuecomment-948485085 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To

[GitHub] [spark] SparkQA removed a comment on pull request #34354: [WIP][SPARK-37085][PYTHON][SQL] Add list/tuple overloads to array, struct, create_map, map_concat

2021-10-21 Thread GitBox
SparkQA removed a comment on pull request #34354: URL: https://github.com/apache/spark/pull/34354#issuecomment-948441267 **[Test build #144500 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144500/testReport)** for PR 34354 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #34324: [SPARK-37015][PYTHON] Inline type hints for python/pyspark/streaming/dstream.py

2021-10-21 Thread GitBox
SparkQA removed a comment on pull request #34324: URL: https://github.com/apache/spark/pull/34324#issuecomment-948441375 **[Test build #144501 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144501/testReport)** for PR 34324 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #34241: [SPARK-36975][SQL] Correct the hive client calls‘s metrics in HiveClientImpl

2021-10-21 Thread GitBox
SparkQA removed a comment on pull request #34241: URL: https://github.com/apache/spark/pull/34241#issuecomment-948392519 **[Test build #144498 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144498/testReport)** for PR 34241 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #34350: [SPARK-37081][SQL][TESTS] Upgrade the version of RDBMS and corresponding JDBC drivers used by docker-integration-tests

2021-10-21 Thread GitBox
SparkQA removed a comment on pull request #34350: URL: https://github.com/apache/spark/pull/34350#issuecomment-948346877 **[Test build #144496 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144496/testReport)** for PR 34350 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #34338: [SPARK-37067][SQL] Use ZoneId.of() to handle timezone string in DatetimeUtils

2021-10-21 Thread GitBox
SparkQA removed a comment on pull request #34338: URL: https://github.com/apache/spark/pull/34338#issuecomment-948276447 **[Test build #144490 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144490/testReport)** for PR 34338 at commit

[GitHub] [spark] SparkQA commented on pull request #34354: [WIP][SPARK-37085][PYTHON][SQL] Add list/tuple overloads to array, struct, create_map, map_concat

2021-10-21 Thread GitBox
SparkQA commented on pull request #34354: URL: https://github.com/apache/spark/pull/34354#issuecomment-948478104 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/48972/ -- This is an automated message from the Apache

[GitHub] [spark] SparkQA commented on pull request #34241: [SPARK-36975][SQL] Correct the hive client calls‘s metrics in HiveClientImpl

2021-10-21 Thread GitBox
SparkQA commented on pull request #34241: URL: https://github.com/apache/spark/pull/34241#issuecomment-948476254 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/48971/ -- This is an automated message from the

[GitHub] [spark] SparkQA commented on pull request #34324: [SPARK-37015][PYTHON] Inline type hints for python/pyspark/streaming/dstream.py

2021-10-21 Thread GitBox
SparkQA commented on pull request #34324: URL: https://github.com/apache/spark/pull/34324#issuecomment-948476719 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/48973/ -- This is an automated message from the Apache

[GitHub] [spark] AngersZhuuuu commented on a change in pull request #34337: [SPARK-37066][SQL] Improve error message to show file path when failed to read next file

2021-10-21 Thread GitBox
AngersZhuuuu commented on a change in pull request #34337: URL: https://github.com/apache/spark/pull/34337#discussion_r733541218 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/FileScanRDD.scala ## @@ -177,6 +177,10 @@ class FileScanRDD(

[GitHub] [spark] SparkQA commented on pull request #34352: [SPARK-37018][SQL] Spark SQL should support create function with Aggregator

2021-10-21 Thread GitBox
SparkQA commented on pull request #34352: URL: https://github.com/apache/spark/pull/34352#issuecomment-948475766 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/48969/ -- This is an automated message from the

[GitHub] [spark] SparkQA commented on pull request #34350: [SPARK-37081][SQL][TESTS] Upgrade the version of RDBMS and corresponding JDBC drivers used by docker-integration-tests

2021-10-21 Thread GitBox
SparkQA commented on pull request #34350: URL: https://github.com/apache/spark/pull/34350#issuecomment-948474456 **[Test build #144496 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144496/testReport)** for PR 34350 at commit

[GitHub] [spark] SparkQA commented on pull request #34324: [SPARK-37015][PYTHON] Inline type hints for python/pyspark/streaming/dstream.py

2021-10-21 Thread GitBox
SparkQA commented on pull request #34324: URL: https://github.com/apache/spark/pull/34324#issuecomment-948467372 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/48970/ -- This is an automated message from the

[GitHub] [spark] SparkQA commented on pull request #34354: [WIP][SPARK-37085][PYTHON][SQL] Add list/tuple overloads to array, struct, create_map, map_concat

2021-10-21 Thread GitBox
SparkQA commented on pull request #34354: URL: https://github.com/apache/spark/pull/34354#issuecomment-948463850 **[Test build #144500 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144500/testReport)** for PR 34354 at commit

[GitHub] [spark] SparkQA commented on pull request #34338: [SPARK-37067][SQL] Use ZoneId.of() to handle timezone string in DatetimeUtils

2021-10-21 Thread GitBox
SparkQA commented on pull request #34338: URL: https://github.com/apache/spark/pull/34338#issuecomment-948463781 **[Test build #144490 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144490/testReport)** for PR 34338 at commit

[GitHub] [spark] SparkQA commented on pull request #34324: [SPARK-37015][PYTHON] Inline type hints for python/pyspark/streaming/dstream.py

2021-10-21 Thread GitBox
SparkQA commented on pull request #34324: URL: https://github.com/apache/spark/pull/34324#issuecomment-948463263 **[Test build #144501 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144501/testReport)** for PR 34324 at commit

[GitHub] [spark] SparkQA commented on pull request #34241: [SPARK-36975][SQL] Correct the hive client calls‘s metrics in HiveClientImpl

2021-10-21 Thread GitBox
SparkQA commented on pull request #34241: URL: https://github.com/apache/spark/pull/34241#issuecomment-948459864 **[Test build #144498 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144498/testReport)** for PR 34241 at commit

[GitHub] [spark] linhongliu-db commented on pull request #34338: [SPARK-37067][SQL] Use ZoneId.of() to handle timezone string in DatetimeUtils

2021-10-21 Thread GitBox
linhongliu-db commented on pull request #34338: URL: https://github.com/apache/spark/pull/34338#issuecomment-948457573 cc @cloud-fan -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [spark] PengleiShi commented on a change in pull request #33914: [SPARK-32268][SQL] Dynamic bloom filter join pruning

2021-10-21 Thread GitBox
PengleiShi commented on a change in pull request #33914: URL: https://github.com/apache/spark/pull/33914#discussion_r733515433 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/dynamicpruning/DynamicBloomFilterPruning.scala ## @@ -0,0 +1,191 @@ +/* + *

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34324: [SPARK-37015][PYTHON] Inline type hints for python/pyspark/streaming/dstream.py

2021-10-21 Thread GitBox
AmplabJenkins removed a comment on pull request #34324: URL: https://github.com/apache/spark/pull/34324#issuecomment-948407202 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144499/

[GitHub] [spark] SparkQA commented on pull request #34354: [WIP][SPARK-37085][PYTHON][SQL] Add list/tuple overloads to array, struct, create_map, map_concat

2021-10-21 Thread GitBox
SparkQA commented on pull request #34354: URL: https://github.com/apache/spark/pull/34354#issuecomment-948441267 **[Test build #144500 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144500/testReport)** for PR 34354 at commit

[GitHub] [spark] SparkQA commented on pull request #34324: [SPARK-37015][PYTHON] Inline type hints for python/pyspark/streaming/dstream.py

2021-10-21 Thread GitBox
SparkQA commented on pull request #34324: URL: https://github.com/apache/spark/pull/34324#issuecomment-948441375 **[Test build #144501 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144501/testReport)** for PR 34324 at commit

[GitHub] [spark] SparkQA commented on pull request #34241: [SPARK-36975][SQL] Correct the hive client calls‘s metrics in HiveClientImpl

2021-10-21 Thread GitBox
SparkQA commented on pull request #34241: URL: https://github.com/apache/spark/pull/34241#issuecomment-948440226 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/48971/ -- This is an automated message from the Apache

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34346: [SPARK-36645][SQL][FOLLOWUP] Disable min/max push down for Parquet Binary

2021-10-21 Thread GitBox
AmplabJenkins removed a comment on pull request #34346: URL: https://github.com/apache/spark/pull/34346#issuecomment-948438777 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/48968/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33828: [SPARK-36579][CORE][SQL] Make spark source stagingDir can be customized

2021-10-21 Thread GitBox
AmplabJenkins removed a comment on pull request #33828: URL: https://github.com/apache/spark/pull/33828#issuecomment-948438779 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144494/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34350: [SPARK-37081][SQL][TESTS] Upgrade the version of RDBMS and corresponding JDBC drivers used by docker-integration-tests

2021-10-21 Thread GitBox
AmplabJenkins removed a comment on pull request #34350: URL: https://github.com/apache/spark/pull/34350#issuecomment-948438780 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/48967/

[GitHub] [spark] AmplabJenkins commented on pull request #34346: [SPARK-36645][SQL][FOLLOWUP] Disable min/max push down for Parquet Binary

2021-10-21 Thread GitBox
AmplabJenkins commented on pull request #34346: URL: https://github.com/apache/spark/pull/34346#issuecomment-948438777 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/48968/ --

[GitHub] [spark] AmplabJenkins commented on pull request #33828: [SPARK-36579][CORE][SQL] Make spark source stagingDir can be customized

2021-10-21 Thread GitBox
AmplabJenkins commented on pull request #33828: URL: https://github.com/apache/spark/pull/33828#issuecomment-948438779 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144494/ -- This

[GitHub] [spark] AmplabJenkins commented on pull request #34350: [SPARK-37081][SQL][TESTS] Upgrade the version of RDBMS and corresponding JDBC drivers used by docker-integration-tests

2021-10-21 Thread GitBox
AmplabJenkins commented on pull request #34350: URL: https://github.com/apache/spark/pull/34350#issuecomment-948438780 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/48967/ --

[GitHub] [spark] SparkQA removed a comment on pull request #33828: [SPARK-36579][CORE][SQL] Make spark source stagingDir can be customized

2021-10-21 Thread GitBox
SparkQA removed a comment on pull request #33828: URL: https://github.com/apache/spark/pull/33828#issuecomment-948306867 **[Test build #144494 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144494/testReport)** for PR 33828 at commit

[GitHub] [spark] SparkQA commented on pull request #33828: [SPARK-36579][CORE][SQL] Make spark source stagingDir can be customized

2021-10-21 Thread GitBox
SparkQA commented on pull request #33828: URL: https://github.com/apache/spark/pull/33828#issuecomment-948435508 **[Test build #144494 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144494/testReport)** for PR 33828 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #34324: [SPARK-37015][PYTHON] Inline type hints for python/pyspark/streaming/dstream.py

2021-10-21 Thread GitBox
SparkQA removed a comment on pull request #34324: URL: https://github.com/apache/spark/pull/34324#issuecomment-948395660 **[Test build #144499 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144499/testReport)** for PR 34324 at commit

[GitHub] [spark] SparkQA commented on pull request #34324: [SPARK-37015][PYTHON] Inline type hints for python/pyspark/streaming/dstream.py

2021-10-21 Thread GitBox
SparkQA commented on pull request #34324: URL: https://github.com/apache/spark/pull/34324#issuecomment-948432822 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/48970/ -- This is an automated message from the Apache

[GitHub] [spark] zero323 commented on pull request #34354: [WIP][SPARK-37085][PYTHON][SQL] Add list/tuple overloads to array, struct, create_map, map_concat

2021-10-21 Thread GitBox
zero323 commented on pull request #34354: URL: https://github.com/apache/spark/pull/34354#issuecomment-948430499 New annotations are already implemented, but I think we might have to redefine `ColumnOrName` to fully support these, so I'll keep this as a draft for now. FYI

[GitHub] [spark] SparkQA commented on pull request #34352: [SPARK-37018][SQL] Spark SQL should support create function with Aggregator

2021-10-21 Thread GitBox
SparkQA commented on pull request #34352: URL: https://github.com/apache/spark/pull/34352#issuecomment-948430392 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/48969/ -- This is an automated message from the Apache

[GitHub] [spark] zero323 opened a new pull request #34354: [WIP][SPARK-37085][PYTHON][SQL] Add list/tuple overloads to array, struct, create_map, map_concat

2021-10-21 Thread GitBox
zero323 opened a new pull request #34354: URL: https://github.com/apache/spark/pull/34354 ### What changes were proposed in this pull request? This PR adds overloads to the following `pyspark.sql.functions`: - `array` - `struct` - `create_map` -

[GitHub] [spark] SparkQA commented on pull request #34346: [SPARK-36645][SQL][FOLLOWUP] Disable min/max push down for Parquet Binary

2021-10-21 Thread GitBox
SparkQA commented on pull request #34346: URL: https://github.com/apache/spark/pull/34346#issuecomment-948427391 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/48968/ -- This is an automated message from the

[GitHub] [spark] SparkQA commented on pull request #34350: [SPARK-37081][SQL][TESTS] Upgrade the version of RDBMS and corresponding JDBC drivers used by docker-integration-tests

2021-10-21 Thread GitBox
SparkQA commented on pull request #34350: URL: https://github.com/apache/spark/pull/34350#issuecomment-948414507 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/48967/ -- This is an automated message from the

[GitHub] [spark] cloud-fan commented on a change in pull request #34352: [SPARK-37018][SQL] Spark SQL should support create function with Aggregator

2021-10-21 Thread GitBox
cloud-fan commented on a change in pull request #34352: URL: https://github.com/apache/spark/pull/34352#discussion_r733476604 ## File path: sql/core/src/test/scala/org/apache/spark/sql/UDFSuite.scala ## @@ -50,6 +51,20 @@ private case class FunctionResult(f1: String, f2:

[GitHub] [spark] cloud-fan commented on a change in pull request #34352: [SPARK-37018][SQL] Spark SQL should support create function with Aggregator

2021-10-21 Thread GitBox
cloud-fan commented on a change in pull request #34352: URL: https://github.com/apache/spark/pull/34352#discussion_r733473500 ## File path: sql/core/src/main/scala/org/apache/spark/sql/internal/BaseSessionStateBuilder.scala ## @@ -410,6 +415,35 @@ class

[GitHub] [spark] cloud-fan commented on a change in pull request #34352: [SPARK-37018][SQL] Spark SQL should support create function with Aggregator

2021-10-21 Thread GitBox
cloud-fan commented on a change in pull request #34352: URL: https://github.com/apache/spark/pull/34352#discussion_r733473172 ## File path: sql/core/src/main/scala/org/apache/spark/sql/internal/BaseSessionStateBuilder.scala ## @@ -410,6 +415,35 @@ class

[GitHub] [spark] AmplabJenkins commented on pull request #34324: [SPARK-37015][PYTHON] Inline type hints for python/pyspark/streaming/dstream.py

2021-10-21 Thread GitBox
AmplabJenkins commented on pull request #34324: URL: https://github.com/apache/spark/pull/34324#issuecomment-948407202 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144499/ -- This

[GitHub] [spark] SparkQA commented on pull request #34324: [SPARK-37015][PYTHON] Inline type hints for python/pyspark/streaming/dstream.py

2021-10-21 Thread GitBox
SparkQA commented on pull request #34324: URL: https://github.com/apache/spark/pull/34324#issuecomment-948406882 **[Test build #144499 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144499/testReport)** for PR 34324 at commit

[GitHub] [spark] SparkQA commented on pull request #34324: [SPARK-37015][PYTHON] Inline type hints for python/pyspark/streaming/dstream.py

2021-10-21 Thread GitBox
SparkQA commented on pull request #34324: URL: https://github.com/apache/spark/pull/34324#issuecomment-948395660 **[Test build #144499 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144499/testReport)** for PR 34324 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34333: [SPARK-37062][SS] Introduce a new data source for providing consistent set of rows per microbatch

2021-10-21 Thread GitBox
AmplabJenkins removed a comment on pull request #34333: URL: https://github.com/apache/spark/pull/34333#issuecomment-948394208 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144487/
