[GitHub] [spark] AmplabJenkins commented on pull request #32959: [SPARK-35780][SQL] Support DATE/TIMESTAMP literals across the full range

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #32959: URL: https://github.com/apache/spark/pull/32959#issuecomment-878884339 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/45472/ --

[GitHub] [spark] AmplabJenkins commented on pull request #33297: [SPARK-36069] from_json's exception should contain field name, type and value

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #33297: URL: https://github.com/apache/spark/pull/33297#issuecomment-878884341 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/140952/ -- This

[GitHub] [spark] AmplabJenkins commented on pull request #33078: [SPARK-35546][Shuffle] Enable push-based shuffle when multiple app attempts are enabled and manage concurrent access to the state in a

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #33078: URL: https://github.com/apache/spark/pull/33078#issuecomment-878884340 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/140956/ -- This

[GitHub] [spark] SparkQA commented on pull request #33318: [SPARK-36119][SQL] Add new SQL function to_timestamp_ltz

2021-07-13 Thread GitBox
SparkQA commented on pull request #33318: URL: https://github.com/apache/spark/pull/33318#issuecomment-878884334 Kubernetes integration test unable to build dist. exiting with code: 1 URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45480/ --

[GitHub] [spark] dongjoon-hyun commented on pull request #33311: [SPARK-36109][SS][TEST] Check data after adding data to topic in KafkaSourceStressSuite

2021-07-13 Thread GitBox
dongjoon-hyun commented on pull request #33311: URL: https://github.com/apache/spark/pull/33311#issuecomment-878884115 Thank you, @viirya and @HeartSaVioR . GitHub Action passed. Merged to master/3.2. Could you make a backport PR for 3.1/3.0 because this is an ancient bug, @viirya ?

[GitHub] [spark] Ngone51 commented on a change in pull request #32401: [SPARK-35276][CORE] Calculate checksum for shuffle data and write as checksum file

2021-07-13 Thread GitBox
Ngone51 commented on a change in pull request #32401: URL: https://github.com/apache/spark/pull/32401#discussion_r668538313 ## File path: core/src/main/scala/org/apache/spark/shuffle/IndexShuffleBlockResolver.scala ## @@ -329,44 +352,111 @@ private[spark] class

[GitHub] [spark] dongjoon-hyun closed pull request #33311: [SPARK-36109][SS][TEST] Check data after adding data to topic in KafkaSourceStressSuite

2021-07-13 Thread GitBox
dongjoon-hyun closed pull request #33311: URL: https://github.com/apache/spark/pull/33311 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [spark] SparkQA commented on pull request #33311: [SPARK-36109][SS][TEST] Check data after adding data to topic in KafkaSourceStressSuite

2021-07-13 Thread GitBox
SparkQA commented on pull request #33311: URL: https://github.com/apache/spark/pull/33311#issuecomment-878880834 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45478/ -- This is an automated message from the Apache

[GitHub] [spark] SparkQA commented on pull request #33070: [SPARK-35551][SQL] Handle the COUNT bug for lateral subqueries

2021-07-13 Thread GitBox
SparkQA commented on pull request #33070: URL: https://github.com/apache/spark/pull/33070#issuecomment-878878989 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45477/ -- This is an automated message from the Apache

[GitHub] [spark] AngersZhuuuu commented on a change in pull request #33296: [SPARK-34402][SQL] Group exception about data format schema

2021-07-13 Thread GitBox
AngersZh commented on a change in pull request #33296: URL: https://github.com/apache/spark/pull/33296#discussion_r668532000 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetSchemaConverter.scala ## @@ -586,11 +586,10 @@

[GitHub] [spark] beliefer commented on a change in pull request #33296: [SPARK-34402][SQL] Group exception about data format schema

2021-07-13 Thread GitBox
beliefer commented on a change in pull request #33296: URL: https://github.com/apache/spark/pull/33296#discussion_r668529473 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetSchemaConverter.scala ## @@ -586,11 +586,10 @@

[GitHub] [spark] SparkQA commented on pull request #33258: [SPARK-36037][SQL] Support ANSI SQL LOCALTIMESTAMP datetime value function

2021-07-13 Thread GitBox
SparkQA commented on pull request #33258: URL: https://github.com/apache/spark/pull/33258#issuecomment-878874133 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45476/ -- This is an automated message from the Apache

[GitHub] [spark] SparkQA removed a comment on pull request #33297: [SPARK-36069] from_json's exception should contain field name, type and value

2021-07-13 Thread GitBox
SparkQA removed a comment on pull request #33297: URL: https://github.com/apache/spark/pull/33297#issuecomment-878749259 **[Test build #140952 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140952/testReport)** for PR 33297 at commit

[GitHub] [spark] SparkQA commented on pull request #33297: [SPARK-36069] from_json's exception should contain field name, type and value

2021-07-13 Thread GitBox
SparkQA commented on pull request #33297: URL: https://github.com/apache/spark/pull/33297#issuecomment-878869088 **[Test build #140952 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140952/testReport)** for PR 33297 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #33078: [SPARK-35546][Shuffle] Enable push-based shuffle when multiple app attempts are enabled and manage concurrent access to the state in

2021-07-13 Thread GitBox
SparkQA removed a comment on pull request #33078: URL: https://github.com/apache/spark/pull/33078#issuecomment-878790692 **[Test build #140956 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140956/testReport)** for PR 33078 at commit

[GitHub] [spark] SparkQA commented on pull request #33078: [SPARK-35546][Shuffle] Enable push-based shuffle when multiple app attempts are enabled and manage concurrent access to the state in a better

2021-07-13 Thread GitBox
SparkQA commented on pull request #33078: URL: https://github.com/apache/spark/pull/33078#issuecomment-878867711 **[Test build #140956 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140956/testReport)** for PR 33078 at commit

[GitHub] [spark] Viethd27 closed pull request #33319: issue-36096

2021-07-13 Thread GitBox
Viethd27 closed pull request #33319: URL: https://github.com/apache/spark/pull/33319 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [spark] Viethd27 opened a new pull request #33319: issue-36096

2021-07-13 Thread GitBox
Viethd27 opened a new pull request #33319: URL: https://github.com/apache/spark/pull/33319 ### What changes were proposed in this pull request? ### Why are the changes needed? ### Does this PR introduce _any_ user-facing change? ### How

[GitHub] [spark] AngersZhuuuu commented on a change in pull request #33296: [SPARK-34402][SQL] Group exception about data format schema

2021-07-13 Thread GitBox
AngersZh commented on a change in pull request #33296: URL: https://github.com/apache/spark/pull/33296#discussion_r668515375 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetSchemaConverter.scala ## @@ -586,11 +586,10 @@

[GitHub] [spark] SparkQA commented on pull request #32959: [SPARK-35780][SQL] Support DATE/TIMESTAMP literals across the full range

2021-07-13 Thread GitBox
SparkQA commented on pull request #32959: URL: https://github.com/apache/spark/pull/32959#issuecomment-878861280 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45472/ -- This is an automated message from the

[GitHub] [spark] SparkQA commented on pull request #30869: [SPARK-33865][SQL] When HiveDDL, we need check avro schema too

2021-07-13 Thread GitBox
SparkQA commented on pull request #30869: URL: https://github.com/apache/spark/pull/30869#issuecomment-878861302 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45473/ -- This is an automated message from the

[GitHub] [spark] SparkQA commented on pull request #33286: [SPARK-36079][SQL] Null-based filter estimate should always be in the range [0, 1]

2021-07-13 Thread GitBox
SparkQA commented on pull request #33286: URL: https://github.com/apache/spark/pull/33286#issuecomment-878857932 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45471/ -- This is an automated message from the

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33310: [WIP][SPARK-36105][SQL] OptimizeLocalShuffleReader support reading data of multiple mappers in one task

2021-07-13 Thread GitBox
AmplabJenkins removed a comment on pull request #33310: URL: https://github.com/apache/spark/pull/33310#issuecomment-878856721 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/140951/

[GitHub] [spark] AmplabJenkins commented on pull request #33310: [WIP][SPARK-36105][SQL] OptimizeLocalShuffleReader support reading data of multiple mappers in one task

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #33310: URL: https://github.com/apache/spark/pull/33310#issuecomment-878856721 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/140951/ -- This

[GitHub] [spark] gengliangwang commented on pull request #33318: [SPARK-36119][SQL] Add new SQL function to_timestamp_ltz

2021-07-13 Thread GitBox
gengliangwang commented on pull request #33318: URL: https://github.com/apache/spark/pull/33318#issuecomment-878856009 This is supposed to be the last for `*_ltz` functions in my plan. We will have `to_timestmap_ltz`, `make_timestamp_ltz`, and type constructor `timestmap_ltz`. -- This

[GitHub] [spark] SparkQA removed a comment on pull request #33310: [WIP][SPARK-36105][SQL] OptimizeLocalShuffleReader support reading data of multiple mappers in one task

2021-07-13 Thread GitBox
SparkQA removed a comment on pull request #33310: URL: https://github.com/apache/spark/pull/33310#issuecomment-878749275 **[Test build #140951 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140951/testReport)** for PR 33310 at commit

[GitHub] [spark] SparkQA commented on pull request #33318: [SPARK-36119][SQL] Add new SQL function to_timestamp_ltz

2021-07-13 Thread GitBox
SparkQA commented on pull request #33318: URL: https://github.com/apache/spark/pull/33318#issuecomment-878855883 **[Test build #140966 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140966/testReport)** for PR 33318 at commit

[GitHub] [spark] SparkQA commented on pull request #33310: [WIP][SPARK-36105][SQL] OptimizeLocalShuffleReader support reading data of multiple mappers in one task

2021-07-13 Thread GitBox
SparkQA commented on pull request #33310: URL: https://github.com/apache/spark/pull/33310#issuecomment-878855529 **[Test build #140951 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140951/testReport)** for PR 33310 at commit

[GitHub] [spark] gengliangwang opened a new pull request #33318: [SPARK-36119][SQL] Add new SQL function to_timestamp_ltz

2021-07-13 Thread GitBox
gengliangwang opened a new pull request #33318: URL: https://github.com/apache/spark/pull/33318 ### What changes were proposed in this pull request? Add new SQL function `to_timestamp_ltz` syntax: ``` to_timestamp_ltz(timestamp_str_column[, fmt])

[GitHub] [spark] SparkQA commented on pull request #33258: [SPARK-36037][SQL] Support ANSI SQL LOCALTIMESTAMP datetime value function

2021-07-13 Thread GitBox
SparkQA commented on pull request #33258: URL: https://github.com/apache/spark/pull/33258#issuecomment-878853535 **[Test build #140965 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140965/testReport)** for PR 33258 at commit

[GitHub] [spark] SparkQA commented on pull request #33311: [SPARK-36109][SS][TEST] Check data after adding data to topic in KafkaSourceStressSuite

2021-07-13 Thread GitBox
SparkQA commented on pull request #33311: URL: https://github.com/apache/spark/pull/33311#issuecomment-878853398 **[Test build #140964 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140964/testReport)** for PR 33311 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #32959: [SPARK-35780][SQL] Support DATE/TIMESTAMP literals across the full range

2021-07-13 Thread GitBox
SparkQA removed a comment on pull request #32959: URL: https://github.com/apache/spark/pull/32959#issuecomment-878815679 **[Test build #140958 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140958/testReport)** for PR 32959 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32959: [SPARK-35780][SQL] Support DATE/TIMESTAMP literals across the full range

2021-07-13 Thread GitBox
AmplabJenkins removed a comment on pull request #32959: URL: https://github.com/apache/spark/pull/32959#issuecomment-878849562 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/140958/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33258: [SPARK-36037][SQL] Support ANSI SQL LOCALTIMESTAMP datetime value function

2021-07-13 Thread GitBox
AmplabJenkins removed a comment on pull request #33258: URL: https://github.com/apache/spark/pull/33258#issuecomment-878850585 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/140962/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33311: [SPARK-36109][SS][TEST] Check data after adding data to topic in KafkaSourceStressSuite

2021-07-13 Thread GitBox
AmplabJenkins removed a comment on pull request #33311: URL: https://github.com/apache/spark/pull/33311#issuecomment-878850052 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [spark] SparkQA removed a comment on pull request #33311: [SPARK-36109][SS][TEST] Check data after adding data to topic in KafkaSourceStressSuite

2021-07-13 Thread GitBox
SparkQA removed a comment on pull request #33311: URL: https://github.com/apache/spark/pull/33311#issuecomment-878821618 **[Test build #140961 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140961/testReport)** for PR 33311 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #33258: [SPARK-36037][SQL] Support ANSI SQL LOCALTIMESTAMP datetime value function

2021-07-13 Thread GitBox
SparkQA removed a comment on pull request #33258: URL: https://github.com/apache/spark/pull/33258#issuecomment-878848597 **[Test build #140962 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140962/testReport)** for PR 33258 at commit

[GitHub] [spark] viirya commented on pull request #33311: [SPARK-36109][SS][TEST] Check data after adding data to topic in KafkaSourceStressSuite

2021-07-13 Thread GitBox
viirya commented on pull request #33311: URL: https://github.com/apache/spark/pull/33311#issuecomment-878851649 retest this please -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [spark] beliefer commented on a change in pull request #33296: [SPARK-34402][SQL] Group exception about data format schema

2021-07-13 Thread GitBox
beliefer commented on a change in pull request #33296: URL: https://github.com/apache/spark/pull/33296#discussion_r668500974 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetSchemaConverter.scala ## @@ -586,11 +586,10 @@

[GitHub] [spark] SparkQA commented on pull request #33258: [SPARK-36037][SQL] Support ANSI SQL LOCALTIMESTAMP datetime value function

2021-07-13 Thread GitBox
SparkQA commented on pull request #33258: URL: https://github.com/apache/spark/pull/33258#issuecomment-878850562 **[Test build #140962 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140962/testReport)** for PR 33258 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #33258: [SPARK-36037][SQL] Support ANSI SQL LOCALTIMESTAMP datetime value function

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #33258: URL: https://github.com/apache/spark/pull/33258#issuecomment-878850585 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/140962/ -- This

[GitHub] [spark] AmplabJenkins commented on pull request #33311: [SPARK-36109][SS][TEST] Check data after adding data to topic in KafkaSourceStressSuite

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #33311: URL: https://github.com/apache/spark/pull/33311#issuecomment-878850528 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/140961/ -- This

[GitHub] [spark] SparkQA commented on pull request #33311: [SPARK-36109][SS][TEST] Check data after adding data to topic in KafkaSourceStressSuite

2021-07-13 Thread GitBox
SparkQA commented on pull request #33311: URL: https://github.com/apache/spark/pull/33311#issuecomment-878850406 **[Test build #140961 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140961/testReport)** for PR 33311 at commit

[GitHub] [spark] SparkQA commented on pull request #33311: [SPARK-36109][SS][TEST] Check data after adding data to topic in KafkaSourceStressSuite

2021-07-13 Thread GitBox
SparkQA commented on pull request #33311: URL: https://github.com/apache/spark/pull/33311#issuecomment-878850030 Kubernetes integration test unable to build dist. exiting with code: 1 URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45474/ --

[GitHub] [spark] AmplabJenkins commented on pull request #33311: [SPARK-36109][SS][TEST] Check data after adding data to topic in KafkaSourceStressSuite

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #33311: URL: https://github.com/apache/spark/pull/33311#issuecomment-878850052 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/45474/ --

[GitHub] [spark] AmplabJenkins commented on pull request #32959: [SPARK-35780][SQL] Support DATE/TIMESTAMP literals across the full range

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #32959: URL: https://github.com/apache/spark/pull/32959#issuecomment-878849562 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/140958/ -- This

[GitHub] [spark] SparkQA commented on pull request #32959: [SPARK-35780][SQL] Support DATE/TIMESTAMP literals across the full range

2021-07-13 Thread GitBox
SparkQA commented on pull request #32959: URL: https://github.com/apache/spark/pull/32959#issuecomment-878849469 **[Test build #140958 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140958/testReport)** for PR 32959 at commit

[GitHub] [spark] SparkQA commented on pull request #33070: [SPARK-35551][SQL] Handle the COUNT bug for lateral subqueries

2021-07-13 Thread GitBox
SparkQA commented on pull request #33070: URL: https://github.com/apache/spark/pull/33070#issuecomment-878848803 **[Test build #140963 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140963/testReport)** for PR 33070 at commit

[GitHub] [spark] SparkQA commented on pull request #33258: [SPARK-36037][SQL] Support ANSI SQL LOCALTIMESTAMP datetime value function

2021-07-13 Thread GitBox
SparkQA commented on pull request #33258: URL: https://github.com/apache/spark/pull/33258#issuecomment-878848597 **[Test build #140962 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140962/testReport)** for PR 33258 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #33317: [SPARK-36095][CORE] Grouping exception in core/rdd

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #33317: URL: https://github.com/apache/spark/pull/33317#issuecomment-878846823 Can one of the admins verify this patch? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33078: [SPARK-35546][Shuffle] Enable push-based shuffle when multiple app attempts are enabled and manage concurrent access to the sta

2021-07-13 Thread GitBox
AmplabJenkins removed a comment on pull request #33078: URL: https://github.com/apache/spark/pull/33078#issuecomment-878845916 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/45470/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33258: [SPARK-36037][SQL] Support ANSI SQL LOCALTIMESTAMP datetime value function

2021-07-13 Thread GitBox
AmplabJenkins removed a comment on pull request #33258: URL: https://github.com/apache/spark/pull/33258#issuecomment-878845919 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/45475/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33310: [WIP][SPARK-36105][SQL] OptimizeLocalShuffleReader support reading data of multiple mappers in one task

2021-07-13 Thread GitBox
AmplabJenkins removed a comment on pull request #33310: URL: https://github.com/apache/spark/pull/33310#issuecomment-878845915 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/45469/

[GitHub] [spark] AmplabJenkins commented on pull request #33078: [SPARK-35546][Shuffle] Enable push-based shuffle when multiple app attempts are enabled and manage concurrent access to the state in a

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #33078: URL: https://github.com/apache/spark/pull/33078#issuecomment-878845916 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/45470/ --

[GitHub] [spark] AmplabJenkins commented on pull request #33310: [WIP][SPARK-36105][SQL] OptimizeLocalShuffleReader support reading data of multiple mappers in one task

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #33310: URL: https://github.com/apache/spark/pull/33310#issuecomment-878845915 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/45469/ --

[GitHub] [spark] AmplabJenkins commented on pull request #33258: [SPARK-36037][SQL] Support ANSI SQL LOCALTIMESTAMP datetime value function

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #33258: URL: https://github.com/apache/spark/pull/33258#issuecomment-878845919 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/45475/ --

[GitHub] [spark] beliefer commented on a change in pull request #33258: [SPARK-36037][SQL] Support ANSI SQL LOCALTIMESTAMP datetime value function

2021-07-13 Thread GitBox
beliefer commented on a change in pull request #33258: URL: https://github.com/apache/spark/pull/33258#discussion_r668401909 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/datetimeExpressions.scala ## @@ -236,6 +274,8 @@ case class

[GitHub] [spark] HeartSaVioR commented on a change in pull request #31989: [SPARK-34891][SS] Introduce state store manager for session window in streaming query

2021-07-13 Thread GitBox
HeartSaVioR commented on a change in pull request #31989: URL: https://github.com/apache/spark/pull/31989#discussion_r668496627 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/StreamingSessionWindowStateManager.scala ## @@ -0,0 +1,370 @@

[GitHub] [spark] dgd-contributor commented on pull request #33317: [SPARK-36095][CORE] Grouping exception in core/rdd

2021-07-13 Thread GitBox
dgd-contributor commented on pull request #33317: URL: https://github.com/apache/spark/pull/33317#issuecomment-878844654 cc @beliefer @allisonwang-db -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [spark] dgd-contributor opened a new pull request #33317: [SPARK-36095][CORE] Grouping exception in core/rdd

2021-07-13 Thread GitBox
dgd-contributor opened a new pull request #33317: URL: https://github.com/apache/spark/pull/33317 ### What changes were proposed in this pull request? This PR group exception messages in core/src/main/scala/org/apache/spark/rdd ### Why are the changes needed? It will largely

[GitHub] [spark] HeartSaVioR commented on a change in pull request #31989: [SPARK-34891][SS] Introduce state store manager for session window in streaming query

2021-07-13 Thread GitBox
HeartSaVioR commented on a change in pull request #31989: URL: https://github.com/apache/spark/pull/31989#discussion_r668492722 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/StreamingSessionWindowStateManager.scala ## @@ -0,0 +1,370 @@

[GitHub] [spark] SparkQA commented on pull request #30869: [SPARK-33865][SQL] When HiveDDL, we need check avro schema too

2021-07-13 Thread GitBox
SparkQA commented on pull request #30869: URL: https://github.com/apache/spark/pull/30869#issuecomment-878841068 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45473/ -- This is an automated message from the Apache

[GitHub] [spark] SparkQA commented on pull request #32959: [SPARK-35780][SQL] Support DATE/TIMESTAMP literals across the full range

2021-07-13 Thread GitBox
SparkQA commented on pull request #32959: URL: https://github.com/apache/spark/pull/32959#issuecomment-878839743 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45472/ -- This is an automated message from the Apache

[GitHub] [spark] SparkQA commented on pull request #33286: [SPARK-36079][SQL] Null-based filter estimate should always be in the range [0, 1]

2021-07-13 Thread GitBox
SparkQA commented on pull request #33286: URL: https://github.com/apache/spark/pull/33286#issuecomment-878838306 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45471/ -- This is an automated message from the Apache

[GitHub] [spark] viirya commented on a change in pull request #31989: [SPARK-34891][SS] Introduce state store manager for session window in streaming query

2021-07-13 Thread GitBox
viirya commented on a change in pull request #31989: URL: https://github.com/apache/spark/pull/31989#discussion_r668488747 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/StreamingSessionWindowStateManager.scala ## @@ -0,0 +1,370 @@ +/* +

[GitHub] [spark] dgd-contributor commented on a change in pull request #33291: [SPARK-35561][SQL] Remove leading zeros from empty static number type partition

2021-07-13 Thread GitBox
dgd-contributor commented on a change in pull request #33291: URL: https://github.com/apache/spark/pull/33291#discussion_r668386989 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/PartitioningUtils.scala ## @@ -351,10 +351,24 @@ object

[GitHub] [spark] SparkQA commented on pull request #33310: [WIP][SPARK-36105][SQL] OptimizeLocalShuffleReader support reading data of multiple mappers in one task

2021-07-13 Thread GitBox
SparkQA commented on pull request #33310: URL: https://github.com/apache/spark/pull/33310#issuecomment-878830685 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45469/ -- This is an automated message from the

[GitHub] [spark] toujours33 commented on pull request #33061: [SPARK-35862][SS] Remove hardcoded time zone time format for watermark stats

2021-07-13 Thread GitBox
toujours33 commented on pull request #33061: URL: https://github.com/apache/spark/pull/33061#issuecomment-878830218 Thanks for your reviewing, Seems there is no strong reason to do this change, I will close this! -- This is an automated message from the Apache Git Service. To respond to

[GitHub] [spark] toujours33 closed pull request #33061: [SPARK-35862][SS] Remove hardcoded time zone time format for watermark stats

2021-07-13 Thread GitBox
toujours33 closed pull request #33061: URL: https://github.com/apache/spark/pull/33061 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [spark] eejbyfeldt commented on pull request #33205: [SPARK-20384][SQL] Support value class in nested schema for Dataset

2021-07-13 Thread GitBox
eejbyfeldt commented on pull request #33205: URL: https://github.com/apache/spark/pull/33205#issuecomment-878826533 @mickjermsurawong-stripe I created a WIP PR with my branch here: https://github.com/apache/spark/pull/33316 I think you can just take the `getConstructorParameters` from

[GitHub] [spark] SparkQA commented on pull request #33078: [SPARK-35546][Shuffle] Enable push-based shuffle when multiple app attempts are enabled and manage concurrent access to the state in a better

2021-07-13 Thread GitBox
SparkQA commented on pull request #33078: URL: https://github.com/apache/spark/pull/33078#issuecomment-878826115 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45470/ -- This is an automated message from the

[GitHub] [spark] SparkQA commented on pull request #33258: [SPARK-36037][SQL] Support ANSI SQL LOCALTIMESTAMP datetime value function

2021-07-13 Thread GitBox
SparkQA commented on pull request #33258: URL: https://github.com/apache/spark/pull/33258#issuecomment-878825352 Kubernetes integration test unable to build dist. exiting with code: 1 URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45475/ --

[GitHub] [spark] AngersZhuuuu commented on pull request #33296: [SPARK-34402][SQL] Group exception about data format schema

2021-07-13 Thread GitBox
AngersZh commented on pull request #33296: URL: https://github.com/apache/spark/pull/33296#issuecomment-878821801 ping @maropu @cloud-fan @beliefer -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above

[GitHub] [spark] AngersZhuuuu removed a comment on pull request #33296: [SPARK-34402][SQL] Group exception about data format schema

2021-07-13 Thread GitBox
AngersZh removed a comment on pull request #33296: URL: https://github.com/apache/spark/pull/33296#issuecomment-878166370 FYI @cloud-fan -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [spark] SparkQA commented on pull request #33311: [SPARK-36109][SS][TEST] Check data after adding data to topic in KafkaSourceStressSuite

2021-07-13 Thread GitBox
SparkQA commented on pull request #33311: URL: https://github.com/apache/spark/pull/33311#issuecomment-878821618 **[Test build #140961 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140961/testReport)** for PR 33311 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #33316: [WIP][SPARK-20384][SQL] Support value classes and always encoded as underlying type

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #33316: URL: https://github.com/apache/spark/pull/33316#issuecomment-878820358 Can one of the admins verify this patch? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL

[GitHub] [spark] zhouyejoe commented on pull request #33078: [SPARK-35546][Shuffle] Enable push-based shuffle when multiple app attempts are enabled and manage concurrent access to the state in a bett

2021-07-13 Thread GitBox
zhouyejoe commented on pull request #33078: URL: https://github.com/apache/spark/pull/33078#issuecomment-878820216 Addressed all the comments other than adding unit tests as @otterc commented for potential concurrency issues. -- This is an automated message from the Apache Git Service.

[GitHub] [spark] viirya commented on a change in pull request #33311: [SPARK-36109][SS][TEST] Check data after adding data to topic in KafkaSourceStressSuite

2021-07-13 Thread GitBox
viirya commented on a change in pull request #33311: URL: https://github.com/apache/spark/pull/33311#discussion_r668469212 ## File path: sql/core/src/test/scala/org/apache/spark/sql/streaming/StreamTest.scala ## @@ -871,6 +871,9 @@ trait StreamTest extends QueryTest with

[GitHub] [spark] eejbyfeldt opened a new pull request #33316: [WIP][SPARK-20384][SQL] Support value classes and always encoded as underlying type

2021-07-13 Thread GitBox
eejbyfeldt opened a new pull request #33316: URL: https://github.com/apache/spark/pull/33316 ### What changes were proposed in this pull request? This PR adds support for using value class in nested case classes. This has previously been proposed in the following PRs:

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33293: [SPARK-36076][SQL][3.1] ArrayIndexOutOfBounds in Cast string to times…

2021-07-13 Thread GitBox
AmplabJenkins removed a comment on pull request #33293: URL: https://github.com/apache/spark/pull/33293#issuecomment-878819296 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/140949/

[GitHub] [spark] SparkQA commented on pull request #33258: [SPARK-36037][SQL] Support ANSI SQL LOCALTIMESTAMP datetime value function

2021-07-13 Thread GitBox
SparkQA commented on pull request #33258: URL: https://github.com/apache/spark/pull/33258#issuecomment-878819603 **[Test build #140960 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140960/testReport)** for PR 33258 at commit

[GitHub] [spark] zhouyejoe commented on a change in pull request #33078: [SPARK-35546][Shuffle] Enable push-based shuffle when multiple app attempts are enabled and manage concurrent access to the sta

2021-07-13 Thread GitBox
zhouyejoe commented on a change in pull request #33078: URL: https://github.com/apache/spark/pull/33078#discussion_r668468557 ## File path: common/network-shuffle/src/test/java/org/apache/spark/network/shuffle/RemoteBlockPushResolverSuite.java ## @@ -219,101 +238,145 @@

[GitHub] [spark] AmplabJenkins commented on pull request #33293: [SPARK-36076][SQL][3.1] ArrayIndexOutOfBounds in Cast string to times…

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #33293: URL: https://github.com/apache/spark/pull/33293#issuecomment-878819296 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/140949/ -- This

[GitHub] [spark] SparkQA removed a comment on pull request #33293: [SPARK-36076][SQL][3.1] ArrayIndexOutOfBounds in Cast string to times…

2021-07-13 Thread GitBox
SparkQA removed a comment on pull request #33293: URL: https://github.com/apache/spark/pull/33293#issuecomment-878728740 **[Test build #140949 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140949/testReport)** for PR 33293 at commit

[GitHub] [spark] SparkQA commented on pull request #33293: [SPARK-36076][SQL][3.1] ArrayIndexOutOfBounds in Cast string to times…

2021-07-13 Thread GitBox
SparkQA commented on pull request #33293: URL: https://github.com/apache/spark/pull/33293#issuecomment-878818147 **[Test build #140949 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140949/testReport)** for PR 33293 at commit

[GitHub] [spark] allisonwang-db commented on a change in pull request #33070: [SPARK-35551][SQL] Handle the COUNT bug for lateral subqueries

2021-07-13 Thread GitBox
allisonwang-db commented on a change in pull request #33070: URL: https://github.com/apache/spark/pull/33070#discussion_r668428086 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/DecorrelateInnerQuery.scala ## @@ -428,7 +451,132 @@ object

[GitHub] [spark] SparkQA commented on pull request #30869: [SPARK-33865][SQL] When HiveDDL, we need check avro schema too

2021-07-13 Thread GitBox
SparkQA commented on pull request #30869: URL: https://github.com/apache/spark/pull/30869#issuecomment-878816249 **[Test build #140959 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140959/testReport)** for PR 30869 at commit

[GitHub] [spark] SparkQA commented on pull request #32959: [SPARK-35780][SQL] Support DATE/TIMESTAMP literals across the full range

2021-07-13 Thread GitBox
SparkQA commented on pull request #32959: URL: https://github.com/apache/spark/pull/32959#issuecomment-878815679 **[Test build #140958 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140958/testReport)** for PR 32959 at commit

[GitHub] [spark] SparkQA commented on pull request #33286: [SPARK-36079][SQL] Null-based filter estimate should always be in the range [0, 1]

2021-07-13 Thread GitBox
SparkQA commented on pull request #33286: URL: https://github.com/apache/spark/pull/33286#issuecomment-878815490 **[Test build #140957 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140957/testReport)** for PR 33286 at commit

[GitHub] [spark] viirya commented on a change in pull request #31989: [SPARK-34891][SS] Introduce state store manager for session window in streaming query

2021-07-13 Thread GitBox
viirya commented on a change in pull request #31989: URL: https://github.com/apache/spark/pull/31989#discussion_r668462823 ## File path: sql/core/src/test/scala/org/apache/spark/sql/execution/streaming/state/StreamingSessionWindowStateManagerSuite.scala ## @@ -0,0 +1,195 @@

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33311: [SPARK-36109][SS][TEST] Check data after adding data to topic in KafkaSourceStressSuite

2021-07-13 Thread GitBox
AmplabJenkins removed a comment on pull request #33311: URL: https://github.com/apache/spark/pull/33311#issuecomment-878814075 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/140946/

[GitHub] [spark] AmplabJenkins commented on pull request #33311: [SPARK-36109][SS][TEST] Check data after adding data to topic in KafkaSourceStressSuite

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #33311: URL: https://github.com/apache/spark/pull/33311#issuecomment-878814075 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/140946/ -- This

[GitHub] [spark] allisonwang-db commented on a change in pull request #33070: [SPARK-35551][SQL] Handle the COUNT bug for lateral subqueries

2021-07-13 Thread GitBox
allisonwang-db commented on a change in pull request #33070: URL: https://github.com/apache/spark/pull/33070#discussion_r668428086 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/DecorrelateInnerQuery.scala ## @@ -428,7 +451,132 @@ object

[GitHub] [spark] viirya commented on a change in pull request #31989: [SPARK-34891][SS] Introduce state store manager for session window in streaming query

2021-07-13 Thread GitBox
viirya commented on a change in pull request #31989: URL: https://github.com/apache/spark/pull/31989#discussion_r668460586 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/StreamingSessionWindowStateManager.scala ## @@ -0,0 +1,370 @@ +/* +

[GitHub] [spark] SparkQA commented on pull request #33310: [WIP][SPARK-36105][SQL] OptimizeLocalShuffleReader support reading data of multiple mappers in one task

2021-07-13 Thread GitBox
SparkQA commented on pull request #33310: URL: https://github.com/apache/spark/pull/33310#issuecomment-878812570 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45469/ -- This is an automated message from the Apache

[GitHub] [spark] wangyum commented on pull request #33286: [SPARK-36079][SQL] Null-based filter estimate should always be in the range [0, 1]

2021-07-13 Thread GitBox
wangyum commented on pull request #33286: URL: https://github.com/apache/spark/pull/33286#issuecomment-878810758 @karenfeng Do you have plan to fix the null count higher than the row count? -- This is an automated message from the Apache Git Service. To respond to the message, please log

[GitHub] [spark] SparkQA commented on pull request #33078: [SPARK-35546][Shuffle] Enable push-based shuffle when multiple app attempts are enabled and manage concurrent access to the state in a better

2021-07-13 Thread GitBox
SparkQA commented on pull request #33078: URL: https://github.com/apache/spark/pull/33078#issuecomment-878809929 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45470/ -- This is an automated message from the Apache

[GitHub] [spark] ReachInfi commented on pull request #33314: [SPARK-36118][SQL] add bitmap functions for Spark SQL

2021-07-13 Thread GitBox
ReachInfi commented on pull request #33314: URL: https://github.com/apache/spark/pull/33314#issuecomment-878808235 done -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To

[GitHub] [spark] viirya commented on a change in pull request #31989: [SPARK-34891][SS] Introduce state store manager for session window in streaming query

2021-07-13 Thread GitBox
viirya commented on a change in pull request #31989: URL: https://github.com/apache/spark/pull/31989#discussion_r668446915 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/StreamingSessionWindowStateManager.scala ## @@ -0,0 +1,370 @@ +/* +

[GitHub] [spark] HyukjinKwon commented on pull request #33314: Add bitmap functions in Spark SQL

2021-07-13 Thread GitBox
HyukjinKwon commented on pull request #33314: URL: https://github.com/apache/spark/pull/33314#issuecomment-878807095 Can you link it to the PR titile? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

<    1   2   3   4   5   6   7   >