[GitHub] [spark] wangyum commented on pull request #33324: [SPARK-36093][SQL] RemoveRedundantAliases should not change Command's parameter's expression's name

2021-07-13 Thread GitBox
wangyum commented on pull request #33324: URL: https://github.com/apache/spark/pull/33324#issuecomment-879521634 cc @wangshisan -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comm

[GitHub] [spark] SparkQA commented on pull request #33077: [SPARK-34892][SS] Introduce MergingSortWithSessionWindowStateIterator sorting input rows and rows in state efficiently

2021-07-13 Thread GitBox
SparkQA commented on pull request #33077: URL: https://github.com/apache/spark/pull/33077#issuecomment-879512560 **[Test build #140989 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140989/testReport)** for PR 33077 at commit [`e4a74a3`](https://github.com

[GitHub] [spark] sunchao commented on pull request #33330: [SPARK-36123][SQL] Parquet vectorized reader doesn't skip null values correctly

2021-07-13 Thread GitBox
sunchao commented on pull request #33330: URL: https://github.com/apache/spark/pull/33330#issuecomment-879512049 Thanks @dongjoon-hyun - what do you think if I open a separate PR to do the refactoring on the test suite first? It will make the changes easier.

[GitHub] [spark] sunchao commented on a change in pull request #33330: [SPARK-36123][SQL] Parquet vectorized reader doesn't skip null values correctly

2021-07-13 Thread GitBox
sunchao commented on a change in pull request #33330: URL: https://github.com/apache/spark/pull/33330#discussion_r669215145 ## File path: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetColumnIndexSuite.scala ## @@ -31,24 +31,25 @@ class Parqu

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33330: [SPARK-36123][SQL] Parquet vectorized reader doesn't skip null values correctly

2021-07-13 Thread GitBox
AmplabJenkins removed a comment on pull request #33330: URL: https://github.com/apache/spark/pull/33330#issuecomment-879511386 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/45500/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33331: [SPARK-36125][PYTHON] Implement non-equality comparison operators between two Categoricals

2021-07-13 Thread GitBox
AmplabJenkins removed a comment on pull request #33331: URL: https://github.com/apache/spark/pull/33331#issuecomment-879511385 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/140988/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33077: [SPARK-34892][SS] Introduce MergingSortWithSessionWindowStateIterator sorting input rows and rows in state efficiently

2021-07-13 Thread GitBox
AmplabJenkins removed a comment on pull request #33077: URL: https://github.com/apache/spark/pull/33077#issuecomment-879511382 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/45501/

[GitHub] [spark] AmplabJenkins commented on pull request #33331: [SPARK-36125][PYTHON] Implement non-equality comparison operators between two Categoricals

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #33331: URL: https://github.com/apache/spark/pull/33331#issuecomment-879511385 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/140988/

[GitHub] [spark] AmplabJenkins commented on pull request #33330: [SPARK-36123][SQL] Parquet vectorized reader doesn't skip null values correctly

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #33330: URL: https://github.com/apache/spark/pull/33330#issuecomment-879511386 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/45500/

[GitHub] [spark] AmplabJenkins commented on pull request #33077: [SPARK-34892][SS] Introduce MergingSortWithSessionWindowStateIterator sorting input rows and rows in state efficiently

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #33077: URL: https://github.com/apache/spark/pull/33077#issuecomment-879511382 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/45501/

[GitHub] [spark] ulysses-you commented on a change in pull request #32872: [SPARK-35639][SQL] Make hasCoalescedPartition return true if something was actually coalesced

2021-07-13 Thread GitBox
ulysses-you commented on a change in pull request #32872: URL: https://github.com/apache/spark/pull/32872#discussion_r669214397 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/CustomShuffleReaderExec.scala ## @@ -87,8 +87,15 @@ case class CustomSh

[GitHub] [spark] SparkQA commented on pull request #33331: [SPARK-36125][PYTHON] Implement non-equality comparison operators between two Categoricals

2021-07-13 Thread GitBox
SparkQA commented on pull request #33331: URL: https://github.com/apache/spark/pull/33331#issuecomment-879508116 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45502/

[GitHub] [spark] SparkQA commented on pull request #33077: [SPARK-34892][SS] Introduce MergingSortWithSessionWindowStateIterator sorting input rows and rows in state efficiently

2021-07-13 Thread GitBox
SparkQA commented on pull request #33077: URL: https://github.com/apache/spark/pull/33077#issuecomment-879502187 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45501/

[GitHub] [spark] SparkQA removed a comment on pull request #33331: [SPARK-36125][PYTHON] Implement non-equality comparison operators between two Categoricals

2021-07-13 Thread GitBox
SparkQA removed a comment on pull request #33331: URL: https://github.com/apache/spark/pull/33331#issuecomment-879491612 **[Test build #140988 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140988/testReport)** for PR 33331 at commit [`c9cf74b`](https://gi

[GitHub] [spark] SparkQA commented on pull request #33330: [SPARK-36123][SQL] Parquet vectorized reader doesn't skip null values correctly

2021-07-13 Thread GitBox
SparkQA commented on pull request #33330: URL: https://github.com/apache/spark/pull/33330#issuecomment-879501811 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45500/

[GitHub] [spark] SparkQA commented on pull request #33331: [SPARK-36125][PYTHON] Implement non-equality comparison operators between two Categoricals

2021-07-13 Thread GitBox
SparkQA commented on pull request #33331: URL: https://github.com/apache/spark/pull/33331#issuecomment-879501280 **[Test build #140988 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140988/testReport)** for PR 33331 at commit [`c9cf74b`](https://github.co

[GitHub] [spark] SparkQA commented on pull request #33331: [SPARK-36125][PYTHON] Implement non-equality comparison operators between two Categoricals

2021-07-13 Thread GitBox
SparkQA commented on pull request #33331: URL: https://github.com/apache/spark/pull/33331#issuecomment-879491612 **[Test build #140988 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140988/testReport)** for PR 33331 at commit [`c9cf74b`](https://github.com

[GitHub] [spark] dongjoon-hyun commented on a change in pull request #33330: [SPARK-36123][SQL] Parquet vectorized reader doesn't skip null values correctly

2021-07-13 Thread GitBox
dongjoon-hyun commented on a change in pull request #33330: URL: https://github.com/apache/spark/pull/33330#discussion_r669196510 ## File path: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetColumnIndexSuite.scala ## @@ -31,24 +31,25 @@ class

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33286: [SPARK-36079][SQL] Null-based filter estimate should always be in the range [0, 1]

2021-07-13 Thread GitBox
AmplabJenkins removed a comment on pull request #33286: URL: https://github.com/apache/spark/pull/33286#issuecomment-879490705 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/45499/

[GitHub] [spark] AmplabJenkins commented on pull request #33286: [SPARK-36079][SQL] Null-based filter estimate should always be in the range [0, 1]

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #33286: URL: https://github.com/apache/spark/pull/33286#issuecomment-879490705 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/45499/

[GitHub] [spark] SparkQA commented on pull request #33077: [SPARK-34892][SS] Introduce MergingSortWithSessionWindowStateIterator sorting input rows and rows in state efficiently

2021-07-13 Thread GitBox
SparkQA commented on pull request #33077: URL: https://github.com/apache/spark/pull/33077#issuecomment-879490397 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45501/

[GitHub] [spark] SparkQA commented on pull request #33330: [SPARK-36123][SQL] Parquet vectorized reader doesn't skip null values correctly

2021-07-13 Thread GitBox
SparkQA commented on pull request #33330: URL: https://github.com/apache/spark/pull/33330#issuecomment-879489845 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45500/

[GitHub] [spark] dongjoon-hyun commented on pull request #33330: [SPARK-36123][SQL] Parquet vectorized reader doesn't skip null values correctly

2021-07-13 Thread GitBox
dongjoon-hyun commented on pull request #33330: URL: https://github.com/apache/spark/pull/33330#issuecomment-879489131 Thank you for pinging me, @sunchao.

[GitHub] [spark] github-actions[bot] closed pull request #31771: [SPARK-34652][AVRO] Support SchemaRegistry in from_avro method

2021-07-13 Thread GitBox
github-actions[bot] closed pull request #31771: URL: https://github.com/apache/spark/pull/31771

[GitHub] [spark] otterc commented on a change in pull request #33329: [WIP][SPARK-35917][SHUFFLE][CORE][3.2] Disable push-based shuffle feature to prevent it from being used

2021-07-13 Thread GitBox
otterc commented on a change in pull request #33329: URL: https://github.com/apache/spark/pull/33329#discussion_r669189297 ## File path: core/src/main/scala/org/apache/spark/internal/config/package.scala ## @@ -2079,7 +2079,7 @@ package object config { "conjunction wit

[GitHub] [spark] github-actions[bot] closed pull request #31937: [SPARK-10816][SS] Support session window natively

2021-07-13 Thread GitBox
github-actions[bot] closed pull request #31937: URL: https://github.com/apache/spark/pull/31937

[GitHub] [spark] github-actions[bot] commented on pull request #31926: [SPARK-34775][SQL] Push down limit through window when partitionSpec is not empty

2021-07-13 Thread GitBox
github-actions[bot] commented on pull request #31926: URL: https://github.com/apache/spark/pull/31926#issuecomment-879483610 We're closing this PR because it hasn't been updated in a while. This isn't a judgement on the merit of the PR in any way. It's just a way of keeping the PR queue ma

[GitHub] [spark] SparkQA commented on pull request #33286: [SPARK-36079][SQL] Null-based filter estimate should always be in the range [0, 1]

2021-07-13 Thread GitBox
SparkQA commented on pull request #33286: URL: https://github.com/apache/spark/pull/33286#issuecomment-879482756 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45499/

[GitHub] [spark] xinrong-databricks commented on a change in pull request #33331: [SPARK-36125][PYTHON] Implement non-equality comparison operators between two Categoricals

2021-07-13 Thread GitBox
xinrong-databricks commented on a change in pull request #33331: URL: https://github.com/apache/spark/pull/33331#discussion_r669186589 ## File path: python/pyspark/pandas/data_type_ops/categorical_ops.py ## @@ -64,15 +66,28 @@ def astype(self, index_ops: IndexOpsLike, dtype: Un

[GitHub] [spark] xinrong-databricks opened a new pull request #33331: [SPARK-36125][PYTHON] Implement non-equality comparison operators of Categoricals

2021-07-13 Thread GitBox
xinrong-databricks opened a new pull request #33331: URL: https://github.com/apache/spark/pull/33331 ### What changes were proposed in this pull request? Implement non-equality comparison operators between two Categoricals. ### Why are the changes needed? pandas supports non-equ

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33077: [SPARK-34892][SS] Introduce MergingSortWithSessionWindowStateIterator sorting input rows and rows in state efficiently

2021-07-13 Thread GitBox
AmplabJenkins removed a comment on pull request #33077: URL: https://github.com/apache/spark/pull/33077#issuecomment-879391296 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/45495/

[GitHub] [spark] SparkQA commented on pull request #33077: [SPARK-34892][SS] Introduce MergingSortWithSessionWindowStateIterator sorting input rows and rows in state efficiently

2021-07-13 Thread GitBox
SparkQA commented on pull request #33077: URL: https://github.com/apache/spark/pull/33077#issuecomment-879471127 **[Test build #140987 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140987/testReport)** for PR 33077 at commit [`b540632`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #33330: [SPARK-36123][SQL] Parquet vectorized reader doesn't skip null values correctly

2021-07-13 Thread GitBox
SparkQA commented on pull request #33330: URL: https://github.com/apache/spark/pull/33330#issuecomment-879470999 **[Test build #140986 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140986/testReport)** for PR 33330 at commit [`41a7ca8`](https://github.com

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33327: [SPARK-36109][SS][TEST][3.1] Check data after adding data to topic in KafkaSourceStressSuite

2021-07-13 Thread GitBox
AmplabJenkins removed a comment on pull request #33327: URL: https://github.com/apache/spark/pull/33327#issuecomment-879470604 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/140980/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32049: [SPARK-34952][SQL] Aggregate (Min/Max/Count) push down for Parquet

2021-07-13 Thread GitBox
AmplabJenkins removed a comment on pull request #32049: URL: https://github.com/apache/spark/pull/32049#issuecomment-879470602 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/45498/

[GitHub] [spark] AmplabJenkins commented on pull request #33327: [SPARK-36109][SS][TEST][3.1] Check data after adding data to topic in KafkaSourceStressSuite

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #33327: URL: https://github.com/apache/spark/pull/33327#issuecomment-879470604 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/140980/

[GitHub] [spark] AmplabJenkins commented on pull request #32049: [SPARK-34952][SQL] Aggregate (Min/Max/Count) push down for Parquet

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #32049: URL: https://github.com/apache/spark/pull/32049#issuecomment-879470602 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/45498/

[GitHub] [spark] SparkQA commented on pull request #33286: [SPARK-36079][SQL] Null-based filter estimate should always be in the range [0, 1]

2021-07-13 Thread GitBox
SparkQA commented on pull request #33286: URL: https://github.com/apache/spark/pull/33286#issuecomment-879470445 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45499/

[GitHub] [spark] SparkQA commented on pull request #32049: [SPARK-34952][SQL] Aggregate (Min/Max/Count) push down for Parquet

2021-07-13 Thread GitBox
SparkQA commented on pull request #32049: URL: https://github.com/apache/spark/pull/32049#issuecomment-879459700 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45498/

[GitHub] [spark] SparkQA removed a comment on pull request #33327: [SPARK-36109][SS][TEST][3.1] Check data after adding data to topic in KafkaSourceStressSuite

2021-07-13 Thread GitBox
SparkQA removed a comment on pull request #33327: URL: https://github.com/apache/spark/pull/33327#issuecomment-879316709 **[Test build #140980 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140980/testReport)** for PR 33327 at commit [`f5ec2c7`](https://gi

[GitHub] [spark] SparkQA commented on pull request #33327: [SPARK-36109][SS][TEST][3.1] Check data after adding data to topic in KafkaSourceStressSuite

2021-07-13 Thread GitBox
SparkQA commented on pull request #33327: URL: https://github.com/apache/spark/pull/33327#issuecomment-879458752 **[Test build #140980 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140980/testReport)** for PR 33327 at commit [`f5ec2c7`](https://github.co

[GitHub] [spark] sunchao commented on pull request #33330: [SPARK-36123][SQL] Parquet vectorized reader doesn't skip null values correctly

2021-07-13 Thread GitBox
sunchao commented on pull request #33330: URL: https://github.com/apache/spark/pull/33330#issuecomment-879455576 cc @dongjoon-hyun @viirya @cloud-fan

[GitHub] [spark] sunchao opened a new pull request #33330: [SPARK-36123][SQL] Parquet vectorized reader doesn't skip null values correctly

2021-07-13 Thread GitBox
sunchao opened a new pull request #33330: URL: https://github.com/apache/spark/pull/33330 ### What changes were proposed in this pull request? Fix the skipping values logic in Parquet vectorized reader when column index is effective, by considering nulls. Also refactored

[GitHub] [spark] dongjoon-hyun commented on pull request #33329: [WIP][SPARK-35917][SHUFFLE][CORE][3.2] Disable push-based shuffle feature to prevent it from being used

2021-07-13 Thread GitBox
dongjoon-hyun commented on pull request #33329: URL: https://github.com/apache/spark/pull/33329#issuecomment-879452177 I'm okay with disabling this, but I'm wondering what is the difference from branch-3.1 because branch-3.1 also has the same configuration. If we want to disable this explic

[GitHub] [spark] dongjoon-hyun commented on a change in pull request #33329: [WIP][SPARK-35917][SHUFFLE][CORE]Disable push-based shuffle feature to prevent it from being used

2021-07-13 Thread GitBox
dongjoon-hyun commented on a change in pull request #33329: URL: https://github.com/apache/spark/pull/33329#discussion_r669153013 ## File path: core/src/main/scala/org/apache/spark/util/Utils.scala ## @@ -2598,11 +2598,16 @@ private[spark] object Utils extends Logging { * S

[GitHub] [spark] SparkQA commented on pull request #33286: [SPARK-36079][SQL] Null-based filter estimate should always be in the range [0, 1]

2021-07-13 Thread GitBox
SparkQA commented on pull request #33286: URL: https://github.com/apache/spark/pull/33286#issuecomment-879449174 **[Test build #140985 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140985/testReport)** for PR 33286 at commit [`67a228b`](https://github.com

[GitHub] [spark] viirya commented on pull request #33326: [SPARK-36109][SS][TEST][3.0] Check data after adding data to topic in KafkaSourceStressSuite

2021-07-13 Thread GitBox
viirya commented on pull request #33326: URL: https://github.com/apache/spark/pull/33326#issuecomment-879449209 Thank you @dongjoon-hyun and @HeartSaVioR

[GitHub] [spark] dongjoon-hyun commented on a change in pull request #33329: [WIP][SPARK-35917][SHUFFLE][CORE]Disable push-based shuffle feature to prevent it from being used

2021-07-13 Thread GitBox
dongjoon-hyun commented on a change in pull request #33329: URL: https://github.com/apache/spark/pull/33329#discussion_r669151977 ## File path: core/src/main/scala/org/apache/spark/internal/config/package.scala ## @@ -2079,7 +2079,7 @@ package object config { "conjunct

[GitHub] [spark] SparkQA commented on pull request #32049: [SPARK-34952][SQL] Aggregate (Min/Max/Count) push down for Parquet

2021-07-13 Thread GitBox
SparkQA commented on pull request #32049: URL: https://github.com/apache/spark/pull/32049#issuecomment-879446170 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45498/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33286: [SPARK-36079][SQL] Null-based filter estimate should always be in the range [0, 1]

2021-07-13 Thread GitBox
AmplabJenkins removed a comment on pull request #33286: URL: https://github.com/apache/spark/pull/33286#issuecomment-879420335 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/140983/

[GitHub] [spark] SparkQA removed a comment on pull request #33286: [SPARK-36079][SQL] Null-based filter estimate should always be in the range [0, 1]

2021-07-13 Thread GitBox
SparkQA removed a comment on pull request #33286: URL: https://github.com/apache/spark/pull/33286#issuecomment-879416458 **[Test build #140983 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140983/testReport)** for PR 33286 at commit [`e0a5895`](https://gi

[GitHub] [spark] SparkQA commented on pull request #33286: [SPARK-36079][SQL] Null-based filter estimate should always be in the range [0, 1]

2021-07-13 Thread GitBox
SparkQA commented on pull request #33286: URL: https://github.com/apache/spark/pull/33286#issuecomment-879420312 **[Test build #140983 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140983/testReport)** for PR 33286 at commit [`e0a5895`](https://github.co

[GitHub] [spark] AmplabJenkins commented on pull request #33286: [SPARK-36079][SQL] Null-based filter estimate should always be in the range [0, 1]

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #33286: URL: https://github.com/apache/spark/pull/33286#issuecomment-879420335 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/140983/

[GitHub] [spark] SparkQA commented on pull request #33326: [SPARK-36109][SS][TEST][3.0] Check data after adding data to topic in KafkaSourceStressSuite

2021-07-13 Thread GitBox
SparkQA commented on pull request #33326: URL: https://github.com/apache/spark/pull/33326#issuecomment-879420392 **[Test build #140979 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140979/testReport)** for PR 33326 at commit [`3c8f119`](https://github.co

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32049: [SPARK-34952][SQL] Aggregate (Min/Max/Count) push down for Parquet

2021-07-13 Thread GitBox
AmplabJenkins removed a comment on pull request #32049: URL: https://github.com/apache/spark/pull/32049#issuecomment-879423126 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/45497/

[GitHub] [spark] AmplabJenkins commented on pull request #32049: [SPARK-34952][SQL] Aggregate (Min/Max/Count) push down for Parquet

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #32049: URL: https://github.com/apache/spark/pull/32049#issuecomment-879423126 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/45497/

[GitHub] [spark] SparkQA commented on pull request #32049: [SPARK-34952][SQL] Aggregate (Min/Max/Count) push down for Parquet

2021-07-13 Thread GitBox
SparkQA commented on pull request #32049: URL: https://github.com/apache/spark/pull/32049#issuecomment-879423105 Kubernetes integration test unable to build dist. exiting with code: 1 URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45497/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33326: [SPARK-36109][SS][TEST][3.0] Check data after adding data to topic in KafkaSourceStressSuite

2021-07-13 Thread GitBox
AmplabJenkins removed a comment on pull request #33326: URL: https://github.com/apache/spark/pull/33326#issuecomment-879421436 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/140979/

[GitHub] [spark] AmplabJenkins commented on pull request #33326: [SPARK-36109][SS][TEST][3.0] Check data after adding data to topic in KafkaSourceStressSuite

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #33326: URL: https://github.com/apache/spark/pull/33326#issuecomment-879421436 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/140979/

[GitHub] [spark] SparkQA removed a comment on pull request #33326: [SPARK-36109][SS][TEST][3.0] Check data after adding data to topic in KafkaSourceStressSuite

2021-07-13 Thread GitBox
SparkQA removed a comment on pull request #33326: URL: https://github.com/apache/spark/pull/33326#issuecomment-879278682 **[Test build #140979 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140979/testReport)** for PR 33326 at commit [`3c8f119`](https://gi

[GitHub] [spark] SparkQA removed a comment on pull request #32049: [SPARK-34952][SQL] Aggregate (Min/Max/Count) push down for Parquet

2021-07-13 Thread GitBox
SparkQA removed a comment on pull request #32049: URL: https://github.com/apache/spark/pull/32049#issuecomment-879414907 **[Test build #140982 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140982/testReport)** for PR 32049 at commit [`7540b59`](https://gi

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33286: [SPARK-36079][SQL] Null-based filter estimate should always be in the range [0, 1]

2021-07-13 Thread GitBox
AmplabJenkins removed a comment on pull request #33286: URL: https://github.com/apache/spark/pull/33286#issuecomment-879418798 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/45496/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32049: [SPARK-34952][SQL] Aggregate (Min/Max/Count) push down for Parquet

2021-07-13 Thread GitBox
AmplabJenkins removed a comment on pull request #32049: URL: https://github.com/apache/spark/pull/32049#issuecomment-879418565 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/140982/

[GitHub] [spark] shardulm94 commented on a change in pull request #33328: [SPARK-28266][SQL] convertToLogicalRelation should not interpret `path` property when reading Hive tables

2021-07-13 Thread GitBox
shardulm94 commented on a change in pull request #33328: URL: https://github.com/apache/spark/pull/33328#discussion_r669126948 ## File path: sql/hive/src/test/scala/org/apache/spark/sql/hive/HiveMetastoreCatalogSuite.scala ## @@ -363,4 +363,33 @@ class DataSourceWithHiveMetast

[GitHub] [spark] SparkQA commented on pull request #32049: [SPARK-34952][SQL] Aggregate (Min/Max/Count) push down for Parquet

2021-07-13 Thread GitBox
SparkQA commented on pull request #32049: URL: https://github.com/apache/spark/pull/32049#issuecomment-879419291 **[Test build #140984 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140984/testReport)** for PR 32049 at commit [`2c889c6`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #33286: [SPARK-36079][SQL] Null-based filter estimate should always be in the range [0, 1]

2021-07-13 Thread GitBox
SparkQA commented on pull request #33286: URL: https://github.com/apache/spark/pull/33286#issuecomment-879418781 Kubernetes integration test unable to build dist. exiting with code: 1 URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45496/

[GitHub] [spark] AmplabJenkins commented on pull request #33286: [SPARK-36079][SQL] Null-based filter estimate should always be in the range [0, 1]

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #33286: URL: https://github.com/apache/spark/pull/33286#issuecomment-879418798 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/45496/

[GitHub] [spark] AmplabJenkins commented on pull request #32049: [SPARK-34952][SQL] Aggregate (Min/Max/Count) push down for Parquet

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #32049: URL: https://github.com/apache/spark/pull/32049#issuecomment-879418565 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/140982/

[GitHub] [spark] SparkQA commented on pull request #32049: [SPARK-34952][SQL] Aggregate (Min/Max/Count) push down for Parquet

2021-07-13 Thread GitBox
SparkQA commented on pull request #32049: URL: https://github.com/apache/spark/pull/32049#issuecomment-879418533 **[Test build #140982 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140982/testReport)** for PR 32049 at commit [`7540b59`](https://github.co

[GitHub] [spark] SparkQA commented on pull request #33286: [SPARK-36079][SQL] Null-based filter estimate should always be in the range [0, 1]

2021-07-13 Thread GitBox
SparkQA commented on pull request #33286: URL: https://github.com/apache/spark/pull/33286#issuecomment-879416458 **[Test build #140983 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140983/testReport)** for PR 33286 at commit [`e0a5895`](https://github.com

[GitHub] [spark] dongjoon-hyun closed pull request #33326: [SPARK-36109][SS][TEST][3.0] Check data after adding data to topic in KafkaSourceStressSuite

2021-07-13 Thread GitBox
dongjoon-hyun closed pull request #33326: URL: https://github.com/apache/spark/pull/33326

[GitHub] [spark] dongjoon-hyun closed pull request #33327: [SPARK-36109][SS][TEST][3.1] Check data after adding data to topic in KafkaSourceStressSuite

2021-07-13 Thread GitBox
dongjoon-hyun closed pull request #33327: URL: https://github.com/apache/spark/pull/33327

[GitHub] [spark] SparkQA commented on pull request #32049: [SPARK-34952][SQL] Aggregate (Min/Max/Count) push down for Parquet

2021-07-13 Thread GitBox
SparkQA commented on pull request #32049: URL: https://github.com/apache/spark/pull/32049#issuecomment-879414907 **[Test build #140982 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140982/testReport)** for PR 32049 at commit [`7540b59`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #33077: [SPARK-34892][SS] Introduce MergingSortWithSessionWindowStateIterator sorting input rows and rows in state efficiently

2021-07-13 Thread GitBox
SparkQA commented on pull request #33077: URL: https://github.com/apache/spark/pull/33077#issuecomment-879391268 Kubernetes integration test unable to build dist. exiting with code: 1 URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45495/

[GitHub] [spark] AmplabJenkins commented on pull request #33077: [SPARK-34892][SS] Introduce MergingSortWithSessionWindowStateIterator sorting input rows and rows in state efficiently

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #33077: URL: https://github.com/apache/spark/pull/33077#issuecomment-879391296 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/45495/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33077: [SPARK-34892][SS] Introduce MergingSortWithSessionWindowStateIterator sorting input rows and rows in state efficiently

2021-07-13 Thread GitBox
AmplabJenkins removed a comment on pull request #33077: URL: https://github.com/apache/spark/pull/33077#issuecomment-879386639 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/140981/

[GitHub] [spark] SparkQA removed a comment on pull request #33077: [SPARK-34892][SS] Introduce MergingSortWithSessionWindowStateIterator sorting input rows and rows in state efficiently

2021-07-13 Thread GitBox
SparkQA removed a comment on pull request #33077: URL: https://github.com/apache/spark/pull/33077#issuecomment-879381320 **[Test build #140981 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140981/testReport)** for PR 33077 at commit [`60f6114`](https://gi

[GitHub] [spark] SparkQA commented on pull request #33077: [SPARK-34892][SS] Introduce MergingSortWithSessionWindowStateIterator sorting input rows and rows in state efficiently

2021-07-13 Thread GitBox
SparkQA commented on pull request #33077: URL: https://github.com/apache/spark/pull/33077#issuecomment-879386603 **[Test build #140981 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140981/testReport)** for PR 33077 at commit [`60f6114`](https://github.co

[GitHub] [spark] AmplabJenkins commented on pull request #33077: [SPARK-34892][SS] Introduce MergingSortWithSessionWindowStateIterator sorting input rows and rows in state efficiently

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #33077: URL: https://github.com/apache/spark/pull/33077#issuecomment-879386639 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/140981/

[GitHub] [spark] SparkQA commented on pull request #33077: [SPARK-34892][SS] Introduce MergingSortWithSessionWindowStateIterator sorting input rows and rows in state efficiently

2021-07-13 Thread GitBox
SparkQA commented on pull request #33077: URL: https://github.com/apache/spark/pull/33077#issuecomment-879381320 **[Test build #140981 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140981/testReport)** for PR 33077 at commit [`60f6114`](https://github.com

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33327: [SPARK-36109][SS][TEST][3.1] Check data after adding data to topic in KafkaSourceStressSuite

2021-07-13 Thread GitBox
AmplabJenkins removed a comment on pull request #33327: URL: https://github.com/apache/spark/pull/33327#issuecomment-879379566 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/45494/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33309: [SPARK-36106][SQL][CORE] Label error classes for subset of QueryCompilationErrors

2021-07-13 Thread GitBox
AmplabJenkins removed a comment on pull request #33309: URL: https://github.com/apache/spark/pull/33309#issuecomment-879379568 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/140977/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32959: [SPARK-35780][SQL] Support DATE/TIMESTAMP literals across the full range

2021-07-13 Thread GitBox
AmplabJenkins removed a comment on pull request #32959: URL: https://github.com/apache/spark/pull/32959#issuecomment-879379565 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/140976/

[GitHub] [spark] AmplabJenkins commented on pull request #32959: [SPARK-35780][SQL] Support DATE/TIMESTAMP literals across the full range

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #32959: URL: https://github.com/apache/spark/pull/32959#issuecomment-879379565 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/140976/

[GitHub] [spark] AmplabJenkins commented on pull request #33327: [SPARK-36109][SS][TEST][3.1] Check data after adding data to topic in KafkaSourceStressSuite

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #33327: URL: https://github.com/apache/spark/pull/33327#issuecomment-879379566 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/45494/

[GitHub] [spark] AmplabJenkins commented on pull request #33309: [SPARK-36106][SQL][CORE] Label error classes for subset of QueryCompilationErrors

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #33309: URL: https://github.com/apache/spark/pull/33309#issuecomment-879379568 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/140977/

[GitHub] [spark] SparkQA commented on pull request #33327: [SPARK-36109][SS][TEST][3.1] Check data after adding data to topic in KafkaSourceStressSuite

2021-07-13 Thread GitBox
SparkQA commented on pull request #33327: URL: https://github.com/apache/spark/pull/33327#issuecomment-879371155 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45494/

[GitHub] [spark] SparkQA removed a comment on pull request #32959: [SPARK-35780][SQL] Support DATE/TIMESTAMP literals across the full range

2021-07-13 Thread GitBox
SparkQA removed a comment on pull request #32959: URL: https://github.com/apache/spark/pull/32959#issuecomment-879197621 **[Test build #140976 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140976/testReport)** for PR 32959 at commit [`4723f8e`](https://gi

[GitHub] [spark] SparkQA commented on pull request #32959: [SPARK-35780][SQL] Support DATE/TIMESTAMP literals across the full range

2021-07-13 Thread GitBox
SparkQA commented on pull request #32959: URL: https://github.com/apache/spark/pull/32959#issuecomment-879369773 **[Test build #140976 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140976/testReport)** for PR 32959 at commit [`4723f8e`](https://github.co

[GitHub] [spark] viirya commented on pull request #33077: [SPARK-34892][SS] Introduce MergingSortWithSessionWindowStateIterator sorting input rows and rows in state efficiently

2021-07-13 Thread GitBox
viirya commented on pull request #33077: URL: https://github.com/apache/spark/pull/33077#issuecomment-879368499 Thanks @HeartSaVioR. I will review this today.

[GitHub] [spark] HeartSaVioR commented on pull request #33077: [SPARK-34892][SS] Introduce MergingSortWithSessionWindowStateIterator sorting input rows and rows in state efficiently

2021-07-13 Thread GitBox
HeartSaVioR commented on pull request #33077: URL: https://github.com/apache/spark/pull/33077#issuecomment-879367696 cc. @viirya @xuanyuanking Please take a look. Thanks!

[GitHub] [spark] SparkQA removed a comment on pull request #33309: [SPARK-36106][SQL][CORE] Label error classes for subset of QueryCompilationErrors

2021-07-13 Thread GitBox
SparkQA removed a comment on pull request #33309: URL: https://github.com/apache/spark/pull/33309#issuecomment-879271462 **[Test build #140977 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140977/testReport)** for PR 33309 at commit [`db1d67c`](https://gi

[GitHub] [spark] SparkQA commented on pull request #33309: [SPARK-36106][SQL][CORE] Label error classes for subset of QueryCompilationErrors

2021-07-13 Thread GitBox
SparkQA commented on pull request #33309: URL: https://github.com/apache/spark/pull/33309#issuecomment-879365340 **[Test build #140977 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/140977/testReport)** for PR 33309 at commit [`db1d67c`](https://github.co

[GitHub] [spark] SparkQA commented on pull request #33327: [SPARK-36109][SS][TEST][3.1] Check data after adding data to topic in KafkaSourceStressSuite

2021-07-13 Thread GitBox
SparkQA commented on pull request #33327: URL: https://github.com/apache/spark/pull/33327#issuecomment-879348471 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45494/

[GitHub] [spark] HeartSaVioR commented on pull request #31989: [SPARK-34891][SS] Introduce state store manager for session window in streaming query

2021-07-13 Thread GitBox
HeartSaVioR commented on pull request #31989: URL: https://github.com/apache/spark/pull/31989#issuecomment-879347792 Thanks all for reviewing and merging! I'll update the next PR. We have two PRs now.

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33309: [SPARK-36106][SQL][CORE] Label error classes for subset of QueryCompilationErrors

2021-07-13 Thread GitBox
AmplabJenkins removed a comment on pull request #33309: URL: https://github.com/apache/spark/pull/33309#issuecomment-879342573 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/45491/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33326: [SPARK-36109][SS][TEST][3.0] Check data after adding data to topic in KafkaSourceStressSuite

2021-07-13 Thread GitBox
AmplabJenkins removed a comment on pull request #33326: URL: https://github.com/apache/spark/pull/33326#issuecomment-879342572 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/45493/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #33324: [SPARK-36093][SQL] RemoveRedundantAliases should not change Command's parameter's expression's name

2021-07-13 Thread GitBox
AmplabJenkins removed a comment on pull request #33324: URL: https://github.com/apache/spark/pull/33324#issuecomment-879342574 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/140974/

[GitHub] [spark] AmplabJenkins commented on pull request #33324: [SPARK-36093][SQL] RemoveRedundantAliases should not change Command's parameter's expression's name

2021-07-13 Thread GitBox
AmplabJenkins commented on pull request #33324: URL: https://github.com/apache/spark/pull/33324#issuecomment-879342574 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/140974/
