[GitHub] [spark] SparkQA removed a comment on pull request #29333: [WIP][SPARK-32357][INFRA] Publish failed and succeeded test reports in GitHub Actions

2020-08-04 Thread GitBox
SparkQA removed a comment on pull request #29333: URL: https://github.com/apache/spark/pull/29333#issuecomment-668937610 **[Test build #127073 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127073/testReport)** for PR 29333 at commit [`2688f21`](https://gi

[GitHub] [spark] SparkQA commented on pull request #29333: [WIP][SPARK-32357][INFRA] Publish failed and succeeded test reports in GitHub Actions

2020-08-04 Thread GitBox
SparkQA commented on pull request #29333: URL: https://github.com/apache/spark/pull/29333#issuecomment-668968035 **[Test build #127073 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127073/testReport)** for PR 29333 at commit [`2688f21`](https://github.co

[GitHub] [spark] HyukjinKwon commented on a change in pull request #29349: [SPARK-32528][SQL][TEST] The analyze method should make sure the plan is analyzed

2020-08-04 Thread GitBox
HyukjinKwon commented on a change in pull request #29349: URL: https://github.com/apache/spark/pull/29349#discussion_r465458165 ## File path: sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/analysis/AnalysisSuite.scala ## @@ -47,6 +48,13 @@ import org.apache.spark.sq

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29332: [SPARK-32518][CORE] CoarseGrainedSchedulerBackend.maxNumConcurrentTasks should consider all kinds of resources

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29332: URL: https://github.com/apache/spark/pull/29332#issuecomment-668959724 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29332: [SPARK-32518][CORE] CoarseGrainedSchedulerBackend.maxNumConcurrentTasks should consider all kinds of resources

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29332: URL: https://github.com/apache/spark/pull/29332#issuecomment-668959724 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29332: [SPARK-32518][CORE] CoarseGrainedSchedulerBackend.maxNumConcurrentTasks should consider all kinds of resources

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29332: URL: https://github.com/apache/spark/pull/29332#issuecomment-668845081 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/127

[GitHub] [spark] SparkQA commented on pull request #29332: [SPARK-32518][CORE] CoarseGrainedSchedulerBackend.maxNumConcurrentTasks should consider all kinds of resources

2020-08-04 Thread GitBox
SparkQA commented on pull request #29332: URL: https://github.com/apache/spark/pull/29332#issuecomment-668959450 **[Test build #127074 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127074/testReport)** for PR 29332 at commit [`786b145`](https://github.com

[GitHub] [spark] beliefer commented on pull request #27429: [SPARK-28330][SQL] Support ANSI SQL: result offset clause in query expression

2020-08-04 Thread GitBox
beliefer commented on pull request #27429: URL: https://github.com/apache/spark/pull/27429#issuecomment-668956644 cc @cloud-fan This is an automated message from the Apache Git Service. To respond to the message, please log

[GitHub] [spark] stczwd commented on a change in pull request #29339: [Spark-32512][SQL] add alter table add/drop partition command for datasourcev2

2020-08-04 Thread GitBox
stczwd commented on a change in pull request #29339: URL: https://github.com/apache/spark/pull/29339#discussion_r465425990 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/AlterTableDropPartitionExec.scala ## @@ -0,0 +1,58 @@ +/* + * Licensed

[GitHub] [spark] HyukjinKwon commented on pull request #29333: [WIP][SPARK-32357][INFRA] Publish failed and succeeded test reports in GitHub Actions

2020-08-04 Thread GitBox
HyukjinKwon commented on pull request #29333: URL: https://github.com/apache/spark/pull/29333#issuecomment-668945164 The latest commit above (https://github.com/apache/spark/pull/29333/commits/2688f21f1852b3e6a577fa9292985b346b9bdf6d) contains the problem in terms of forked repos and PRs a

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29333: [WIP][SPARK-32357][INFRA] Publish failed and succeeded test reports in GitHub Actions

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29333: URL: https://github.com/apache/spark/pull/29333#issuecomment-668938127 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29333: [WIP][SPARK-32357][INFRA] Publish failed and succeeded test reports in GitHub Actions

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29333: URL: https://github.com/apache/spark/pull/29333#issuecomment-668938127 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] SparkQA commented on pull request #29333: [WIP][SPARK-32357][INFRA] Publish failed and succeeded test reports in GitHub Actions

2020-08-04 Thread GitBox
SparkQA commented on pull request #29333: URL: https://github.com/apache/spark/pull/29333#issuecomment-668937610 **[Test build #127073 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127073/testReport)** for PR 29333 at commit [`2688f21`](https://github.com

[GitHub] [spark] HyukjinKwon commented on pull request #29333: [WIP][SPARK-32357][INFRA] Publish failed and succeeded test reports in GitHub Actions

2020-08-04 Thread GitBox
HyukjinKwon commented on pull request #29333: URL: https://github.com/apache/spark/pull/29333#issuecomment-668935709 I am making some changes to demonstrate the problem in terms of the fork and PRs. Please ignore the changes made from now on. I will switch back from the draft later when th

[GitHub] [spark] HyukjinKwon closed pull request #29320: [SPARK-32507][DOCS][PYTHON] Add main page for PySpark documentation

2020-08-04 Thread GitBox
HyukjinKwon closed pull request #29320: URL: https://github.com/apache/spark/pull/29320 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[GitHub] [spark] HyukjinKwon commented on pull request #29320: [SPARK-32507][DOCS][PYTHON] Add main page for PySpark documentation

2020-08-04 Thread GitBox
HyukjinKwon commented on pull request #29320: URL: https://github.com/apache/spark/pull/29320#issuecomment-668934219 Thank you @viirya for approaching this. I am merging this to master. This is an automated message from the A

[GitHub] [spark] HyukjinKwon commented on pull request #29333: [WIP][SPARK-32357][INFRA] Publish failed and succeeded test reports in GitHub Actions

2020-08-04 Thread GitBox
HyukjinKwon commented on pull request #29333: URL: https://github.com/apache/spark/pull/29333#issuecomment-668933522 Sure, let me take a closer look for that approach. In worst case, we might have to drop this and go back to the original @viirya's approach at #29169. -

[GitHub] [spark] wangshisan commented on pull request #29266: [SPARK-32464][SQL] Support skew handling on join that has one side wi…

2020-08-04 Thread GitBox
wangshisan commented on pull request #29266: URL: https://github.com/apache/spark/pull/29266#issuecomment-668932395 > this is with AQE? if so can we please add that to description and it might be nice to describe approach taken to handle it in description as well. Added. --

[GitHub] [spark] wangshisan commented on a change in pull request #29266: [SPARK-32464][SQL] Support skew handling on join that has one side wi…

2020-08-04 Thread GitBox
wangshisan commented on a change in pull request #29266: URL: https://github.com/apache/spark/pull/29266#discussion_r465428028 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/OptimizeSkewedJoin.scala ## @@ -250,6 +251,85 @@ case class OptimizeSkew

[GitHub] [spark] wangshisan edited a comment on pull request #29266: [SPARK-32464][SQL] Support skew handling on join that has one side wi…

2020-08-04 Thread GitBox
wangshisan edited a comment on pull request #29266: URL: https://github.com/apache/spark/pull/29266#issuecomment-668926319 > Yea I'm also wondering the approach here. The skew join handling needs to split the skew side, and repeat the other side. I don't think we can split the buckets of b

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29339: [Spark-32512][SQL][WIP] add alter table add/drop partition command for datasourcev2

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29339: URL: https://github.com/apache/spark/pull/29339#issuecomment-668927448 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29339: [Spark-32512][SQL][WIP] add alter table add/drop partition command for datasourcev2

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29339: URL: https://github.com/apache/spark/pull/29339#issuecomment-668927448 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] stczwd commented on a change in pull request #29339: [Spark-32512][SQL][WIP] add alter table add/drop partition command for datasourcev2

2020-08-04 Thread GitBox
stczwd commented on a change in pull request #29339: URL: https://github.com/apache/spark/pull/29339#discussion_r465425990 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/AlterTableDropPartitionExec.scala ## @@ -0,0 +1,58 @@ +/* + * Licensed

[GitHub] [spark] SparkQA commented on pull request #29339: [Spark-32512][SQL][WIP] add alter table add/drop partition command for datasourcev2

2020-08-04 Thread GitBox
SparkQA commented on pull request #29339: URL: https://github.com/apache/spark/pull/29339#issuecomment-668927081 **[Test build #127072 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127072/testReport)** for PR 29339 at commit [`61cae52`](https://github.com

[GitHub] [spark] wangshisan commented on pull request #29266: [SPARK-32464][SQL] Support skew handling on join that has one side wi…

2020-08-04 Thread GitBox
wangshisan commented on pull request #29266: URL: https://github.com/apache/spark/pull/29266#issuecomment-668926319 > Yea I'm also wondering the approach here. The skew join handling needs to split the skew side, and repeat the other side. I don't think we can split the buckets of bucketed

[GitHub] [spark] wangshisan commented on a change in pull request #29266: [SPARK-32464][SQL] Support skew handling on join that has one side wi…

2020-08-04 Thread GitBox
wangshisan commented on a change in pull request #29266: URL: https://github.com/apache/spark/pull/29266#discussion_r465422329 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/OptimizeSkewedJoin.scala ## @@ -250,6 +251,85 @@ case class OptimizeSkew

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29342: [SPARK-32399][SQL] Full outer shuffled hash join

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29342: URL: https://github.com/apache/spark/pull/29342#issuecomment-668916700 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29342: [SPARK-32399][SQL] Full outer shuffled hash join

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29342: URL: https://github.com/apache/spark/pull/29342#issuecomment-668916700 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] SparkQA removed a comment on pull request #29342: [SPARK-32399][SQL] Full outer shuffled hash join

2020-08-04 Thread GitBox
SparkQA removed a comment on pull request #29342: URL: https://github.com/apache/spark/pull/29342#issuecomment-668826573 **[Test build #127065 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127065/testReport)** for PR 29342 at commit [`01f1f04`](https://gi

[GitHub] [spark] SparkQA commented on pull request #29342: [SPARK-32399][SQL] Full outer shuffled hash join

2020-08-04 Thread GitBox
SparkQA commented on pull request #29342: URL: https://github.com/apache/spark/pull/29342#issuecomment-668916087 **[Test build #127065 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127065/testReport)** for PR 29342 at commit [`01f1f04`](https://github.co

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29353: [Spark-32532][SQL] Improve ORC read/write performance on nested structs and array of structs

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29353: URL: https://github.com/apache/spark/pull/29353#issuecomment-668914789 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29353: [Spark-32532][SQL] Improve ORC read/write performance on nested structs and array of structs

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29353: URL: https://github.com/apache/spark/pull/29353#issuecomment-668914789 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] SparkQA commented on pull request #29353: [Spark-32532][SQL] Improve ORC read/write performance on nested structs and array of structs

2020-08-04 Thread GitBox
SparkQA commented on pull request #29353: URL: https://github.com/apache/spark/pull/29353#issuecomment-668914387 **[Test build #127071 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127071/testReport)** for PR 29353 at commit [`13af454`](https://github.com

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29353: [Spark-32532][SQL] Improve ORC read/write performance on nested structs and array of structs

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29353: URL: https://github.com/apache/spark/pull/29353#issuecomment-668823788 Can one of the admins verify this patch? This is an automated message from the Apache Git Service.

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29352: [SPARK-32531][SQL][TEST] Add benchmarks for nested structs and arrays for different file formats

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29352: URL: https://github.com/apache/spark/pull/29352#issuecomment-668912120 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] viirya commented on pull request #29353: [Spark-32532][SQL] Improve ORC read/write performance on nested structs and array of structs

2020-08-04 Thread GitBox
viirya commented on pull request #29353: URL: https://github.com/apache/spark/pull/29353#issuecomment-668912282 ok to test This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins commented on pull request #29352: [SPARK-32531][SQL][TEST] Add benchmarks for nested structs and arrays for different file formats

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29352: URL: https://github.com/apache/spark/pull/29352#issuecomment-668912120 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] SparkQA commented on pull request #29352: [SPARK-32531][SQL][TEST] Add benchmarks for nested structs and arrays for different file formats

2020-08-04 Thread GitBox
SparkQA commented on pull request #29352: URL: https://github.com/apache/spark/pull/29352#issuecomment-668911605 **[Test build #127070 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127070/testReport)** for PR 29352 at commit [`b10f1fc`](https://github.com

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29352: [SPARK-32531][SQL][TEST] Add benchmarks for nested structs and arrays for different file formats

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29352: URL: https://github.com/apache/spark/pull/29352#issuecomment-668813824 Can one of the admins verify this patch? This is an automated message from the Apache Git Service.

[GitHub] [spark] viirya commented on pull request #29352: [SPARK-32531][SQL][TEST] Add benchmarks for nested structs and arrays for different file formats

2020-08-04 Thread GitBox
viirya commented on pull request #29352: URL: https://github.com/apache/spark/pull/29352#issuecomment-668910801 ok to test This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins commented on pull request #29211: [SPARK-31197][CORE] Shutdown executor once we are done decommissioning

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29211: URL: https://github.com/apache/spark/pull/29211#issuecomment-668909646 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29211: [SPARK-31197][CORE] Shutdown executor once we are done decommissioning

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29211: URL: https://github.com/apache/spark/pull/29211#issuecomment-668909646 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] SparkQA removed a comment on pull request #29211: [SPARK-31197][CORE] Shutdown executor once we are done decommissioning

2020-08-04 Thread GitBox
SparkQA removed a comment on pull request #29211: URL: https://github.com/apache/spark/pull/29211#issuecomment-668858317 **[Test build #127067 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127067/testReport)** for PR 29211 at commit [`55a7f1f`](https://gi

[GitHub] [spark] SparkQA commented on pull request #29211: [SPARK-31197][CORE] Shutdown executor once we are done decommissioning

2020-08-04 Thread GitBox
SparkQA commented on pull request #29211: URL: https://github.com/apache/spark/pull/29211#issuecomment-668908957 **[Test build #127067 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127067/testReport)** for PR 29211 at commit [`55a7f1f`](https://github.co

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29304: [SPARK-32494][SQL] Null Aware Anti Join Optimize Support Multi-Column

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29304: URL: https://github.com/apache/spark/pull/29304#issuecomment-668901692 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/127

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29304: [SPARK-32494][SQL] Null Aware Anti Join Optimize Support Multi-Column

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29304: URL: https://github.com/apache/spark/pull/29304#issuecomment-668901687 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To r

[GitHub] [spark] SparkQA removed a comment on pull request #29304: [SPARK-32494][SQL] Null Aware Anti Join Optimize Support Multi-Column

2020-08-04 Thread GitBox
SparkQA removed a comment on pull request #29304: URL: https://github.com/apache/spark/pull/29304#issuecomment-668866753 **[Test build #127068 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127068/testReport)** for PR 29304 at commit [`054581a`](https://gi

[GitHub] [spark] AmplabJenkins commented on pull request #29304: [SPARK-32494][SQL] Null Aware Anti Join Optimize Support Multi-Column

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29304: URL: https://github.com/apache/spark/pull/29304#issuecomment-668901687 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] SparkQA commented on pull request #29304: [SPARK-32494][SQL] Null Aware Anti Join Optimize Support Multi-Column

2020-08-04 Thread GitBox
SparkQA commented on pull request #29304: URL: https://github.com/apache/spark/pull/29304#issuecomment-668901595 **[Test build #127068 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127068/testReport)** for PR 29304 at commit [`054581a`](https://github.co

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29311: [SPARK-32501][SQL] Convert null to "null" in structs, maps and arrays while casting to strings

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29311: URL: https://github.com/apache/spark/pull/29311#issuecomment-668887461 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29311: [SPARK-32501][SQL] Convert null to "null" in structs, maps and arrays while casting to strings

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29311: URL: https://github.com/apache/spark/pull/29311#issuecomment-668887461 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] SparkQA removed a comment on pull request #29311: [SPARK-32501][SQL] Convert null to "null" in structs, maps and arrays while casting to strings

2020-08-04 Thread GitBox
SparkQA removed a comment on pull request #29311: URL: https://github.com/apache/spark/pull/29311#issuecomment-66878 **[Test build #127064 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127064/testReport)** for PR 29311 at commit [`a686b99`](https://gi

[GitHub] [spark] SparkQA commented on pull request #29311: [SPARK-32501][SQL] Convert null to "null" in structs, maps and arrays while casting to strings

2020-08-04 Thread GitBox
SparkQA commented on pull request #29311: URL: https://github.com/apache/spark/pull/29311#issuecomment-668886701 **[Test build #127064 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127064/testReport)** for PR 29311 at commit [`a686b99`](https://github.co

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29322: [SPARK-32511][SQL] Add dropFields method to Column class

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29322: URL: https://github.com/apache/spark/pull/29322#issuecomment-668885433 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29322: [SPARK-32511][SQL] Add dropFields method to Column class

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29322: URL: https://github.com/apache/spark/pull/29322#issuecomment-668885433 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] SparkQA removed a comment on pull request #29322: [SPARK-32511][SQL] Add dropFields method to Column class

2020-08-04 Thread GitBox
SparkQA removed a comment on pull request #29322: URL: https://github.com/apache/spark/pull/29322#issuecomment-668786682 **[Test build #127063 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127063/testReport)** for PR 29322 at commit [`948fc9c`](https://gi

[GitHub] [spark] SparkQA commented on pull request #29322: [SPARK-32511][SQL] Add dropFields method to Column class

2020-08-04 Thread GitBox
SparkQA commented on pull request #29322: URL: https://github.com/apache/spark/pull/29322#issuecomment-668884775 **[Test build #127063 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127063/testReport)** for PR 29322 at commit [`948fc9c`](https://github.co

[GitHub] [spark] viirya commented on pull request #29326: [WIP][SPARK-32502][BUILD] Upgrade Guava to 27.0-jre and Hadoop to 3.2.1

2020-08-04 Thread GitBox
viirya commented on pull request #29326: URL: https://github.com/apache/spark/pull/29326#issuecomment-668878065 I did some tests. Few changes are required to pass the failed Hive tests: 1. Shading Guava at hive-exec packaging and a few code changes to hive-common and hive-exec regard

[GitHub] [spark] leanken edited a comment on pull request #29304: [SPARK-32494][SQL] Null Aware Anti Join Optimize Support Multi-Column

2020-08-04 Thread GitBox
leanken edited a comment on pull request #29304: URL: https://github.com/apache/spark/pull/29304#issuecomment-668873447 @agrawaldevesh already pushed the InvertedIndex version POC. and gather some test result on TPCH 1TB Q16 It is indeed causing performance regression for single column c

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29355: [SPARK-31419][SQL][DOCS][FOLLOW-UP]Complete the documentation for Table-valued Function

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29355: URL: https://github.com/apache/spark/pull/29355#issuecomment-668873379 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] leanken commented on pull request #29304: [SPARK-32494][SQL] Null Aware Anti Join Optimize Support Multi-Column

2020-08-04 Thread GitBox
leanken commented on pull request #29304: URL: https://github.com/apache/spark/pull/29304#issuecomment-668873447 @agrawaldevesh already pushed the InvertedIndex version POC. and gather some test result on TPCH 1TB Q16 It is indeed causing performance regression for single column case, as

[GitHub] [spark] SparkQA removed a comment on pull request #29355: [SPARK-31419][SQL][DOCS][FOLLOW-UP]Complete the documentation for Table-valued Function

2020-08-04 Thread GitBox
SparkQA removed a comment on pull request #29355: URL: https://github.com/apache/spark/pull/29355#issuecomment-668869807 **[Test build #127069 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127069/testReport)** for PR 29355 at commit [`fda6ecf`](https://gi

[GitHub] [spark] AmplabJenkins commented on pull request #29355: [SPARK-31419][SQL][DOCS][FOLLOW-UP]Complete the documentation for Table-valued Function

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29355: URL: https://github.com/apache/spark/pull/29355#issuecomment-668873379 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] leanken closed pull request #29304: [SPARK-32494][SQL] Null Aware Anti Join Optimize Support Multi-Column

2020-08-04 Thread GitBox
leanken closed pull request #29304: URL: https://github.com/apache/spark/pull/29304 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [spark] SparkQA commented on pull request #29355: [SPARK-31419][SQL][DOCS][FOLLOW-UP]Complete the documentation for Table-valued Function

2020-08-04 Thread GitBox
SparkQA commented on pull request #29355: URL: https://github.com/apache/spark/pull/29355#issuecomment-668873262 **[Test build #127069 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127069/testReport)** for PR 29355 at commit [`fda6ecf`](https://github.co

[GitHub] [spark] huaxingao commented on pull request #29355: [SPARK-31419][SQL][DOCS][FOLLOW-UP]Complete the documentation for Table-valued Function

2020-08-04 Thread GitBox
huaxingao commented on pull request #29355: URL: https://github.com/apache/spark/pull/29355#issuecomment-668870989 cc @maropu This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] huaxingao commented on a change in pull request #29355: [SPARK-31419][SQL][DOCS][FOLLOW-UP]Complete the documentation for Table-valued Function

2020-08-04 Thread GitBox
huaxingao commented on a change in pull request #29355: URL: https://github.com/apache/spark/pull/29355#discussion_r465379539 ## File path: docs/sql-ref-syntax-qry-select-tvf.md ## @@ -21,25 +21,7 @@ license: | ### Description -A table-valued function (TVF) is a function t

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29355: [SPARK-31419][SQL][DOCS][FOLLOW-UP]Complete the documentation for Table-valued Function

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29355: URL: https://github.com/apache/spark/pull/29355#issuecomment-668870354 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29355: [SPARK-31419][SQL][DOCS][FOLLOW-UP]Complete the documentation for Table-valued Function

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29355: URL: https://github.com/apache/spark/pull/29355#issuecomment-668870354 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29354: [WIP][Spark-32533][SQL] Improve Avro read/write performance on nested structs and array of structs

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29354: URL: https://github.com/apache/spark/pull/29354#issuecomment-668869629 Can one of the admins verify this patch? This is an automated message from the Apache Git Service.

[GitHub] [spark] AmplabJenkins commented on pull request #29354: [WIP][Spark-32533][SQL] Improve Avro read/write performance on nested structs and array of structs

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29354: URL: https://github.com/apache/spark/pull/29354#issuecomment-668870164 Can one of the admins verify this patch? This is an automated message from the Apache Git Service. To resp

[GitHub] [spark] SparkQA commented on pull request #29355: [SPARK-31419][SQL][DOCS][FOLLOW-UP]Complete the documentation for Table-valued Function

2020-08-04 Thread GitBox
SparkQA commented on pull request #29355: URL: https://github.com/apache/spark/pull/29355#issuecomment-668869807 **[Test build #127069 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127069/testReport)** for PR 29355 at commit [`fda6ecf`](https://github.com

[GitHub] [spark] AmplabJenkins commented on pull request #29354: [WIP][Spark-32533][SQL] Improve Avro read/write performance on nested structs and array of structs

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29354: URL: https://github.com/apache/spark/pull/29354#issuecomment-668869629 Can one of the admins verify this patch? This is an automated message from the Apache Git Service. To resp

[GitHub] [spark] msamirkhan commented on pull request #29354: [WIP][Spark-32533][SQL] Improve Avro read/write performance on nested structs and array of structs

2020-08-04 Thread GitBox
msamirkhan commented on pull request #29354: URL: https://github.com/apache/spark/pull/29354#issuecomment-668868466 Set as WIP because I had originally made the changes on branch-3.0 and there were conflicts when I cherry-picked them on to master. So running tests and making sure I resolve

[GitHub] [spark] huaxingao opened a new pull request #29355: [SPARK-31419][SQL][DOCS][FOLLOW-UP]Complete the documentation for Table-valued Function

2020-08-04 Thread GitBox
huaxingao opened a new pull request #29355: URL: https://github.com/apache/spark/pull/29355 # What changes were proposed in this pull request? There are two types of TVF. We only documented one type. Adding the doc for the 2nd type. ### Why are the changes needed? complete

[GitHub] [spark] msamirkhan opened a new pull request #29354: [WIP][Spark-32533][SQL] Improve Avro read/write performance on nested structs and array of structs

2020-08-04 Thread GitBox
msamirkhan opened a new pull request #29354: URL: https://github.com/apache/spark/pull/29354 ### What changes were proposed in this pull request? Changes to Reading/Writing Avro file format. There are 8 commits in total. The first 3 are simpler changes to AvroSerializer and AvroD

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29304: [SPARK-32494][SQL] Null Aware Anti Join Optimize Support Multi-Column

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29304: URL: https://github.com/apache/spark/pull/29304#issuecomment-668867316 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29304: [SPARK-32494][SQL] Null Aware Anti Join Optimize Support Multi-Column

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29304: URL: https://github.com/apache/spark/pull/29304#issuecomment-668867316 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] SparkQA commented on pull request #29304: [SPARK-32494][SQL] Null Aware Anti Join Optimize Support Multi-Column

2020-08-04 Thread GitBox
SparkQA commented on pull request #29304: URL: https://github.com/apache/spark/pull/29304#issuecomment-668866753 **[Test build #127068 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127068/testReport)** for PR 29304 at commit [`054581a`](https://github.com

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29347: [WIP][SPARK-32492][SQL][FOLLOWUP][test-maven] Fix jenkins maven jobs

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29347: URL: https://github.com/apache/spark/pull/29347#issuecomment-668859736 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/127

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29347: [WIP][SPARK-32492][SQL][FOLLOWUP][test-maven] Fix jenkins maven jobs

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29347: URL: https://github.com/apache/spark/pull/29347#issuecomment-668859733 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To r

[GitHub] [spark] AmplabJenkins commented on pull request #29347: [WIP][SPARK-32492][SQL][FOLLOWUP][test-maven] Fix jenkins maven jobs

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29347: URL: https://github.com/apache/spark/pull/29347#issuecomment-668859733 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] SparkQA removed a comment on pull request #29347: [WIP][SPARK-32492][SQL][FOLLOWUP][test-maven] Fix jenkins maven jobs

2020-08-04 Thread GitBox
SparkQA removed a comment on pull request #29347: URL: https://github.com/apache/spark/pull/29347#issuecomment-668670956 **[Test build #127057 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127057/testReport)** for PR 29347 at commit [`b4a5fec`](https://gi

[GitHub] [spark] SparkQA commented on pull request #29347: [WIP][SPARK-32492][SQL][FOLLOWUP][test-maven] Fix jenkins maven jobs

2020-08-04 Thread GitBox
SparkQA commented on pull request #29347: URL: https://github.com/apache/spark/pull/29347#issuecomment-668859277 **[Test build #127057 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127057/testReport)** for PR 29347 at commit [`b4a5fec`](https://github.co

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29211: [SPARK-31197][CORE] Shutdown executor once we are done decommissioning

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29211: URL: https://github.com/apache/spark/pull/29211#issuecomment-668858869 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29211: [SPARK-31197][CORE] Shutdown executor once we are done decommissioning

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29211: URL: https://github.com/apache/spark/pull/29211#issuecomment-668858869 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] SparkQA commented on pull request #29211: [SPARK-31197][CORE] Shutdown executor once we are done decommissioning

2020-08-04 Thread GitBox
SparkQA commented on pull request #29211: URL: https://github.com/apache/spark/pull/29211#issuecomment-668858317 **[Test build #127067 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127067/testReport)** for PR 29211 at commit [`55a7f1f`](https://github.com

[GitHub] [spark] viirya commented on pull request #28761: [SPARK-25557][SQL][test-hive1.2] Nested column predicate pushdown for ORC

2020-08-04 Thread GitBox
viirya commented on pull request #28761: URL: https://github.com/apache/spark/pull/28761#issuecomment-668853324 The `java.lang.NoClassDefFoundError` looks weird. This PR doesn't change dependencies. Are we sure test-hive1.2 is ok to build in current master? ---

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28761: [SPARK-25557][SQL][test-hive1.2] Nested column predicate pushdown for ORC

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #28761: URL: https://github.com/apache/spark/pull/28761#issuecomment-668852729 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/127

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28761: [SPARK-25557][SQL][test-hive1.2] Nested column predicate pushdown for ORC

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #28761: URL: https://github.com/apache/spark/pull/28761#issuecomment-668852723 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To r

[GitHub] [spark] SparkQA removed a comment on pull request #28761: [SPARK-25557][SQL][test-hive1.2] Nested column predicate pushdown for ORC

2020-08-04 Thread GitBox
SparkQA removed a comment on pull request #28761: URL: https://github.com/apache/spark/pull/28761#issuecomment-668829716 **[Test build #127066 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127066/testReport)** for PR 28761 at commit [`0747fcd`](https://gi

[GitHub] [spark] AmplabJenkins commented on pull request #28761: [SPARK-25557][SQL][test-hive1.2] Nested column predicate pushdown for ORC

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #28761: URL: https://github.com/apache/spark/pull/28761#issuecomment-668852723 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] SparkQA commented on pull request #28761: [SPARK-25557][SQL][test-hive1.2] Nested column predicate pushdown for ORC

2020-08-04 Thread GitBox
SparkQA commented on pull request #28761: URL: https://github.com/apache/spark/pull/28761#issuecomment-668852579 **[Test build #127066 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127066/testReport)** for PR 28761 at commit [`0747fcd`](https://github.co

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29347: [WIP][SPARK-32492][SQL][FOLLOWUP][test-maven] Fix jenkins maven jobs

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29347: URL: https://github.com/apache/spark/pull/29347#issuecomment-668850466 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/127

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29347: [WIP][SPARK-32492][SQL][FOLLOWUP][test-maven] Fix jenkins maven jobs

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29347: URL: https://github.com/apache/spark/pull/29347#issuecomment-668850458 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To r

[GitHub] [spark] AmplabJenkins commented on pull request #29347: [WIP][SPARK-32492][SQL][FOLLOWUP][test-maven] Fix jenkins maven jobs

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29347: URL: https://github.com/apache/spark/pull/29347#issuecomment-668850458 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] SparkQA removed a comment on pull request #29347: [WIP][SPARK-32492][SQL][FOLLOWUP][test-maven] Fix jenkins maven jobs

2020-08-04 Thread GitBox
SparkQA removed a comment on pull request #29347: URL: https://github.com/apache/spark/pull/29347#issuecomment-668679221 **[Test build #127058 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127058/testReport)** for PR 29347 at commit [`f780d8d`](https://gi

[GitHub] [spark] SparkQA commented on pull request #29347: [WIP][SPARK-32492][SQL][FOLLOWUP][test-maven] Fix jenkins maven jobs

2020-08-04 Thread GitBox
SparkQA commented on pull request #29347: URL: https://github.com/apache/spark/pull/29347#issuecomment-668849940 **[Test build #127058 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127058/testReport)** for PR 29347 at commit [`f780d8d`](https://github.co

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29332: [SPARK-32518][CORE] CoarseGrainedSchedulerBackend.maxNumConcurrentTasks should consider all kinds of resources

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29332: URL: https://github.com/apache/spark/pull/29332#issuecomment-668845074 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To r

[GitHub] [spark] AmplabJenkins commented on pull request #29332: [SPARK-32518][CORE] CoarseGrainedSchedulerBackend.maxNumConcurrentTasks should consider all kinds of resources

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29332: URL: https://github.com/apache/spark/pull/29332#issuecomment-668845074 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

<    1   2   3   4   5   6   >