[GitHub] [spark] cloud-fan commented on a change in pull request #29318: [SPARK-32509][SQL] Ignore unused DPP True Filter in Canonicalization

2020-07-31 Thread GitBox
cloud-fan commented on a change in pull request #29318: URL: https://github.com/apache/spark/pull/29318#discussion_r463652535 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/DataSourceScanExec.scala ## @@ -608,12 +608,16 @@ case class FileSourceScanExec(

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29318: [SPARK-32509][SQL] Ignore unused DPP True Filter in Canonicalization

2020-07-31 Thread GitBox
AmplabJenkins removed a comment on pull request #29318: URL: https://github.com/apache/spark/pull/29318#issuecomment-667088534 Can one of the admins verify this patch? This is an automated message from the Apache Git

[GitHub] [spark] cloud-fan commented on pull request #29318: [SPARK-32509][SQL] Ignore unused DPP True Filter in Canonicalization

2020-07-31 Thread GitBox
cloud-fan commented on pull request #29318: URL: https://github.com/apache/spark/pull/29318#issuecomment-667153397 ok to test This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] rahij commented on pull request #29309: [SPARK-29886][SQL] Add support for satisfying HashClusteredDistribution by DataSourceV2 implementations

2020-07-31 Thread GitBox
rahij commented on pull request #29309: URL: https://github.com/apache/spark/pull/29309#issuecomment-667152493 @cloud-fan would you be the right person to review this? This is an automated message from the Apache Git

[GitHub] [spark] cloud-fan edited a comment on pull request #28617: [SPARK-31694][SQL] Add SupportsPartitions APIs on DataSourceV2

2020-07-31 Thread GitBox
cloud-fan edited a comment on pull request #28617: URL: https://github.com/apache/spark/pull/28617#issuecomment-667150913 Ah I missed that comment. Makes sense. Maybe we should update the javadoc of `Table` to make the abstraction more general. I think it's still better to have real

[GitHub] [spark] cloud-fan commented on pull request #28617: [SPARK-31694][SQL] Add SupportsPartitions APIs on DataSourceV2

2020-07-31 Thread GitBox
cloud-fan commented on pull request #28617: URL: https://github.com/apache/spark/pull/28617#issuecomment-667150913 Ah I missed that comment. Makes sense. Maybe we should update the javadoc of `Table` to make the abstraction more general. I think it's still better to have real

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29271: [SPARK-32467][UI]Avoid encoding URL twice on https redirect

2020-07-31 Thread GitBox
AmplabJenkins removed a comment on pull request #29271: URL: https://github.com/apache/spark/pull/29271#issuecomment-667147847 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] maropu commented on a change in pull request #29317: [SPARK-32510][SQL] Check duplicate nested columns in read from JDBC datasource

2020-07-31 Thread GitBox
maropu commented on a change in pull request #29317: URL: https://github.com/apache/spark/pull/29317#discussion_r463643653 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/jdbc/JdbcUtils.scala ## @@ -819,8 +819,10 @@ object JdbcUtils extends

[GitHub] [spark] AmplabJenkins commented on pull request #29271: [SPARK-32467][UI]Avoid encoding URL twice on https redirect

2020-07-31 Thread GitBox
AmplabJenkins commented on pull request #29271: URL: https://github.com/apache/spark/pull/29271#issuecomment-667147847 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA removed a comment on pull request #29271: [SPARK-32467][UI]Avoid encoding URL twice on https redirect

2020-07-31 Thread GitBox
SparkQA removed a comment on pull request #29271: URL: https://github.com/apache/spark/pull/29271#issuecomment-667077427 **[Test build #126879 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126879/testReport)** for PR 29271 at commit

[GitHub] [spark] SparkQA commented on pull request #29271: [SPARK-32467][UI]Avoid encoding URL twice on https redirect

2020-07-31 Thread GitBox
SparkQA commented on pull request #29271: URL: https://github.com/apache/spark/pull/29271#issuecomment-667146764 **[Test build #126879 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126879/testReport)** for PR 29271 at commit

[GitHub] [spark] cloud-fan edited a comment on pull request #29307: [SPARK-32083][SQL] AQE coalesce should at least return one partition

2020-07-31 Thread GitBox
cloud-fan edited a comment on pull request #29307: URL: https://github.com/apache/spark/pull/29307#issuecomment-667144111 thanks for the review, merging to master! This is an automated message from the Apache Git Service. To

[GitHub] [spark] cloud-fan closed pull request #29307: [SPARK-32083][SQL] AQE coalesce should at least return one partition

2020-07-31 Thread GitBox
cloud-fan closed pull request #29307: URL: https://github.com/apache/spark/pull/29307 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[GitHub] [spark] cloud-fan commented on pull request #29307: [SPARK-32083][SQL] AQE coalesce should at least return one partition

2020-07-31 Thread GitBox
cloud-fan commented on pull request #29307: URL: https://github.com/apache/spark/pull/29307#issuecomment-667144111 thanks for the review, merging to master/3.0! This is an automated message from the Apache Git Service. To

[GitHub] [spark] attilapiros commented on a change in pull request #29211: [SPARK-31197][CORE] Shutdown executor once we are done decommissioning

2020-07-31 Thread GitBox
attilapiros commented on a change in pull request #29211: URL: https://github.com/apache/spark/pull/29211#discussion_r463627379 ## File path: core/src/main/scala/org/apache/spark/storage/BlockManagerDecommissioner.scala ## @@ -327,4 +354,28 @@ private[storage] class

[GitHub] [spark] cloud-fan closed pull request #29315: [SPARK-31894][SS][FOLLOW-UP] Rephrase the config doc

2020-07-31 Thread GitBox
cloud-fan closed pull request #29315: URL: https://github.com/apache/spark/pull/29315 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[GitHub] [spark] cloud-fan commented on pull request #29315: [SPARK-31894][SS][FOLLOW-UP] Rephrase the config doc

2020-07-31 Thread GitBox
cloud-fan commented on pull request #29315: URL: https://github.com/apache/spark/pull/29315#issuecomment-667142325 This just updates the config doc and we don't need to wait for the test. Thanks, merging to master! This is

[GitHub] [spark] cloud-fan commented on a change in pull request #29317: [SPARK-32510][SQL] Check duplicate nested columns in read from JDBC datasource

2020-07-31 Thread GitBox
cloud-fan commented on a change in pull request #29317: URL: https://github.com/apache/spark/pull/29317#discussion_r463636749 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/jdbc/JdbcUtils.scala ## @@ -819,8 +819,10 @@ object JdbcUtils extends

[GitHub] [spark] attilapiros commented on a change in pull request #29211: [SPARK-31197][CORE] Shutdown executor once we are done decommissioning

2020-07-31 Thread GitBox
attilapiros commented on a change in pull request #29211: URL: https://github.com/apache/spark/pull/29211#discussion_r463627379 ## File path: core/src/main/scala/org/apache/spark/storage/BlockManagerDecommissioner.scala ## @@ -327,4 +354,28 @@ private[storage] class

[GitHub] [spark] maropu commented on a change in pull request #29317: [SPARK-32510][SQL] Check duplicate nested columns in read from JDBC datasource

2020-07-31 Thread GitBox
maropu commented on a change in pull request #29317: URL: https://github.com/apache/spark/pull/29317#discussion_r463629913 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/jdbc/JdbcUtils.scala ## @@ -819,8 +819,10 @@ object JdbcUtils extends

[GitHub] [spark] maropu commented on a change in pull request #29317: [SPARK-32510][SQL] Check duplicate nested columns in read from JDBC datasource

2020-07-31 Thread GitBox
maropu commented on a change in pull request #29317: URL: https://github.com/apache/spark/pull/29317#discussion_r463627914 ## File path: sql/core/src/test/scala/org/apache/spark/sql/jdbc/JdbcNestedDataSourceSuite.scala ## @@ -0,0 +1,51 @@ +/* + * Licensed to the Apache

[GitHub] [spark] attilapiros commented on a change in pull request #29211: [SPARK-31197][CORE] Shutdown executor once we are done decommissioning

2020-07-31 Thread GitBox
attilapiros commented on a change in pull request #29211: URL: https://github.com/apache/spark/pull/29211#discussion_r463627379 ## File path: core/src/main/scala/org/apache/spark/storage/BlockManagerDecommissioner.scala ## @@ -327,4 +354,28 @@ private[storage] class

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29299: [SPARK-32490][BUILD] Upgrade netty-all to 4.1.51.Final

2020-07-31 Thread GitBox
AmplabJenkins removed a comment on pull request #29299: URL: https://github.com/apache/spark/pull/29299#issuecomment-667130024 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29299: [SPARK-32490][BUILD] Upgrade netty-all to 4.1.51.Final

2020-07-31 Thread GitBox
AmplabJenkins removed a comment on pull request #29299: URL: https://github.com/apache/spark/pull/29299#issuecomment-667130019 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] AmplabJenkins commented on pull request #29299: [SPARK-32490][BUILD] Upgrade netty-all to 4.1.51.Final

2020-07-31 Thread GitBox
AmplabJenkins commented on pull request #29299: URL: https://github.com/apache/spark/pull/29299#issuecomment-667130019 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA removed a comment on pull request #29299: [SPARK-32490][BUILD] Upgrade netty-all to 4.1.51.Final

2020-07-31 Thread GitBox
SparkQA removed a comment on pull request #29299: URL: https://github.com/apache/spark/pull/29299#issuecomment-667065717 **[Test build #126878 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126878/testReport)** for PR 29299 at commit

[GitHub] [spark] SparkQA commented on pull request #29299: [SPARK-32490][BUILD] Upgrade netty-all to 4.1.51.Final

2020-07-31 Thread GitBox
SparkQA commented on pull request #29299: URL: https://github.com/apache/spark/pull/29299#issuecomment-667129270 **[Test build #126878 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126878/testReport)** for PR 29299 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29315: [SPARK-31894][SS][FOLLOW-UP] Rephrase the config doc

2020-07-31 Thread GitBox
AmplabJenkins removed a comment on pull request #29315: URL: https://github.com/apache/spark/pull/29315#issuecomment-667127024 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] tgravescs commented on pull request #29225: [SPARK-32287][TESTS] Flaky Test: ExecutorAllocationManagerSuite.add executors default profile

2020-07-31 Thread GitBox
tgravescs commented on pull request #29225: URL: https://github.com/apache/spark/pull/29225#issuecomment-667127868 It would be really nice if we could get some timestamps on how long tests were taking This is an automated

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29315: [SPARK-31894][SS][FOLLOW-UP] Rephrase the config doc

2020-07-31 Thread GitBox
AmplabJenkins removed a comment on pull request #29315: URL: https://github.com/apache/spark/pull/29315#issuecomment-667127016 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] SparkQA removed a comment on pull request #29315: [SPARK-31894][SS][FOLLOW-UP] Rephrase the config doc

2020-07-31 Thread GitBox
SparkQA removed a comment on pull request #29315: URL: https://github.com/apache/spark/pull/29315#issuecomment-667081276 **[Test build #126880 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126880/testReport)** for PR 29315 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #29315: [SPARK-31894][SS][FOLLOW-UP] Rephrase the config doc

2020-07-31 Thread GitBox
AmplabJenkins commented on pull request #29315: URL: https://github.com/apache/spark/pull/29315#issuecomment-667127016 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #29315: [SPARK-31894][SS][FOLLOW-UP] Rephrase the config doc

2020-07-31 Thread GitBox
SparkQA commented on pull request #29315: URL: https://github.com/apache/spark/pull/29315#issuecomment-667126528 **[Test build #126880 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126880/testReport)** for PR 29315 at commit

[GitHub] [spark] Udbhav30 commented on pull request #29319: [SPARK-32480] Support insert overwrite to move data to trash

2020-07-31 Thread GitBox
Udbhav30 commented on pull request #29319: URL: https://github.com/apache/spark/pull/29319#issuecomment-667125399 > > Instead of directly deleting the data, we can provide flexibility to move data to the trash and then delete it permanently. > > hm, we need to move data into a trash

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28527: [SPARK-31709][SQL] Proper base path for database/table location when it is a relative path

2020-07-31 Thread GitBox
AmplabJenkins removed a comment on pull request #28527: URL: https://github.com/apache/spark/pull/28527#issuecomment-667124759 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28527: [SPARK-31709][SQL] Proper base path for database/table location when it is a relative path

2020-07-31 Thread GitBox
AmplabJenkins removed a comment on pull request #28527: URL: https://github.com/apache/spark/pull/28527#issuecomment-667124751 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] AmplabJenkins commented on pull request #28527: [SPARK-31709][SQL] Proper base path for database/table location when it is a relative path

2020-07-31 Thread GitBox
AmplabJenkins commented on pull request #28527: URL: https://github.com/apache/spark/pull/28527#issuecomment-667124751 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA removed a comment on pull request #28527: [SPARK-31709][SQL] Proper base path for database/table location when it is a relative path

2020-07-31 Thread GitBox
SparkQA removed a comment on pull request #28527: URL: https://github.com/apache/spark/pull/28527#issuecomment-667033617 **[Test build #126876 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126876/testReport)** for PR 28527 at commit

[GitHub] [spark] SparkQA commented on pull request #28527: [SPARK-31709][SQL] Proper base path for database/table location when it is a relative path

2020-07-31 Thread GitBox
SparkQA commented on pull request #28527: URL: https://github.com/apache/spark/pull/28527#issuecomment-667124268 **[Test build #126876 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126876/testReport)** for PR 28527 at commit

[GitHub] [spark] tgravescs commented on pull request #29225: [SPARK-32287][TESTS] Flaky Test: ExecutorAllocationManagerSuite.add executors default profile

2020-07-31 Thread GitBox
tgravescs commented on pull request #29225: URL: https://github.com/apache/spark/pull/29225#issuecomment-667123030 is there anyway to access the unit test detailed logs from GitHub action? This is an automated message

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29304: [SPARK-32494][SQL] Null Aware Anti Join Optimize Support Multi-Column

2020-07-31 Thread GitBox
AmplabJenkins removed a comment on pull request #29304: URL: https://github.com/apache/spark/pull/29304#issuecomment-667121034 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] SparkQA removed a comment on pull request #29304: [SPARK-32494][SQL] Null Aware Anti Join Optimize Support Multi-Column

2020-07-31 Thread GitBox
SparkQA removed a comment on pull request #29304: URL: https://github.com/apache/spark/pull/29304#issuecomment-666987665 **[Test build #126872 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126872/testReport)** for PR 29304 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #29304: [SPARK-32494][SQL] Null Aware Anti Join Optimize Support Multi-Column

2020-07-31 Thread GitBox
AmplabJenkins commented on pull request #29304: URL: https://github.com/apache/spark/pull/29304#issuecomment-667121034 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #29304: [SPARK-32494][SQL] Null Aware Anti Join Optimize Support Multi-Column

2020-07-31 Thread GitBox
SparkQA commented on pull request #29304: URL: https://github.com/apache/spark/pull/29304#issuecomment-667119930 **[Test build #126872 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126872/testReport)** for PR 29304 at commit

[GitHub] [spark] maropu commented on pull request #29319: [SPARK-32480] Support insert overwrite to move data to trash

2020-07-31 Thread GitBox
maropu commented on pull request #29319: URL: https://github.com/apache/spark/pull/29319#issuecomment-667116847 > Instead of directly deleting the data, we can provide flexibility to move data to the trash and then delete it permanently. hm, we need to move data into a trash even

[GitHub] [spark] HyukjinKwon edited a comment on pull request #29306: [SPARK-32497][INFRA] Installs qpdf package for CRAN check in GitHub Actions

2020-07-31 Thread GitBox
HyukjinKwon edited a comment on pull request #29306: URL: https://github.com/apache/spark/pull/29306#issuecomment-666931393 cc @ScrapCodes and @zhengruifeng FYI. The CRAN check _might_ fail with the error in the PR description with R 4.0.0 in the release image. In that case, `qpdf` will

[GitHub] [spark] HyukjinKwon closed pull request #29279: [SPARK-31418][CORE][FOLLOW-UP][MINOR] Fix log messages to print stage id instead of the object name

2020-07-31 Thread GitBox
HyukjinKwon closed pull request #29279: URL: https://github.com/apache/spark/pull/29279 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [spark] HyukjinKwon commented on pull request #29279: [SPARK-31418][CORE][FOLLOW-UP][MINOR] Fix log messages to print stage id instead of the object name

2020-07-31 Thread GitBox
HyukjinKwon commented on pull request #29279: URL: https://github.com/apache/spark/pull/29279#issuecomment-667112489 Merged to master. This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] AmplabJenkins commented on pull request #29320: [WIP][SPARK-32507][DOCS][PYTHON] Add main package for PySpark documentation

2020-07-31 Thread GitBox
AmplabJenkins commented on pull request #29320: URL: https://github.com/apache/spark/pull/29320#issuecomment-667112295 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29320: [WIP][SPARK-32507][DOCS][PYTHON] Add main package for PySpark documentation

2020-07-31 Thread GitBox
AmplabJenkins removed a comment on pull request #29320: URL: https://github.com/apache/spark/pull/29320#issuecomment-667112295 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] HyukjinKwon closed pull request #29297: [SPARK-32406][SQL][FOLLOWUP] Make RESET fail against static and core configs

2020-07-31 Thread GitBox
HyukjinKwon closed pull request #29297: URL: https://github.com/apache/spark/pull/29297 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29303: [SPARK-32492][SQL] Fulfill missing column meta information COLUMN_SIZE /DECIMAL_DIGITS/NUM_PREC_RADIX/ORDINAL_POSITION for thri

2020-07-31 Thread GitBox
AmplabJenkins removed a comment on pull request #29303: URL: https://github.com/apache/spark/pull/29303#issuecomment-667111632 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29303: [SPARK-32492][SQL] Fulfill missing column meta information COLUMN_SIZE /DECIMAL_DIGITS/NUM_PREC_RADIX/ORDINAL_POSITION for thriftserver

2020-07-31 Thread GitBox
AmplabJenkins commented on pull request #29303: URL: https://github.com/apache/spark/pull/29303#issuecomment-667111632 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #29320: [WIP][SPARK-32507][DOCS][PYTHON] Add main package for PySpark documentation

2020-07-31 Thread GitBox
SparkQA commented on pull request #29320: URL: https://github.com/apache/spark/pull/29320#issuecomment-667111686 **[Test build #126889 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126889/testReport)** for PR 29320 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #29303: [SPARK-32492][SQL] Fulfill missing column meta information COLUMN_SIZE /DECIMAL_DIGITS/NUM_PREC_RADIX/ORDINAL_POSITION for thriftserv

2020-07-31 Thread GitBox
SparkQA removed a comment on pull request #29303: URL: https://github.com/apache/spark/pull/29303#issuecomment-667092676 **[Test build #126887 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126887/testReport)** for PR 29303 at commit

[GitHub] [spark] HyukjinKwon commented on pull request #29297: [SPARK-32406][SQL][FOLLOWUP] Make RESET fail against static and core configs

2020-07-31 Thread GitBox
HyukjinKwon commented on pull request #29297: URL: https://github.com/apache/spark/pull/29297#issuecomment-667111851 Merged to master. This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] maropu commented on a change in pull request #29303: [SPARK-32492][SQL] Fulfill missing column meta information COLUMN_SIZE /DECIMAL_DIGITS/NUM_PREC_RADIX/ORDINAL_POSITION for thrifts

2020-07-31 Thread GitBox
maropu commented on a change in pull request #29303: URL: https://github.com/apache/spark/pull/29303#discussion_r463601005 ## File path: sql/hive-thriftserver/src/main/scala/org/apache/spark/sql/hive/thriftserver/SparkGetColumnsOperation.scala ## @@ -126,12 +124,52 @@

[GitHub] [spark] SparkQA commented on pull request #29303: [SPARK-32492][SQL] Fulfill missing column meta information COLUMN_SIZE /DECIMAL_DIGITS/NUM_PREC_RADIX/ORDINAL_POSITION for thriftserver clien

2020-07-31 Thread GitBox
SparkQA commented on pull request #29303: URL: https://github.com/apache/spark/pull/29303#issuecomment-667111282 **[Test build #126887 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126887/testReport)** for PR 29303 at commit

[GitHub] [spark] HyukjinKwon opened a new pull request #29320: [WIP][SPARK-32507][DOCS][PYTHON] Add main package for PySpark documentation

2020-07-31 Thread GitBox
HyukjinKwon opened a new pull request #29320: URL: https://github.com/apache/spark/pull/29320 ### What changes were proposed in this pull request? This PR proposes to write the main page of PySpark documentation. ### Why are the changes needed? For better usability and

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29307: [SPARK-32083][SQL] AQE coalesce should at least return one partition

2020-07-31 Thread GitBox
AmplabJenkins removed a comment on pull request #29307: URL: https://github.com/apache/spark/pull/29307#issuecomment-667104751 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29307: [SPARK-32083][SQL] AQE coalesce should at least return one partition

2020-07-31 Thread GitBox
AmplabJenkins commented on pull request #29307: URL: https://github.com/apache/spark/pull/29307#issuecomment-667104751 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA removed a comment on pull request #29307: [SPARK-32083][SQL] AQE coalesce should at least return one partition

2020-07-31 Thread GitBox
SparkQA removed a comment on pull request #29307: URL: https://github.com/apache/spark/pull/29307#issuecomment-666968253 **[Test build #126859 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126859/testReport)** for PR 29307 at commit

[GitHub] [spark] SparkQA commented on pull request #29307: [SPARK-32083][SQL] AQE coalesce should at least return one partition

2020-07-31 Thread GitBox
SparkQA commented on pull request #29307: URL: https://github.com/apache/spark/pull/29307#issuecomment-667103802 **[Test build #126859 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126859/testReport)** for PR 29307 at commit

[GitHub] [spark] maropu commented on pull request #29303: [SPARK-32492][SQL] Fulfill missing column meta information COLUMN_SIZE /DECIMAL_DIGITS/NUM_PREC_RADIX/ORDINAL_POSITION for thriftserver client

2020-07-31 Thread GitBox
maropu commented on pull request #29303: URL: https://github.com/apache/spark/pull/29303#issuecomment-667102957 To check if we could fetch the new metadata via `DatabaseMetaData`, could you add tests in `SparkMetadataOperationSuite`, too?

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29317: [SPARK-32510][SQL] Check duplicate nested columns in read from JDBC datasource

2020-07-31 Thread GitBox
AmplabJenkins removed a comment on pull request #29317: URL: https://github.com/apache/spark/pull/29317#issuecomment-667079962 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29146: [SPARK-32257][SQL] Reports explicit errors for invalid usage of SET/RESET command

2020-07-31 Thread GitBox
AmplabJenkins removed a comment on pull request #29146: URL: https://github.com/apache/spark/pull/29146#issuecomment-667102267 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] SparkQA commented on pull request #29317: [SPARK-32510][SQL] Check duplicate nested columns in read from JDBC datasource

2020-07-31 Thread GitBox
SparkQA commented on pull request #29317: URL: https://github.com/apache/spark/pull/29317#issuecomment-667102486 **[Test build #126888 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126888/testReport)** for PR 29317 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #29146: [SPARK-32257][SQL] Reports explicit errors for invalid usage of SET/RESET command

2020-07-31 Thread GitBox
AmplabJenkins commented on pull request #29146: URL: https://github.com/apache/spark/pull/29146#issuecomment-667102267 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #29146: [SPARK-32257][SQL] Reports explicit errors for invalid usage of SET/RESET command

2020-07-31 Thread GitBox
SparkQA commented on pull request #29146: URL: https://github.com/apache/spark/pull/29146#issuecomment-667101365 **[Test build #126875 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126875/testReport)** for PR 29146 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #29146: [SPARK-32257][SQL] Reports explicit errors for invalid usage of SET/RESET command

2020-07-31 Thread GitBox
SparkQA removed a comment on pull request #29146: URL: https://github.com/apache/spark/pull/29146#issuecomment-667032219 **[Test build #126875 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126875/testReport)** for PR 29146 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29318: [SPARK-32509][SQL] Ignore unused DPP True Filter in Canonicalization

2020-07-31 Thread GitBox
AmplabJenkins removed a comment on pull request #29318: URL: https://github.com/apache/spark/pull/29318#issuecomment-667088115 Can one of the admins verify this patch? This is an automated message from the Apache Git

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29319: [SPARK-32480] Support insert overwrite to move data to trash

2020-07-31 Thread GitBox
AmplabJenkins removed a comment on pull request #29319: URL: https://github.com/apache/spark/pull/29319#issuecomment-667094546 Can one of the admins verify this patch? This is an automated message from the Apache Git

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28939: [SPARK-32119][CORE] ExecutorPlugin doesn't work with Standalone Cluster and Kubernetes with --jars

2020-07-31 Thread GitBox
AmplabJenkins removed a comment on pull request #28939: URL: https://github.com/apache/spark/pull/28939#issuecomment-667088071 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29278: [SPARK-32160][CORE][PYSPARK] Add a config to switch allow/disallow to create SparkContext in executors.

2020-07-31 Thread GitBox
AmplabJenkins removed a comment on pull request #29278: URL: https://github.com/apache/spark/pull/29278#issuecomment-667089706 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] SparkQA removed a comment on pull request #29278: [SPARK-32160][CORE][PYSPARK] Add a config to switch allow/disallow to create SparkContext in executors.

2020-07-31 Thread GitBox
SparkQA removed a comment on pull request #29278: URL: https://github.com/apache/spark/pull/29278#issuecomment-666973287 **[Test build #126869 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126869/testReport)** for PR 29278 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #29297: [SPARK-32406][SQL][FOLLOWUP] Make RESET fail against static and core configs

2020-07-31 Thread GitBox
SparkQA removed a comment on pull request #29297: URL: https://github.com/apache/spark/pull/29297#issuecomment-666969420 **[Test build #126868 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126868/testReport)** for PR 29297 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28939: [SPARK-32119][CORE] ExecutorPlugin doesn't work with Standalone Cluster and Kubernetes with --jars

2020-07-31 Thread GitBox
AmplabJenkins removed a comment on pull request #28939: URL: https://github.com/apache/spark/pull/28939#issuecomment-667088060 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] SparkQA removed a comment on pull request #27429: [SPARK-28330][SQL] Support ANSI SQL: result offset clause in query expression

2020-07-31 Thread GitBox
SparkQA removed a comment on pull request #27429: URL: https://github.com/apache/spark/pull/27429#issuecomment-666968408 **[Test build #126863 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126863/testReport)** for PR 27429 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29297: [SPARK-32406][SQL][FOLLOWUP] Make RESET fail against static and core configs

2020-07-31 Thread GitBox
AmplabJenkins removed a comment on pull request #29297: URL: https://github.com/apache/spark/pull/29297#issuecomment-667092309 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29278: [SPARK-32160][CORE][PYSPARK] Add a config to switch allow/disallow to create SparkContext in executors.

2020-07-31 Thread GitBox
AmplabJenkins removed a comment on pull request #29278: URL: https://github.com/apache/spark/pull/29278#issuecomment-667089695 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] AmplabJenkins removed a comment on pull request #27429: [SPARK-28330][SQL] Support ANSI SQL: result offset clause in query expression

2020-07-31 Thread GitBox
AmplabJenkins removed a comment on pull request #27429: URL: https://github.com/apache/spark/pull/27429#issuecomment-667092489 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] Udbhav30 commented on pull request #29319: [SPARK-32480] Support insert overwrite to move data to trash

2020-07-31 Thread GitBox
Udbhav30 commented on pull request #29319: URL: https://github.com/apache/spark/pull/29319#issuecomment-667097470 @dongjoon-hyun can you please review. This is an automated message from the Apache Git Service. To respond to

[GitHub] [spark] AmplabJenkins commented on pull request #29319: [SPARK-32480] Support insert overwrite to move data to trash

2020-07-31 Thread GitBox
AmplabJenkins commented on pull request #29319: URL: https://github.com/apache/spark/pull/29319#issuecomment-667094965 Can one of the admins verify this patch? This is an automated message from the Apache Git Service. To

[GitHub] [spark] AmplabJenkins commented on pull request #29319: [SPARK-32480] Support insert overwrite to move data to trash

2020-07-31 Thread GitBox
AmplabJenkins commented on pull request #29319: URL: https://github.com/apache/spark/pull/29319#issuecomment-667094546 Can one of the admins verify this patch? This is an automated message from the Apache Git Service. To

[GitHub] [spark] Udbhav30 opened a new pull request #29319: [SPARK-32480] Support insert overwrite to move data to trash

2020-07-31 Thread GitBox
Udbhav30 opened a new pull request #29319: URL: https://github.com/apache/spark/pull/29319 ### What changes were proposed in this pull request? Instead of deleting the data, we can move the data to trash. Based on the configuration provided by the user it will be deleted permanently

[GitHub] [spark] SparkQA commented on pull request #29303: [SPARK-32492][SQL] Fulfill missing column meta information COLUMN_SIZE /DECIMAL_DIGITS/NUM_PREC_RADIX/ORDINAL_POSITION for thriftserver clien

2020-07-31 Thread GitBox
SparkQA commented on pull request #29303: URL: https://github.com/apache/spark/pull/29303#issuecomment-667092676 **[Test build #126887 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126887/testReport)** for PR 29303 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #27429: [SPARK-28330][SQL] Support ANSI SQL: result offset clause in query expression

2020-07-31 Thread GitBox
AmplabJenkins commented on pull request #27429: URL: https://github.com/apache/spark/pull/27429#issuecomment-667092489 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #29291: [SPARK-30276][SQL] Support Filter expression allows simultaneous use of DISTINCT

2020-07-31 Thread GitBox
SparkQA commented on pull request #29291: URL: https://github.com/apache/spark/pull/29291#issuecomment-667092474 **[Test build #126886 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126886/testReport)** for PR 29291 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #29297: [SPARK-32406][SQL][FOLLOWUP] Make RESET fail against static and core configs

2020-07-31 Thread GitBox
AmplabJenkins commented on pull request #29297: URL: https://github.com/apache/spark/pull/29297#issuecomment-667092309 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #27429: [SPARK-28330][SQL] Support ANSI SQL: result offset clause in query expression

2020-07-31 Thread GitBox
SparkQA commented on pull request #27429: URL: https://github.com/apache/spark/pull/27429#issuecomment-667091616 **[Test build #126863 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126863/testReport)** for PR 27429 at commit

[GitHub] [spark] prakharjain09 commented on pull request #29318: [SPARK-32509][SQL] Ignore unused DPP True Filter in Canonicalization

2020-07-31 Thread GitBox
prakharjain09 commented on pull request #29318: URL: https://github.com/apache/spark/pull/29318#issuecomment-667091693 cc - @cloud-fan @maryannxue This is an automated message from the Apache Git Service. To respond to the

[GitHub] [spark] prakharjain09 edited a comment on pull request #29318: [SPARK-32509][SQL] Ignore unused DPP True Filter in Canonicalization

2020-07-31 Thread GitBox
prakharjain09 edited a comment on pull request #29318: URL: https://github.com/apache/spark/pull/29318#issuecomment-667091693 cc - @cloud-fan @maryannxue @dongjoon-hyun This is an automated message from the Apache Git

[GitHub] [spark] SparkQA commented on pull request #29297: [SPARK-32406][SQL][FOLLOWUP] Make RESET fail against static and core configs

2020-07-31 Thread GitBox
SparkQA commented on pull request #29297: URL: https://github.com/apache/spark/pull/29297#issuecomment-667091257 **[Test build #126868 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126868/testReport)** for PR 29297 at commit

[GitHub] [spark] MaxGekk commented on pull request #29317: [SPARK-32510][SQL] Check duplicate nested columns in read from JDBC datasource

2020-07-31 Thread GitBox
MaxGekk commented on pull request #29317: URL: https://github.com/apache/spark/pull/29317#issuecomment-667091282 @cloud-fan @HyukjinKwon @maropu Please, review this PR. This is an automated message from the Apache Git

[GitHub] [spark] MaxGekk commented on a change in pull request #29317: [SPARK-32510][SQL] Check duplicate nested columns in read from JDBC datasource

2020-07-31 Thread GitBox
MaxGekk commented on a change in pull request #29317: URL: https://github.com/apache/spark/pull/29317#discussion_r463576239 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/jdbc/JdbcUtils.scala ## @@ -819,8 +819,10 @@ object JdbcUtils extends

[GitHub] [spark] AmplabJenkins commented on pull request #29278: [SPARK-32160][CORE][PYSPARK] Add a config to switch allow/disallow to create SparkContext in executors.

2020-07-31 Thread GitBox
AmplabJenkins commented on pull request #29278: URL: https://github.com/apache/spark/pull/29278#issuecomment-667089695 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #29294: [SPARK-32160][CORE][PYSPARK][3.0] Add a config to switch allow/disallow to create SparkContext in executors.

2020-07-31 Thread GitBox
SparkQA commented on pull request #29294: URL: https://github.com/apache/spark/pull/29294#issuecomment-667089902 **[Test build #126885 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126885/testReport)** for PR 29294 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #29318: [SPARK-32509][SQL] Ignore unused DPP True Filter in Canonicalization

2020-07-31 Thread GitBox
AmplabJenkins commented on pull request #29318: URL: https://github.com/apache/spark/pull/29318#issuecomment-667088534 Can one of the admins verify this patch? This is an automated message from the Apache Git Service. To

[GitHub] [spark] SparkQA commented on pull request #28617: [SPARK-31694][SQL] Add SupportsPartitions APIs on DataSourceV2

2020-07-31 Thread GitBox
SparkQA commented on pull request #28617: URL: https://github.com/apache/spark/pull/28617#issuecomment-667088326 **[Test build #126884 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126884/testReport)** for PR 28617 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #28939: [SPARK-32119][CORE] ExecutorPlugin doesn't work with Standalone Cluster and Kubernetes with --jars

2020-07-31 Thread GitBox
AmplabJenkins commented on pull request #28939: URL: https://github.com/apache/spark/pull/28939#issuecomment-667088060 This is an automated message from the Apache Git Service. To respond to the message, please log on to

<    1   2   3   4   5   6   7   8   >