[GitHub] [spark] SparkQA commented on pull request #31485: [SPARK-SQL][34137] Update suquery's stats when build LogicalPlan's stats

2021-02-05 Thread GitBox
SparkQA commented on pull request #31485: URL: https://github.com/apache/spark/pull/31485#issuecomment-773875429 **[Test build #134921 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134921/testReport)** for PR 31485 at commit

[GitHub] [spark] SparkQA commented on pull request #31245: [SPARK-34157][SQL] Unify output of SHOW TABLES and pass output attributes properly

2021-02-05 Thread GitBox
SparkQA commented on pull request #31245: URL: https://github.com/apache/spark/pull/31245#issuecomment-773911014 **[Test build #134930 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134930/testReport)** for PR 31245 at commit

[GitHub] [spark] cloud-fan commented on a change in pull request #31245: [SPARK-34157][SQL] Unify output of SHOW TABLES and pass output attributes properly

2021-02-05 Thread GitBox
cloud-fan commented on a change in pull request #31245: URL: https://github.com/apache/spark/pull/31245#discussion_r570794178 ## File path: sql/core/src/test/scala/org/apache/spark/sql/execution/command/v1/ShowTablesSuite.scala ## @@ -119,6 +102,34 @@ trait

[GitHub] [spark] SparkQA removed a comment on pull request #31481: [SQL][MINOR][TEST][3.1] Re-enable some DS v2 char/varchar test

2021-02-05 Thread GitBox
SparkQA removed a comment on pull request #31481: URL: https://github.com/apache/spark/pull/31481#issuecomment-773796124 **[Test build #134909 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134909/testReport)** for PR 31481 at commit

[GitHub] [spark] SparkQA commented on pull request #31488: [SPARK-34376][SQL] Support regexp as a SQL function

2021-02-05 Thread GitBox
SparkQA commented on pull request #31488: URL: https://github.com/apache/spark/pull/31488#issuecomment-773914040 **[Test build #134931 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134931/testReport)** for PR 31488 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #31413: [SPARK-32985][SQL] Decouple bucket scan and bucket filter pruning for data source v1

2021-02-05 Thread GitBox
SparkQA removed a comment on pull request #31413: URL: https://github.com/apache/spark/pull/31413#issuecomment-773794905 **[Test build #134910 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134910/testReport)** for PR 31413 at commit

[GitHub] [spark] AngersZhuuuu commented on a change in pull request #30957: [SPARK-31937][SQL] Support processing ArrayType/MapType/StructType data using no-serde mode script transform

2021-02-05 Thread GitBox
AngersZh commented on a change in pull request #30957: URL: https://github.com/apache/spark/pull/30957#discussion_r570793559 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/BaseScriptTransformationExec.scala ## @@ -47,7 +47,13 @@ trait

[GitHub] [spark] yaooqinn commented on pull request #31488: [SPARK-34376][SQL] Support regexp as a SQL function

2021-02-05 Thread GitBox
yaooqinn commented on pull request #31488: URL: https://github.com/apache/spark/pull/31488#issuecomment-773910330 cc @cloud-fan @maropu thanks for checking This is an automated message from the Apache Git Service. To respond

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31482: [SPARK-34346][CORE][SQL][3.1] io.file.buffer.size set by spark.buffer.size will override by loading hive-site.xml accidentally

2021-02-05 Thread GitBox
AmplabJenkins removed a comment on pull request #31482: URL: https://github.com/apache/spark/pull/31482#issuecomment-773905147 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134917/

[GitHub] [spark] SparkQA commented on pull request #31245: [SPARK-34157][SQL] Unify output of SHOW TABLES and pass output attributes properly

2021-02-05 Thread GitBox
SparkQA commented on pull request #31245: URL: https://github.com/apache/spark/pull/31245#issuecomment-773908001 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39504/

[GitHub] [spark] SparkQA commented on pull request #29087: [SPARK-28227][SQL] Support projection, aggregate/window functions, and lateral view in the TRANSFORM clause

2021-02-05 Thread GitBox
SparkQA commented on pull request #29087: URL: https://github.com/apache/spark/pull/29087#issuecomment-773907871 **[Test build #134928 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134928/testReport)** for PR 29087 at commit

[GitHub] [spark] beliefer commented on a change in pull request #31245: [SPARK-34157][SQL] Unify output of SHOW TABLES and pass output attributes properly

2021-02-05 Thread GitBox
beliefer commented on a change in pull request #31245: URL: https://github.com/apache/spark/pull/31245#discussion_r570791022 ## File path: docs/sql-migration-guide.md ## @@ -40,6 +40,10 @@ license: | - In Spark 3.2, script transform default FIELD DELIMIT is `\u0001` for no

[GitHub] [spark] cloud-fan commented on a change in pull request #31245: [SPARK-34157][SQL] Unify output of SHOW TABLES and pass output attributes properly

2021-02-05 Thread GitBox
cloud-fan commented on a change in pull request #31245: URL: https://github.com/apache/spark/pull/31245#discussion_r570792389 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala ## @@ -825,22 +825,10 @@ case class DescribeColumnCommand(

[GitHub] [spark] SparkQA commented on pull request #31258: [SPARK-34168] [SQL] Support DPP in AQE when the join is Broadcast hash join at the beginning

2021-02-05 Thread GitBox
SparkQA commented on pull request #31258: URL: https://github.com/apache/spark/pull/31258#issuecomment-773907021 **[Test build #134926 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134926/testReport)** for PR 31258 at commit

[GitHub] [spark] cloud-fan commented on a change in pull request #31245: [SPARK-34157][SQL] Unify output of SHOW TABLES and pass output attributes properly

2021-02-05 Thread GitBox
cloud-fan commented on a change in pull request #31245: URL: https://github.com/apache/spark/pull/31245#discussion_r570789600 ## File path: docs/sql-migration-guide.md ## @@ -40,6 +40,10 @@ license: | - In Spark 3.2, script transform default FIELD DELIMIT is `\u0001` for no

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31471: [SPARK-34355][SQL] Add log and time cost for commit job

2021-02-05 Thread GitBox
AmplabJenkins removed a comment on pull request #31471: URL: https://github.com/apache/spark/pull/31471#issuecomment-773871847 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134906/

[GitHub] [spark] AmplabJenkins commented on pull request #31482: [SPARK-34346][CORE][SQL][3.1] io.file.buffer.size set by spark.buffer.size will override by loading hive-site.xml accidentally may caus

2021-02-05 Thread GitBox
AmplabJenkins commented on pull request #31482: URL: https://github.com/apache/spark/pull/31482#issuecomment-773905147 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134917/

[GitHub] [spark] AmplabJenkins commented on pull request #31471: [SPARK-34355][SQL] Add log and time cost for commit job

2021-02-05 Thread GitBox
AmplabJenkins commented on pull request #31471: URL: https://github.com/apache/spark/pull/31471#issuecomment-773871847 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134906/

[GitHub] [spark] cloud-fan commented on a change in pull request #31245: [SPARK-34157][SQL] Unify output of SHOW TABLES and pass output attributes properly

2021-02-05 Thread GitBox
cloud-fan commented on a change in pull request #31245: URL: https://github.com/apache/spark/pull/31245#discussion_r570789600 ## File path: docs/sql-migration-guide.md ## @@ -40,6 +40,10 @@ license: | - In Spark 3.2, script transform default FIELD DELIMIT is `\u0001` for no

[GitHub] [spark] yaooqinn opened a new pull request #31488: [SPARK-34376][SQL] Support regexp as a SQL function

2021-02-05 Thread GitBox
yaooqinn opened a new pull request #31488: URL: https://github.com/apache/spark/pull/31488 ### What changes were proposed in this pull request? We have equality in `SqlBase.g4` for `RLIKE: 'RLIKE' | 'REGEXP';` We seemed to miss adding` REGEXP` as a SQL function just

[GitHub] [spark] SparkQA commented on pull request #31482: [SPARK-34346][CORE][SQL][3.1] io.file.buffer.size set by spark.buffer.size will override by loading hive-site.xml accidentally may cause perf

2021-02-05 Thread GitBox
SparkQA commented on pull request #31482: URL: https://github.com/apache/spark/pull/31482#issuecomment-773904416 **[Test build #134917 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134917/testReport)** for PR 31482 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #31482: [SPARK-34346][CORE][SQL][3.1] io.file.buffer.size set by spark.buffer.size will override by loading hive-site.xml accidentally may ca

2021-02-05 Thread GitBox
SparkQA removed a comment on pull request #31482: URL: https://github.com/apache/spark/pull/31482#issuecomment-773846744 **[Test build #134917 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134917/testReport)** for PR 31482 at commit

[GitHub] [spark] cloud-fan commented on a change in pull request #31245: [SPARK-34157][SQL] Unify output of SHOW TABLES and pass output attributes properly

2021-02-05 Thread GitBox
cloud-fan commented on a change in pull request #31245: URL: https://github.com/apache/spark/pull/31245#discussion_r570789867 ## File path: docs/sql-migration-guide.md ## @@ -40,6 +40,10 @@ license: | - In Spark 3.2, script transform default FIELD DELIMIT is `\u0001` for no

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31133: [SPARK-26836][SQL] Supporting Avro schema evolution for partitioned Hive tables with "avro.schema.literal"

2021-02-05 Thread GitBox
AmplabJenkins removed a comment on pull request #31133: URL: https://github.com/apache/spark/pull/31133#issuecomment-773898069 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134919/

[GitHub] [spark] SparkQA commented on pull request #31348: [SPARK-34245][CORE] Ensure Master removes executors that failed to send finished state

2021-02-05 Thread GitBox
SparkQA commented on pull request #31348: URL: https://github.com/apache/spark/pull/31348#issuecomment-773899913 **[Test build #134925 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134925/testReport)** for PR 31348 at commit

[GitHub] [spark] SparkQA commented on pull request #31245: [SPARK-34157][SQL] Unify output of SHOW TABLES and pass output attributes properly

2021-02-05 Thread GitBox
SparkQA commented on pull request #31245: URL: https://github.com/apache/spark/pull/31245#issuecomment-773897779 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39504/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31133: [SPARK-26836][SQL] Supporting Avro schema evolution for partitioned Hive tables with "avro.schema.literal"

2021-02-05 Thread GitBox
AmplabJenkins removed a comment on pull request #31133: URL: https://github.com/apache/spark/pull/31133#issuecomment-773897288 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39502/

[GitHub] [spark] AmplabJenkins commented on pull request #31477: [SPARK-34369][SQL][WEBUI] Track number of pairs processed out of Join.

2021-02-05 Thread GitBox
AmplabJenkins commented on pull request #31477: URL: https://github.com/apache/spark/pull/31477#issuecomment-773897291 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134905/

[GitHub] [spark] SparkQA commented on pull request #30869: [SPARK-33865][SQL] When HiveDDL, we need check avro schema too

2021-02-05 Thread GitBox
SparkQA commented on pull request #30869: URL: https://github.com/apache/spark/pull/30869#issuecomment-773900126 **[Test build #134927 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134927/testReport)** for PR 30869 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31471: [SPARK-34355][SQL] Add log and time cost for commit job

2021-02-05 Thread GitBox
AmplabJenkins removed a comment on pull request #31471: URL: https://github.com/apache/spark/pull/31471#issuecomment-773867185 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134899/

[GitHub] [spark] AmplabJenkins commented on pull request #31483: [SPARK-33434][PYTHON][DOCS] Added RuntimeConfig to PySpark docs

2021-02-05 Thread GitBox
AmplabJenkins commented on pull request #31483: URL: https://github.com/apache/spark/pull/31483#issuecomment-773897295 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39503/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31483: [SPARK-33434][PYTHON][DOCS] Added RuntimeConfig to PySpark docs

2021-02-05 Thread GitBox
AmplabJenkins removed a comment on pull request #31483: URL: https://github.com/apache/spark/pull/31483#issuecomment-773897295 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39503/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31477: [SPARK-34369][SQL][WEBUI] Track number of pairs processed out of Join.

2021-02-05 Thread GitBox
AmplabJenkins removed a comment on pull request #31477: URL: https://github.com/apache/spark/pull/31477#issuecomment-773897291 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134905/

[GitHub] [spark] AmplabJenkins commented on pull request #31484: [SPARK-34374][SQL][DSTREAM] Use standard methods to extract keys or values from a Map

2021-02-05 Thread GitBox
AmplabJenkins commented on pull request #31484: URL: https://github.com/apache/spark/pull/31484#issuecomment-773897293 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39499/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31477: [SPARK-34369][SQL][WEBUI] Track number of pairs processed out of Join.

2021-02-05 Thread GitBox
AmplabJenkins removed a comment on pull request #31477: URL: https://github.com/apache/spark/pull/31477#issuecomment-773867180 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39497/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31484: [SPARK-34374][SQL][DSTREAM] Use standard methods to extract keys or values from a Map

2021-02-05 Thread GitBox
AmplabJenkins removed a comment on pull request #31484: URL: https://github.com/apache/spark/pull/31484#issuecomment-773897293 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39499/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30957: [SPARK-31937][SQL] Support processing ArrayType/MapType/StructType data using no-serde mode script transform

2021-02-05 Thread GitBox
AmplabJenkins removed a comment on pull request #30957: URL: https://github.com/apache/spark/pull/30957#issuecomment-773867182 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39498/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31482: [SPARK-34346][CORE][SQL][3.1] io.file.buffer.size set by spark.buffer.size will override by loading hive-site.xml accidentally

2021-02-05 Thread GitBox
AmplabJenkins removed a comment on pull request #31482: URL: https://github.com/apache/spark/pull/31482#issuecomment-773867179 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39500/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31204: [SPARK-26399][WEBUI][CORE] Add new stage-level REST APIs and parameters

2021-02-05 Thread GitBox
AmplabJenkins removed a comment on pull request #31204: URL: https://github.com/apache/spark/pull/31204#issuecomment-773867183 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134911/

[GitHub] [spark] AmplabJenkins commented on pull request #31133: [SPARK-26836][SQL] Supporting Avro schema evolution for partitioned Hive tables with "avro.schema.literal"

2021-02-05 Thread GitBox
AmplabJenkins commented on pull request #31133: URL: https://github.com/apache/spark/pull/31133#issuecomment-773898069 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134919/

[GitHub] [spark] AmplabJenkins commented on pull request #31482: [SPARK-34346][CORE][SQL][3.1] io.file.buffer.size set by spark.buffer.size will override by loading hive-site.xml accidentally may caus

2021-02-05 Thread GitBox
AmplabJenkins commented on pull request #31482: URL: https://github.com/apache/spark/pull/31482#issuecomment-773867179 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39500/

[GitHub] [spark] AmplabJenkins commented on pull request #31477: [SPARK-34369][SQL][WEBUI] Track number of pairs processed out of Join.

2021-02-05 Thread GitBox
AmplabJenkins commented on pull request #31477: URL: https://github.com/apache/spark/pull/31477#issuecomment-773867180 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39497/

[GitHub] [spark] AmplabJenkins commented on pull request #31204: [SPARK-26399][WEBUI][CORE] Add new stage-level REST APIs and parameters

2021-02-05 Thread GitBox
AmplabJenkins commented on pull request #31204: URL: https://github.com/apache/spark/pull/31204#issuecomment-773867183 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134911/

[GitHub] [spark] AmplabJenkins commented on pull request #31472: [SPARK-34356][ML] OVR transform fix potential column conflict

2021-02-05 Thread GitBox
AmplabJenkins commented on pull request #31472: URL: https://github.com/apache/spark/pull/31472#issuecomment-773897290 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39501/

[GitHub] [spark] AmplabJenkins commented on pull request #30957: [SPARK-31937][SQL] Support processing ArrayType/MapType/StructType data using no-serde mode script transform

2021-02-05 Thread GitBox
AmplabJenkins commented on pull request #30957: URL: https://github.com/apache/spark/pull/30957#issuecomment-773867182 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39498/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31472: [SPARK-34356][ML] OVR transform fix potential column conflict

2021-02-05 Thread GitBox
AmplabJenkins removed a comment on pull request #31472: URL: https://github.com/apache/spark/pull/31472#issuecomment-773897290 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39501/

[GitHub] [spark] AmplabJenkins commented on pull request #31133: [SPARK-26836][SQL] Supporting Avro schema evolution for partitioned Hive tables with "avro.schema.literal"

2021-02-05 Thread GitBox
AmplabJenkins commented on pull request #31133: URL: https://github.com/apache/spark/pull/31133#issuecomment-773897288 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39502/

[GitHub] [spark] SparkQA removed a comment on pull request #31133: [SPARK-26836][SQL] Supporting Avro schema evolution for partitioned Hive tables with "avro.schema.literal"

2021-02-05 Thread GitBox
SparkQA removed a comment on pull request #31133: URL: https://github.com/apache/spark/pull/31133#issuecomment-773840372 **[Test build #134919 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134919/testReport)** for PR 31133 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31483: [SPARK-33434][PYTHON][DOCS] Added RuntimeConfig to PySpark docs

2021-02-05 Thread GitBox
AmplabJenkins removed a comment on pull request #31483: URL: https://github.com/apache/spark/pull/31483#issuecomment-773845409 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #31466: [SPARK-34352][SQL] Improve SQLQueryTestSuite so as could run on windows system

2021-02-05 Thread GitBox
AmplabJenkins commented on pull request #31466: URL: https://github.com/apache/spark/pull/31466#issuecomment-773897289 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134907/

[GitHub] [spark] wzhfy edited a comment on pull request #30965: [SPARK-33935][SQL] Fix CBO cost function

2021-02-05 Thread GitBox
wzhfy edited a comment on pull request #30965: URL: https://github.com/apache/spark/pull/30965#issuecomment-773864841 @tanelk Hi, sorry to see this so late. IIRC the reason to use a relative value for rowCount and size, is to normalize them to a similar scale while comparing cost.

[GitHub] [spark] AmplabJenkins commented on pull request #31471: [SPARK-34355][SQL] Add log and time cost for commit job

2021-02-05 Thread GitBox
AmplabJenkins commented on pull request #31471: URL: https://github.com/apache/spark/pull/31471#issuecomment-773867185 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134899/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31466: [SPARK-34352][SQL] Improve SQLQueryTestSuite so as could run on windows system

2021-02-05 Thread GitBox
AmplabJenkins removed a comment on pull request #31466: URL: https://github.com/apache/spark/pull/31466#issuecomment-773897289 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134907/

[GitHub] [spark] wzhfy edited a comment on pull request #30965: [SPARK-33935][SQL] Fix CBO cost function

2021-02-05 Thread GitBox
wzhfy edited a comment on pull request #30965: URL: https://github.com/apache/spark/pull/30965#issuecomment-773864841 @tanelk Hi, sorry to see this so late. IIRC the reason to use a relative value for rowCount and size, is to normalize them in a similar scale while comparing cost.

[GitHub] [spark] SparkQA commented on pull request #31485: [SPARK-SQL][34137] Update suquery's stats when build LogicalPlan's stats

2021-02-05 Thread GitBox
SparkQA commented on pull request #31485: URL: https://github.com/apache/spark/pull/31485#issuecomment-773892525 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39506/

[GitHub] [spark] SparkQA commented on pull request #31133: [SPARK-26836][SQL] Supporting Avro schema evolution for partitioned Hive tables with "avro.schema.literal"

2021-02-05 Thread GitBox
SparkQA commented on pull request #31133: URL: https://github.com/apache/spark/pull/31133#issuecomment-773897208 **[Test build #134919 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134919/testReport)** for PR 31133 at commit

[GitHub] [spark] AngersZhuuuu commented on pull request #29087: [SPARK-28227][SQL] Support projection, aggregate/window functions, and lateral view in the TRANSFORM clause

2021-02-05 Thread GitBox
AngersZh commented on pull request #29087: URL: https://github.com/apache/spark/pull/29087#issuecomment-773895797 retest this please This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] beliefer commented on a change in pull request #31245: [SPARK-34157][SQL] Unify output of SHOW TABLES and pass output attributes properly

2021-02-05 Thread GitBox
beliefer commented on a change in pull request #31245: URL: https://github.com/apache/spark/pull/31245#discussion_r570815288 ## File path: sql/core/src/test/scala/org/apache/spark/sql/execution/command/v1/ShowTablesSuite.scala ## @@ -119,6 +102,34 @@ trait ShowTablesSuiteBase

[GitHub] [spark] wzhfy commented on pull request #30965: [SPARK-33935][SQL] Fix CBO cost function

2021-02-05 Thread GitBox
wzhfy commented on pull request #30965: URL: https://github.com/apache/spark/pull/30965#issuecomment-773864841 @tanelk Hi, sorry to see this so late. The reason to use a relative value for rowCount and size, is to normalize them in a similar scale while comparing cost. Otherwise,

[GitHub] [spark] SparkQA commented on pull request #31204: [SPARK-26399][WEBUI][CORE] Add new stage-level REST APIs and parameters

2021-02-05 Thread GitBox
SparkQA commented on pull request #31204: URL: https://github.com/apache/spark/pull/31204#issuecomment-773864089 **[Test build #134911 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134911/testReport)** for PR 31204 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #31204: [SPARK-26399][WEBUI][CORE] Add new stage-level REST APIs and parameters

2021-02-05 Thread GitBox
SparkQA removed a comment on pull request #31204: URL: https://github.com/apache/spark/pull/31204#issuecomment-773796762 **[Test build #134911 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134911/testReport)** for PR 31204 at commit

[GitHub] [spark] SparkQA commented on pull request #31482: [SPARK-34346][CORE][SQL][3.1] io.file.buffer.size set by spark.buffer.size will override by loading hive-site.xml accidentally may cause perf

2021-02-05 Thread GitBox
SparkQA commented on pull request #31482: URL: https://github.com/apache/spark/pull/31482#issuecomment-773863580 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39500/

[GitHub] [spark] SparkQA removed a comment on pull request #31483: [SPARK-33434][PYTHON][DOCS] Added RuntimeConfig to PySpark docs

2021-02-05 Thread GitBox
SparkQA removed a comment on pull request #31483: URL: https://github.com/apache/spark/pull/31483#issuecomment-773851149 **[Test build #134920 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134920/testReport)** for PR 31483 at commit

[GitHub] [spark] SparkQA commented on pull request #31483: [SPARK-33434][PYTHON][DOCS] Added RuntimeConfig to PySpark docs

2021-02-05 Thread GitBox
SparkQA commented on pull request #31483: URL: https://github.com/apache/spark/pull/31483#issuecomment-773890053 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39503/

[GitHub] [spark] SparkQA commented on pull request #31472: [SPARK-34356][ML] OVR transform fix potential column conflict

2021-02-05 Thread GitBox
SparkQA commented on pull request #31472: URL: https://github.com/apache/spark/pull/31472#issuecomment-773863219 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39501/

[GitHub] [spark] SparkQA commented on pull request #31483: [SPARK-33434][PYTHON][DOCS] Added RuntimeConfig to PySpark docs

2021-02-05 Thread GitBox
SparkQA commented on pull request #31483: URL: https://github.com/apache/spark/pull/31483#issuecomment-773863073 **[Test build #134920 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134920/testReport)** for PR 31483 at commit

[GitHub] [spark] JkSelf commented on pull request #31258: [SPARK-34168] [SQL] Support DPP in AQE when the join is Broadcast hash join at the beginning

2021-02-05 Thread GitBox
JkSelf commented on pull request #31258: URL: https://github.com/apache/spark/pull/31258#issuecomment-773889572 @wangyum Yes. This implementation only is the first PR to support the join is bhj before apply AQE rules. We will support the join is smj and then convert to bhj use case in the

[GitHub] [spark] wangyum commented on pull request #31258: [SPARK-34168] [SQL] Support DPP in AQE when the join is Broadcast hash join at the beginning

2021-02-05 Thread GitBox
wangyum commented on pull request #31258: URL: https://github.com/apache/spark/pull/31258#issuecomment-773886254 @JkSelf @cloud-fan This implementation can not reuse `BroadcastExchange` if BHJ after SMJ. For example: ```SQL SELECT count(*) FROM (SELECT c.c_customer_sk,

[GitHub] [spark] SparkQA commented on pull request #31133: [SPARK-26836][SQL] Supporting Avro schema evolution for partitioned Hive tables with "avro.schema.literal"

2021-02-05 Thread GitBox
SparkQA commented on pull request #31133: URL: https://github.com/apache/spark/pull/31133#issuecomment-773886154 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39502/

[GitHub] [spark] SparkQA commented on pull request #31484: [SPARK-34374][SQL][DSTREAM] Use standard methods to extract keys or values from a Map

2021-02-05 Thread GitBox
SparkQA commented on pull request #31484: URL: https://github.com/apache/spark/pull/31484#issuecomment-773862225 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39499/

[GitHub] [spark] beliefer commented on a change in pull request #31466: [SPARK-34352][SQL] Improve SQLQueryTestSuite so as could run on windows system

2021-02-05 Thread GitBox
beliefer commented on a change in pull request #31466: URL: https://github.com/apache/spark/pull/31466#discussion_r570808552 ## File path: sql/core/src/test/scala/org/apache/spark/sql/SQLQueryTestSuite.scala ## @@ -566,7 +563,14 @@ class SQLQueryTestSuite extends QueryTest

[GitHub] [spark] SparkQA removed a comment on pull request #31477: [SPARK-34369][SQL][WEBUI] Track number of pairs processed out of Join.

2021-02-05 Thread GitBox
SparkQA removed a comment on pull request #31477: URL: https://github.com/apache/spark/pull/31477#issuecomment-773773216 **[Test build #134905 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134905/testReport)** for PR 31477 at commit

[GitHub] [spark] SparkQA commented on pull request #31477: [SPARK-34369][SQL][WEBUI] Track number of pairs processed out of Join.

2021-02-05 Thread GitBox
SparkQA commented on pull request #31477: URL: https://github.com/apache/spark/pull/31477#issuecomment-773883741 **[Test build #134905 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134905/testReport)** for PR 31477 at commit

[GitHub] [spark] cloud-fan commented on pull request #31440: [SPARK-34331][SQL] Speed up DS v2 metadata col resolution

2021-02-05 Thread GitBox
cloud-fan commented on pull request #31440: URL: https://github.com/apache/spark/pull/31440#issuecomment-773883162 thanks for the review, merging to master/3.1! This is an automated message from the Apache Git Service. To

[GitHub] [spark] cloud-fan closed pull request #31440: [SPARK-34331][SQL] Speed up DS v2 metadata col resolution

2021-02-05 Thread GitBox
cloud-fan closed pull request #31440: URL: https://github.com/apache/spark/pull/31440 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

<    3   4   5   6   7   8