[jira] [Resolved] (SPARK-36444) Remove OptimizeSubqueries from batch of PartitionPruning

2021-08-19 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-36444. - Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 33664

[jira] [Assigned] (SPARK-36444) Remove OptimizeSubqueries from batch of PartitionPruning

2021-08-19 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-36444: --- Assignee: Yuming Wang > Remove OptimizeSubqueries from batch of PartitionPruning >

[jira] [Commented] (SPARK-34276) Check the unreleased/unresolved JIRAs/PRs of Parquet 1.11

2021-08-09 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17396450#comment-17396450 ] Yuming Wang commented on SPARK-34276: - I think so. cc [~smilegator] > Check the

[jira] [Commented] (SPARK-34276) Check the unreleased/unresolved JIRAs/PRs of Parquet 1.11

2021-08-09 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17396435#comment-17396435 ] Yuming Wang commented on SPARK-34276: - We have used parquet 1.11/1.12 in the production environment

[jira] [Commented] (SPARK-34276) Check the unreleased/unresolved JIRAs/PRs of Parquet 1.11

2021-08-09 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17396433#comment-17396433 ] Yuming Wang commented on SPARK-34276: - It seems that there is no unreleased/unresolved JIRAs/PRs of

[jira] [Resolved] (SPARK-36359) Coalesce drop all expressions after the first non nullable expression

2021-08-06 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-36359. - Fix Version/s: 3.3.0 Resolution: Fixed Issue resolved by pull request 33590

[jira] [Assigned] (SPARK-36359) Coalesce drop all expressions after the first non nullable expression

2021-08-06 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-36359: --- Assignee: Yuming Wang > Coalesce drop all expressions after the first non nullable

[jira] [Commented] (SPARK-36444) Remove OptimizeSubqueries from batch of PartitionPruning

2021-08-06 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36444?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17394641#comment-17394641 ] Yuming Wang commented on SPARK-36444: - Another case: {code:scala} sql("create table t1 using parquet

[jira] [Updated] (SPARK-36444) Remove OptimizeSubqueries from batch of PartitionPruning

2021-08-06 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-36444: Description: To support this case: {code:scala} sql( """ |SELECT date_id,

[jira] [Created] (SPARK-36444) Remove OptimizeSubqueries from batch of PartitionPruning

2021-08-06 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-36444: --- Summary: Remove OptimizeSubqueries from batch of PartitionPruning Key: SPARK-36444 URL: https://issues.apache.org/jira/browse/SPARK-36444 Project: Spark Issue

[jira] [Updated] (SPARK-36359) Coalesce drop all expressions after the first non nullable expression

2021-08-02 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-36359: Summary: Coalesce drop all expressions after the first non nullable expression (was: Coalesce

[jira] [Assigned] (SPARK-36373) DecimalPrecision only add necessary cast

2021-08-02 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-36373: --- Assignee: Yuming Wang > DecimalPrecision only add necessary cast >

[jira] [Resolved] (SPARK-36373) DecimalPrecision only add necessary cast

2021-08-02 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-36373. - Fix Version/s: 3.3.0 Resolution: Fixed Issue resolved by pull request 33602

[jira] [Created] (SPARK-36376) Collapse repartitions if there is a project between them

2021-08-01 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-36376: --- Summary: Collapse repartitions if there is a project between them Key: SPARK-36376 URL: https://issues.apache.org/jira/browse/SPARK-36376 Project: Spark Issue

[jira] [Created] (SPARK-36373) DecimalPrecision only add necessary cast

2021-08-01 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-36373: --- Summary: DecimalPrecision only add necessary cast Key: SPARK-36373 URL: https://issues.apache.org/jira/browse/SPARK-36373 Project: Spark Issue Type:

[jira] [Created] (SPARK-36359) Coalesce returns the first expression if it is non nullable

2021-07-30 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-36359: --- Summary: Coalesce returns the first expression if it is non nullable Key: SPARK-36359 URL: https://issues.apache.org/jira/browse/SPARK-36359 Project: Spark

[jira] [Commented] (SPARK-36290) Push down join condition evaluation

2021-07-29 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17389715#comment-17389715 ] Yuming Wang commented on SPARK-36290: - {code:java} spark.sql("create table t1 using parquet select

[jira] [Created] (SPARK-36290) Push down join condition evaluation

2021-07-26 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-36290: --- Summary: Push down join condition evaluation Key: SPARK-36290 URL: https://issues.apache.org/jira/browse/SPARK-36290 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-31809) Infer IsNotNull for non null intolerant child of null intolerant in join condition

2021-07-25 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17386991#comment-17386991 ] Yuming Wang commented on SPARK-31809: - {code:java} spark.sql("create table t1 using parquet select

[jira] [Created] (SPARK-36280) Remove redundant aliases after RewritePredicateSubquery

2021-07-23 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-36280: --- Summary: Remove redundant aliases after RewritePredicateSubquery Key: SPARK-36280 URL: https://issues.apache.org/jira/browse/SPARK-36280 Project: Spark Issue

[jira] [Resolved] (SPARK-30186) support Dynamic Partition Pruning in Adaptive Execution

2021-07-23 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-30186. - Fix Version/s: 3.2.0 Assignee: Ke Jia Resolution: Fixed > support Dynamic

[jira] [Created] (SPARK-36245) Deduplicate the right side of left semi/anti join

2021-07-21 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-36245: --- Summary: Deduplicate the right side of left semi/anti join Key: SPARK-36245 URL: https://issues.apache.org/jira/browse/SPARK-36245 Project: Spark Issue Type:

[jira] [Updated] (SPARK-36238) Spark UI load event timeline too slow for huge stage

2021-07-21 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-36238: Attachment: (was: screenshot-1.png) > Spark UI load event timeline too slow for huge stage >

[jira] [Resolved] (SPARK-36183) Push down limit 1 through Aggregate

2021-07-20 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36183?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-36183. - Fix Version/s: 3.3.0 Resolution: Fixed Issue resolved by pull request 33397

[jira] [Assigned] (SPARK-36183) Push down limit 1 through Aggregate

2021-07-20 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36183?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-36183: --- Assignee: Yuming Wang > Push down limit 1 through Aggregate >

[jira] [Updated] (HIVE-21521) Upgrade ORC to 1.5.5

2021-07-20 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/HIVE-21521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated HIVE-21521: --- Resolution: Won't Fix Status: Resolved (was: Patch Available) > Upgrade ORC to 1.5.5 >

[jira] [Assigned] (SPARK-36093) The result incorrect if the partition path case is inconsistent

2021-07-19 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-36093: --- Assignee: angerszhu (was: Apache Spark) > The result incorrect if the partition path case

[jira] [Created] (SPARK-36194) Remove the aggregation from left semi/anti join if the same aggregation has already been done on left side

2021-07-17 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-36194: --- Summary: Remove the aggregation from left semi/anti join if the same aggregation has already been done on left side Key: SPARK-36194 URL:

[jira] [Created] (SPARK-36183) Push down limit 1 through Aggregate

2021-07-16 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-36183: --- Summary: Push down limit 1 through Aggregate Key: SPARK-36183 URL: https://issues.apache.org/jira/browse/SPARK-36183 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-36162) extractJoinKeysWithColStats support EqualNullSafe

2021-07-15 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-36162: Description: sql("select * from date_dim join item on d_date_sk = i_item_sk").explain("cost")

[jira] [Created] (SPARK-36162) extractJoinKeysWithColStats support EqualNullSafe

2021-07-15 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-36162: --- Summary: extractJoinKeysWithColStats support EqualNullSafe Key: SPARK-36162 URL: https://issues.apache.org/jira/browse/SPARK-36162 Project: Spark Issue Type:

[jira] [Created] (SPARK-36155) Eliminate join base uniqueness

2021-07-15 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-36155: --- Summary: Eliminate join base uniqueness Key: SPARK-36155 URL: https://issues.apache.org/jira/browse/SPARK-36155 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-36093) The result incorrect if the partition path case is inconsistent

2021-07-12 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-36093: Labels: correctness (was: ) > The result incorrect if the partition path case is inconsistent >

[jira] [Created] (SPARK-36093) The result incorrect if the partition path case is inconsistent

2021-07-12 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-36093: --- Summary: The result incorrect if the partition path case is inconsistent Key: SPARK-36093 URL: https://issues.apache.org/jira/browse/SPARK-36093 Project: Spark

[jira] [Created] (SPARK-36086) The case of the delta table is inconsistent with parquet

2021-07-12 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-36086: --- Summary: The case of the delta table is inconsistent with parquet Key: SPARK-36086 URL: https://issues.apache.org/jira/browse/SPARK-36086 Project: Spark Issue

[jira] [Created] (SPARK-36080) Broadcast join outer join stream side

2021-07-09 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-36080: --- Summary: Broadcast join outer join stream side Key: SPARK-36080 URL: https://issues.apache.org/jira/browse/SPARK-36080 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-35991) Add PlanStability suite for TPCH

2021-07-06 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-35991: Affects Version/s: (was: 3.1.2) (was: 3.2.0)

[jira] [Updated] (SPARK-35991) Add PlanStability suite for TPCH

2021-07-06 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-35991: Issue Type: Improvement (was: Bug) > Add PlanStability suite for TPCH >

[jira] [Resolved] (SPARK-35908) Remove repartition if the child maximum number of rows less than or equal to 1

2021-07-01 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35908?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-35908. - Resolution: Not A Problem > Remove repartition if the child maximum number of rows less than or

[jira] [Updated] (SPARK-35967) Update nullability based on column statistics

2021-07-01 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35967?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-35967: Summary: Update nullability based on column statistics (was: Update nullability base on column

[jira] [Created] (SPARK-35967) Update nullability base on column statistics

2021-07-01 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-35967: --- Summary: Update nullability base on column statistics Key: SPARK-35967 URL: https://issues.apache.org/jira/browse/SPARK-35967 Project: Spark Issue Type:

[jira] [Updated] (SPARK-35904) Collapse above RebalancePartitions

2021-06-28 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-35904: Fix Version/s: (was: 3.2.0) > Collapse above RebalancePartitions >

[jira] [Reopened] (SPARK-35904) Collapse above RebalancePartitions

2021-06-28 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reopened SPARK-35904: - Reverted at https://github.com/apache/spark/commit/108635af1708173a72bec0e36bf3f2cea5b088c4 >

[jira] [Updated] (SPARK-35908) Remove repartition if the child maximum number of rows less than or equal to 1

2021-06-26 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35908?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-35908: Description: {code:scala} spark.sql("select count(*) from range(1, 10, 2,

[jira] [Created] (SPARK-35908) Remove repartition if the child maximum number of rows less than or equal to 1

2021-06-26 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-35908: --- Summary: Remove repartition if the child maximum number of rows less than or equal to 1 Key: SPARK-35908 URL: https://issues.apache.org/jira/browse/SPARK-35908

[jira] [Created] (SPARK-35906) Remove order by if the maximum number of rows less than or equal to 1

2021-06-26 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-35906: --- Summary: Remove order by if the maximum number of rows less than or equal to 1 Key: SPARK-35906 URL: https://issues.apache.org/jira/browse/SPARK-35906 Project: Spark

[jira] [Updated] (SPARK-35904) Collapse above RebalancePartitions

2021-06-25 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-35904: Description: Make RebalancePartitions extends RepartitionOperation. > Collapse above

[jira] [Created] (SPARK-35904) Collapse above RebalancePartitions

2021-06-25 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-35904: --- Summary: Collapse above RebalancePartitions Key: SPARK-35904 URL: https://issues.apache.org/jira/browse/SPARK-35904 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-35886) Codegen issue for decimal type

2021-06-24 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-35886: --- Summary: Codegen issue for decimal type Key: SPARK-35886 URL: https://issues.apache.org/jira/browse/SPARK-35886 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-34807) Push down filter through window after TransposeWindow

2021-06-23 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-34807: --- Assignee: Tanel Kiis > Push down filter through window after TransposeWindow >

[jira] [Resolved] (SPARK-34807) Push down filter through window after TransposeWindow

2021-06-23 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-34807. - Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 31980

[jira] [Updated] (SPARK-35837) Recommendations for Common Query Problems

2021-06-21 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-35837: Description: Teradata supportsĀ [Recommendations for Common Query

[jira] [Updated] (SPARK-35837) Recommendations for Common Query Problems

2021-06-21 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-35837: Description: Teradata supportsĀ [Recommendations for Common Query

[jira] [Updated] (SPARK-35837) Recommendations for Common Query Problems

2021-06-21 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-35837: Description: Teradata supportsĀ [Recommendations for Common Query

[jira] [Created] (SPARK-35837) Recommendations for Common Query Problems

2021-06-21 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-35837: --- Summary: Recommendations for Common Query Problems Key: SPARK-35837 URL: https://issues.apache.org/jira/browse/SPARK-35837 Project: Spark Issue Type: New

[jira] [Commented] (SPARK-35797) patternToRegex failed when pattern is star

2021-06-19 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17366080#comment-17366080 ] Yuming Wang commented on SPARK-35797: - How to reproduce this issue? > patternToRegex failed when

[jira] [Assigned] (SPARK-34120) Improve the statistics estimation

2021-06-18 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-34120: --- Assignee: Yuming Wang > Improve the statistics estimation >

[jira] [Resolved] (SPARK-34120) Improve the statistics estimation

2021-06-18 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-34120. - Fix Version/s: 3.2.0 Resolution: Fixed > Improve the statistics estimation >

[jira] [Assigned] (SPARK-35185) Improve Distinct statistics estimation

2021-06-18 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-35185: --- Assignee: Yuming Wang > Improve Distinct statistics estimation >

[jira] [Resolved] (SPARK-35185) Improve Distinct statistics estimation

2021-06-18 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-35185. - Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 32291

[jira] [Updated] (SPARK-35786) Support optimize repartition by expression in AQE

2021-06-17 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-35786: Parent: SPARK-35793 Issue Type: Sub-task (was: Improvement) > Support optimize

[jira] [Updated] (SPARK-30538) A not very elegant way to control ouput small file

2021-06-17 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-30538: Parent: SPARK-35793 Issue Type: Sub-task (was: Improvement) > A not very elegant way to

[jira] [Updated] (SPARK-35335) Improve CoalesceShufflePartitions to avoid generating small files

2021-06-17 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35335?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-35335: Parent: SPARK-35793 Issue Type: Sub-task (was: Improvement) > Improve

[jira] [Updated] (SPARK-35650) Coalesce small output files through AQE

2021-06-17 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-35650: Parent: SPARK-35793 Issue Type: Sub-task (was: New Feature) > Coalesce small output

[jira] [Updated] (SPARK-35725) Support repartition expand partitions in AQE

2021-06-17 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-35725: Parent Issue: SPARK-35793 (was: SPARK-33828) > Support repartition expand partitions in AQE >

[jira] [Updated] (SPARK-31264) Repartition by dynamic partition columns before insert table

2021-06-17 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-31264: Parent: SPARK-35793 Issue Type: Sub-task (was: Improvement) > Repartition by dynamic

[jira] [Created] (SPARK-35793) Repartition before writing data source tables

2021-06-17 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-35793: --- Summary: Repartition before writing data source tables Key: SPARK-35793 URL: https://issues.apache.org/jira/browse/SPARK-35793 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-35556) Remove the close HiveClient's SessionState

2021-06-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-35556. - Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 32693

[jira] [Assigned] (SPARK-35556) Remove the close HiveClient's SessionState

2021-06-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-35556: --- Assignee: Yang Jie > Remove the close HiveClient's SessionState >

[jira] [Updated] (SPARK-35556) Avoid log NoSuchMethodError when HiveClientImpl.state close

2021-06-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-35556: Component/s: (was: Tests) > Avoid log NoSuchMethodError when HiveClientImpl.state close >

[jira] [Updated] (SPARK-35556) Remove the close HiveClient's SessionState

2021-06-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-35556: Summary: Remove the close HiveClient's SessionState (was: Avoid log NoSuchMethodError when

[jira] [Updated] (SPARK-28560) Optimize shuffle reader to local shuffle reader when smj converted to bhj in adaptive execution

2021-06-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-28560: Attachment: localShuffleReader.png > Optimize shuffle reader to local shuffle reader when smj

[jira] [Commented] (SPARK-28560) Optimize shuffle reader to local shuffle reader when smj converted to bhj in adaptive execution

2021-06-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17364079#comment-17364079 ] Yuming Wang commented on SPARK-28560: - This is very useful if there is data skew at the probe side.

[jira] [Updated] (SPARK-27714) Support Join Reorder based on Genetic Algorithm when the # of joined tables > 12

2021-06-13 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-27714?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-27714: Target Version/s: (was: 3.0.0) > Support Join Reorder based on Genetic Algorithm when the # of

[jira] [Assigned] (SPARK-35321) Spark 3.x can't talk to HMS 1.2.x and lower due to get_all_functions Thrift API missing

2021-06-11 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-35321: --- Assignee: Chao Sun > Spark 3.x can't talk to HMS 1.2.x and lower due to get_all_functions

[jira] [Resolved] (SPARK-35321) Spark 3.x can't talk to HMS 1.2.x and lower due to get_all_functions Thrift API missing

2021-06-11 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-35321. - Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 32887

[jira] [Created] (SPARK-35650) Coalesce small output files through AQE

2021-06-04 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-35650: --- Summary: Coalesce small output files through AQE Key: SPARK-35650 URL: https://issues.apache.org/jira/browse/SPARK-35650 Project: Spark Issue Type: New

[jira] [Assigned] (SPARK-34808) Removes outer join if it only has distinct on streamed side

2021-05-31 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-34808: --- Assignee: Yuming Wang > Removes outer join if it only has distinct on streamed side >

[jira] [Resolved] (SPARK-34808) Removes outer join if it only has distinct on streamed side

2021-05-31 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-34808. - Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 31908

[jira] [Updated] (SPARK-35571) tag v3.0.0 org.apache.spark.sql.catalyst.parser.AstBuilder import error

2021-05-31 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35571?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-35571: Target Version/s: (was: 3.0.0) > tag v3.0.0 org.apache.spark.sql.catalyst.parser.AstBuilder

[jira] [Commented] (SPARK-35568) UnsupportedOperationException: WholeStageCodegen (3) does not implement doExecuteBroadcast

2021-05-30 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17354174#comment-17354174 ] Yuming Wang commented on SPARK-35568: - Thank you [~dongjoon]. > UnsupportedOperationException:

[jira] [Updated] (SPARK-35568) UnsupportedOperationException: WholeStageCodegen (3) does not implement doExecuteBroadcast

2021-05-30 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-35568: Description: How to reproduce: {code:scala} sql( """ |SELECT s.store_id,

[jira] [Created] (SPARK-35568) UnsupportedOperationException: WholeStageCodegen (3) does not implement doExecuteBroadcast

2021-05-30 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-35568: --- Summary: UnsupportedOperationException: WholeStageCodegen (3) does not implement doExecuteBroadcast Key: SPARK-35568 URL: https://issues.apache.org/jira/browse/SPARK-35568

[jira] [Commented] (SPARK-35441) InMemoryFileIndex load all files into memroy

2021-05-23 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17350242#comment-17350242 ] Yuming Wang commented on SPARK-35441: - Please increase your driver memory. or you should merge small

[jira] [Updated] (SPARK-35494) Timestamp casting performance issue when invoked with timezone

2021-05-23 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-35494: Target Version/s: (was: 2.4.9) > Timestamp casting performance issue when invoked with timezone

[jira] [Commented] (SPARK-32291) COALESCE should not reduce the child parallelism if it is Join

2021-05-22 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32291?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17349930#comment-17349930 ] Yuming Wang commented on SPARK-32291: - We can use localCheckpoint to workaround this issue:

[jira] [Assigned] (SPARK-35244) invoke should throw the original exception

2021-05-21 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-35244: --- Assignee: Wenchen Fan (was: Apache Spark) > invoke should throw the original exception >

[jira] [Commented] (SPARK-35441) InMemoryFileIndex load all files into memroy

2021-05-19 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17347404#comment-17347404 ] Yuming Wang commented on SPARK-35441: - What is your driver memory? > InMemoryFileIndex load all

[jira] [Created] (SPARK-35415) Change information to map type for SHOW TABLE EXTENDED command

2021-05-16 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-35415: --- Summary: Change information to map type for SHOW TABLE EXTENDED command Key: SPARK-35415 URL: https://issues.apache.org/jira/browse/SPARK-35415 Project: Spark

[jira] [Resolved] (SPARK-35286) Replace SessionState.start with SessionState.setCurrentSessionState

2021-05-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-35286. - Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 32410

[jira] [Assigned] (SPARK-35286) Replace SessionState.start with SessionState.setCurrentSessionState

2021-05-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-35286: --- Assignee: Yuming Wang > Replace SessionState.start with

[jira] [Commented] (SPARK-35365) spark3.1.1 use too long time to analyze table fields

2021-05-11 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17342981#comment-17342981 ] Yuming Wang commented on SPARK-35365: - {noformat} -- 2.4

[jira] [Commented] (SPARK-35365) spark3.1.1 use too long to analyze table fields

2021-05-11 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17342354#comment-17342354 ] Yuming Wang commented on SPARK-35365: - [~xiaohua] Could you check which rule affect the performance,

[jira] [Created] (SPARK-35335) Improve CoalesceShufflePartitions to avoid generating small files

2021-05-07 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-35335: --- Summary: Improve CoalesceShufflePartitions to avoid generating small files Key: SPARK-35335 URL: https://issues.apache.org/jira/browse/SPARK-35335 Project: Spark

[jira] [Updated] (SPARK-35273) CombineFilters support non-deterministic expressions

2021-05-06 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-35273: Description: For example: {code:scala} spark.sql("create table t1(id int) using parquet")

[jira] [Commented] (SPARK-35321) Spark 3.x can't talk to HMS 1.2.x and lower due to get_all_functions Thrift API missing

2021-05-05 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17339944#comment-17339944 ] Yuming Wang commented on SPARK-35321: - Could we add a parameter to disable registerAllFunctionsOnce?

[jira] [Resolved] (SPARK-35315) Keep benchmark result consistent between spark-submit and SBT

2021-05-05 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35315?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-35315. - Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 32440

[jira] [Assigned] (SPARK-35315) Keep benchmark result consistent between spark-submit and SBT

2021-05-05 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35315?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-35315: --- Assignee: Chao Sun > Keep benchmark result consistent between spark-submit and SBT >

[jira] [Updated] (SPARK-35316) UnwrapCastInBinaryComparison support In/InSet predicate

2021-05-04 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-35316: Description: It will not pushdown filters for In/InSet predicates: {code:scala}

[jira] [Created] (SPARK-35316) UnwrapCastInBinaryComparison support In/InSet predicate

2021-05-04 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-35316: --- Summary: UnwrapCastInBinaryComparison support In/InSet predicate Key: SPARK-35316 URL: https://issues.apache.org/jira/browse/SPARK-35316 Project: Spark Issue

<    5   6   7   8   9   10   11   12   13   14   >