[jira] [Updated] (SPARK-39503) Add session catalog name for v1 database table and function

2022-06-17 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-39503: -- Description: To make it more clearer that this table or function comes from which catalog. It

[jira] [Updated] (SPARK-39503) Add session catalog name for v1 database table and function

2022-06-17 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-39503: -- Summary: Add session catalog name for v1 database table and function (was: Add session catalog name

[jira] [Created] (SPARK-39503) Add session catalog name for v1 table and function

2022-06-17 Thread XiDuo You (Jira)
XiDuo You created SPARK-39503: - Summary: Add session catalog name for v1 table and function Key: SPARK-39503 URL: https://issues.apache.org/jira/browse/SPARK-39503 Project: Spark Issue Type:

[jira] [Updated] (SPARK-39475) Pull out complex join keys for shuffled join

2022-06-14 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-39475: -- Summary: Pull out complex join keys for shuffled join (was: Pull out complex join keys) > Pull out

[jira] [Created] (SPARK-39475) Pull out complex join keys

2022-06-14 Thread XiDuo You (Jira)
XiDuo You created SPARK-39475: - Summary: Pull out complex join keys Key: SPARK-39475 URL: https://issues.apache.org/jira/browse/SPARK-39475 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-39454) failed to convert LogicalPlan to SparkPlan when subquery exists after "IN" predicate

2022-06-13 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39454?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17553874#comment-17553874 ] XiDuo You commented on SPARK-39454: --- [~allxu] this issue should be fixed by SPARK-37995 > failed to

[jira] [Created] (SPARK-39455) Improve expression non-codegen code path performance by cache data type matching

2022-06-13 Thread XiDuo You (Jira)
XiDuo You created SPARK-39455: - Summary: Improve expression non-codegen code path performance by cache data type matching Key: SPARK-39455 URL: https://issues.apache.org/jira/browse/SPARK-39455 Project:

[jira] [Updated] (SPARK-39397) Relax AliasAwareOutputExpression to support alias with expression

2022-06-07 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39397?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-39397: -- Description: We will pull out complex join keys from grouping expressions, so the project can hold a

[jira] [Updated] (SPARK-39397) Relax AliasAwareOutputExpression to support alias with expression

2022-06-07 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39397?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-39397: -- Description: We will pull out complex join keys from grouping expressions, so the project can hold a

[jira] [Created] (SPARK-39397) Relax AliasAwareOutputExpression to support alias with expression

2022-06-07 Thread XiDuo You (Jira)
XiDuo You created SPARK-39397: - Summary: Relax AliasAwareOutputExpression to support alias with expression Key: SPARK-39397 URL: https://issues.apache.org/jira/browse/SPARK-39397 Project: Spark

[jira] [Updated] (SPARK-39318) Remove tpch-plan-stability WithStats golden files

2022-05-27 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-39318: -- Description: It's a dead golden files since we have no stats with TPCH and no check for that. (was:

[jira] [Created] (SPARK-39318) Rmove tpch-plan-stability WithStats golden files

2022-05-27 Thread XiDuo You (Jira)
XiDuo You created SPARK-39318: - Summary: Rmove tpch-plan-stability WithStats golden files Key: SPARK-39318 URL: https://issues.apache.org/jira/browse/SPARK-39318 Project: Spark Issue Type:

[jira] [Updated] (SPARK-39318) Remove tpch-plan-stability WithStats golden files

2022-05-27 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-39318: -- Summary: Remove tpch-plan-stability WithStats golden files (was: Rmove tpch-plan-stability WithStats

[jira] [Updated] (SPARK-39316) Merge PromotePrecision and CheckOverflow into decimal binary arithmetic

2022-05-27 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-39316: -- Description: Merge {{PromotePrecision}} into {{{}dataType{}}}, for example, {{{}Add{}}}: {code:java}

[jira] [Created] (SPARK-39316) Merge PromotePrecision and CheckOverflow into decimal binary arithmetic

2022-05-27 Thread XiDuo You (Jira)
XiDuo You created SPARK-39316: - Summary: Merge PromotePrecision and CheckOverflow into decimal binary arithmetic Key: SPARK-39316 URL: https://issues.apache.org/jira/browse/SPARK-39316 Project: Spark

[jira] [Created] (SPARK-39315) Refactor PromotePrecision and CheckOverflow with decimal binary arithmetic

2022-05-27 Thread XiDuo You (Jira)
XiDuo You created SPARK-39315: - Summary: Refactor PromotePrecision and CheckOverflow with decimal binary arithmetic Key: SPARK-39315 URL: https://issues.apache.org/jira/browse/SPARK-39315 Project: Spark

[jira] [Updated] (SPARK-39291) Fetch blocks and open stream should not respond a closed channel

2022-05-25 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-39291: -- Description: If user cancel and interrupt a reduce task who is fetching shuffle blocks, the channel

[jira] [Created] (SPARK-39291) Fetch blocks and open stream should not respond a closed channel

2022-05-25 Thread XiDuo You (Jira)
XiDuo You created SPARK-39291: - Summary: Fetch blocks and open stream should not respond a closed channel Key: SPARK-39291 URL: https://issues.apache.org/jira/browse/SPARK-39291 Project: Spark

[jira] [Created] (SPARK-39267) Clean up dsl unnecessary symbol

2022-05-23 Thread XiDuo You (Jira)
XiDuo You created SPARK-39267: - Summary: Clean up dsl unnecessary symbol Key: SPARK-39267 URL: https://issues.apache.org/jira/browse/SPARK-39267 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-39220) codegen cause NullPointException

2022-05-19 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17539876#comment-17539876 ] XiDuo You commented on SPARK-39220: --- is it possible to also provide a stack log ? > codegen cause

[jira] [Updated] (SPARK-39172) Remove outer join if all output come from streamed side and buffered side keys exist unique key

2022-05-12 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-39172: -- Summary: Remove outer join if all output come from streamed side and buffered side keys exist unique

[jira] [Created] (SPARK-39172) Remove outer join if all output come from streamed side and buffered side keys exist unique

2022-05-12 Thread XiDuo You (Jira)
XiDuo You created SPARK-39172: - Summary: Remove outer join if all output come from streamed side and buffered side keys exist unique Key: SPARK-39172 URL: https://issues.apache.org/jira/browse/SPARK-39172

[jira] [Commented] (SPARK-39104) Null Pointer Exeption on unpersist call

2022-05-09 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17534136#comment-17534136 ] XiDuo You commented on SPARK-39104: --- it seems this bug also exists at 3.3.0 branch > Null Pointer

[jira] [Commented] (SPARK-39132) spark3.2.1 cache throw NPE

2022-05-09 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17534134#comment-17534134 ] XiDuo You commented on SPARK-39132: --- same bug with SPARK-39104 > spark3.2.1 cache throw NPE >

[jira] [Created] (SPARK-39122) Python UDF does not follow the conditional expression evaluation order

2022-05-07 Thread XiDuo You (Jira)
XiDuo You created SPARK-39122: - Summary: Python UDF does not follow the conditional expression evaluation order Key: SPARK-39122 URL: https://issues.apache.org/jira/browse/SPARK-39122 Project: Spark

[jira] [Created] (SPARK-39106) Correct conditional expression constant folding

2022-05-05 Thread XiDuo You (Jira)
XiDuo You created SPARK-39106: - Summary: Correct conditional expression constant folding Key: SPARK-39106 URL: https://issues.apache.org/jira/browse/SPARK-39106 Project: Spark Issue Type:

[jira] [Updated] (SPARK-39105) Add ConditionalExpression trait

2022-05-05 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-39105: -- Description: For developers, if a custom conditional like expression contains common sub expression

[jira] [Updated] (SPARK-39105) Add ConditionalExpression trait

2022-05-05 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-39105: -- Description: For develpers, if a custom conditional like expression contains common sub expression

[jira] [Updated] (SPARK-39105) Add ConditionalExpression trait

2022-05-05 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-39105: -- Description: For develpers, if a custom conditional like expression contains common sub expression

[jira] [Created] (SPARK-39105) Add ConditionalExpression trait

2022-05-05 Thread XiDuo You (Jira)
XiDuo You created SPARK-39105: - Summary: Add ConditionalExpression trait Key: SPARK-39105 URL: https://issues.apache.org/jira/browse/SPARK-39105 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-39040) Respect NaNvl in EquivalentExpressions for expression elimination

2022-04-27 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-39040: -- Description: For example the query will fail: {code:java} set spark.sql.ansi.enabled=true; set

[jira] [Created] (SPARK-39040) Respect NaNvl in EquivalentExpressions for expression elimination

2022-04-27 Thread XiDuo You (Jira)
XiDuo You created SPARK-39040: - Summary: Respect NaNvl in EquivalentExpressions for expression elimination Key: SPARK-39040 URL: https://issues.apache.org/jira/browse/SPARK-39040 Project: Spark

[jira] [Created] (SPARK-39039) Conditional expression evaluation ordering

2022-04-27 Thread XiDuo You (Jira)
XiDuo You created SPARK-39039: - Summary: Conditional expression evaluation ordering Key: SPARK-39039 URL: https://issues.apache.org/jira/browse/SPARK-39039 Project: Spark Issue Type: Umbrella

[jira] [Updated] (SPARK-37528) Schedule Tasks By Input Size

2022-04-21 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-37528: -- Description: In general, the larger input data size means longer running time. So ideally, we can

[jira] [Updated] (SPARK-37528) Schedule Tasks By Input Size

2022-04-21 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-37528: -- Description: In general, the larger input data size means longer running time. So ideally, we can

[jira] [Updated] (SPARK-37528) Schedule Tasks By Input Size

2022-04-21 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-37528: -- Description: In general, the larger input data size means longer running time. So ideally, we can

[jira] [Created] (SPARK-38962) Fix wrong computeStats at DataSourceV2Relation

2022-04-19 Thread XiDuo You (Jira)
XiDuo You created SPARK-38962: - Summary: Fix wrong computeStats at DataSourceV2Relation Key: SPARK-38962 URL: https://issues.apache.org/jira/browse/SPARK-38962 Project: Spark Issue Type:

[jira] [Updated] (SPARK-38962) Fix wrong computeStats at DataSourceV2Relation

2022-04-19 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-38962: -- Issue Type: Bug (was: Improvement) > Fix wrong computeStats at DataSourceV2Relation >

[jira] [Commented] (SPARK-38667) Optimizer generates error when using inner join along with sequence

2022-04-19 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17524692#comment-17524692 ] XiDuo You commented on SPARK-38667: --- So you can add a config to avoid this issue set

[jira] [Commented] (SPARK-38667) Optimizer generates error when using inner join along with sequence

2022-04-19 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17524690#comment-17524690 ] XiDuo You commented on SPARK-38667: --- it was introduced by SPARK-32295 and fixed by SPARK-37392 >

[jira] [Created] (SPARK-38932) Datasource v2 support report unique keys

2022-04-18 Thread XiDuo You (Jira)
XiDuo You created SPARK-38932: - Summary: Datasource v2 support report unique keys Key: SPARK-38932 URL: https://issues.apache.org/jira/browse/SPARK-38932 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-38895) Unify the AQE shuffle read canonicalized

2022-04-14 Thread XiDuo You (Jira)
XiDuo You created SPARK-38895: - Summary: Unify the AQE shuffle read canonicalized Key: SPARK-38895 URL: https://issues.apache.org/jira/browse/SPARK-38895 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-38887) Support switch inner join side for sort merge join

2022-04-13 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38887?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-38887: -- Summary: Support switch inner join side for sort merge join (was: Support swtich inner join side for

[jira] [Created] (SPARK-38887) Support swtich inner join side for sort merge join

2022-04-13 Thread XiDuo You (Jira)
XiDuo You created SPARK-38887: - Summary: Support swtich inner join side for sort merge join Key: SPARK-38887 URL: https://issues.apache.org/jira/browse/SPARK-38887 Project: Spark Issue Type:

[jira] [Updated] (SPARK-38886) Remove outer join if aggregate functions are duplicate agnostic on streamed side

2022-04-13 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-38886: -- Description: If aggregate child is outer join, and the aggregate references are all coming from the

[jira] [Created] (SPARK-38886) Remove outer join if aggregate functions are duplicate agnostic on streamed side

2022-04-13 Thread XiDuo You (Jira)
XiDuo You created SPARK-38886: - Summary: Remove outer join if aggregate functions are duplicate agnostic on streamed side Key: SPARK-38886 URL: https://issues.apache.org/jira/browse/SPARK-38886 Project:

[jira] [Commented] (SPARK-38853) optimizeSkewsInRebalancePartitions has performance issue

2022-04-11 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17520368#comment-17520368 ] XiDuo You commented on SPARK-38853: --- Some issues might cause driver hang during optimizing skew :

[jira] [Updated] (SPARK-38832) Remove unnecessary distinct in aggregate expression by distinctKeys

2022-04-08 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-38832: -- Description: We can remove the distinct in aggregate expression if the child distinct semantics is

[jira] [Created] (SPARK-38832) Remove unnecessary distinct in aggregate expression by distinctKeys

2022-04-08 Thread XiDuo You (Jira)
XiDuo You created SPARK-38832: - Summary: Remove unnecessary distinct in aggregate expression by distinctKeys Key: SPARK-38832 URL: https://issues.apache.org/jira/browse/SPARK-38832 Project: Spark

[jira] [Updated] (SPARK-38162) Optimize one row plan in normal and AQE Optimizer

2022-04-06 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-38162: -- Parent: SPARK-37063 Issue Type: Sub-task (was: Improvement) > Optimize one row plan in

[jira] [Created] (SPARK-38773) Correct the Union output partitioning and ordering

2022-04-02 Thread XiDuo You (Jira)
XiDuo You created SPARK-38773: - Summary: Correct the Union output partitioning and ordering Key: SPARK-38773 URL: https://issues.apache.org/jira/browse/SPARK-38773 Project: Spark Issue Type:

[jira] [Updated] (SPARK-37528) Schedule Tasks By Input Size

2022-04-01 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-37528: -- Affects Version/s: 3.4.0 (was: 3.3.0) > Schedule Tasks By Input Size >

[jira] [Updated] (SPARK-37528) Schedule Tasks By Input Size

2022-04-01 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-37528: -- Summary: Schedule Tasks By Input Size (was: Support reorder tasks during scheduling by shuffle

[jira] [Updated] (SPARK-37528) Support reorder tasks during scheduling by shuffle partition size in AQE

2022-04-01 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-37528: -- Description: In general, the larger input data size means longer running time. So ideally, we can

[jira] [Updated] (SPARK-38697) Extend SparkSessionExtensions to inject rules into AQE Optimizer

2022-03-30 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-38697: -- Description: Provide a entrance for developer to play their logical plan with runtime optimizer in

[jira] [Created] (SPARK-38697) Extend SparkSessionExtensions to inject rules into AQE Optimizer

2022-03-30 Thread XiDuo You (Jira)
XiDuo You created SPARK-38697: - Summary: Extend SparkSessionExtensions to inject rules into AQE Optimizer Key: SPARK-38697 URL: https://issues.apache.org/jira/browse/SPARK-38697 Project: Spark

[jira] [Updated] (SPARK-38578) Avoid unnecessary sort in FileFormatWriter if user has specified sort in AQE

2022-03-16 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-38578: -- Summary: Avoid unnecessary sort in FileFormatWriter if user has specified sort in AQE (was: Avoid

[jira] [Updated] (SPARK-38578) Avoid unnecessary sort in FileFormatWriter if user has specified sort

2022-03-16 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-38578: -- Parent: SPARK-37063 Issue Type: Sub-task (was: Improvement) > Avoid unnecessary sort in

[jira] [Created] (SPARK-38578) Avoid unnecessary sort in FileFormatWriter if user has specified sort

2022-03-16 Thread XiDuo You (Jira)
XiDuo You created SPARK-38578: - Summary: Avoid unnecessary sort in FileFormatWriter if user has specified sort Key: SPARK-38578 URL: https://issues.apache.org/jira/browse/SPARK-38578 Project: Spark

[jira] [Updated] (SPARK-37796) ByteArrayMethods arrayEquals should fast skip the check of aligning with unaligned platform

2022-03-15 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-37796: -- Priority: Major (was: Minor) > ByteArrayMethods arrayEquals should fast skip the check of aligning

[jira] [Updated] (SPARK-36992) Improve byte array sort perf by unify getPrefix function of UTF8String and ByteArray

2022-03-15 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-36992: -- Priority: Major (was: Minor) > Improve byte array sort perf by unify getPrefix function of

[jira] [Updated] (SPARK-37037) Improve byte array sort by unify compareTo function of UTF8String and ByteArray

2022-03-15 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-37037: -- Priority: Major (was: Minor) > Improve byte array sort by unify compareTo function of UTF8String and

[jira] [Commented] (SPARK-38536) Spark 3 can not read mixed format partitions

2022-03-15 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17506789#comment-17506789 ] XiDuo You commented on SPARK-38536: --- it should be fixed by SPARK-36197 ? > Spark 3 can not read

[jira] [Updated] (SPARK-38519) AQE throw exception should respect SparkFatalException

2022-03-10 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-38519: -- Description: BroadcastExchangeExec will wrap fatal exception inside SparkFatalException and unwarp

[jira] [Updated] (SPARK-38519) AQE throw exception should respect SparkFatalException

2022-03-10 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-38519: -- Description: BroadcastExchangeExec will wrap fatal exception in SparkFatalException and unwarp it 

[jira] [Updated] (SPARK-38519) AQE throw exception should respect SparkFatalException

2022-03-10 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-38519: -- Description: BroadcastExchangeExec will wrap fatal exception in SparkFatalException and unwarp it in

[jira] [Created] (SPARK-38519) AQE throw exception should respect SparkFatalException

2022-03-10 Thread XiDuo You (Jira)
XiDuo You created SPARK-38519: - Summary: AQE throw exception should respect SparkFatalException Key: SPARK-38519 URL: https://issues.apache.org/jira/browse/SPARK-38519 Project: Spark Issue Type:

[jira] [Created] (SPARK-38410) Support specify initial partition number for rebalance

2022-03-03 Thread XiDuo You (Jira)
XiDuo You created SPARK-38410: - Summary: Support specify initial partition number for rebalance Key: SPARK-38410 URL: https://issues.apache.org/jira/browse/SPARK-38410 Project: Spark Issue Type:

[jira] [Updated] (SPARK-38406) Improve perfermance of ShufflePartitionsUtil createSkewPartitionSpecs

2022-03-03 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-38406: -- Parent: SPARK-37063 Issue Type: Sub-task (was: Improvement) > Improve perfermance of

[jira] [Updated] (SPARK-38406) Improve perfermance of ShufflePartitionsUtil createSkewPartitionSpecs

2022-03-03 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-38406: -- Description: If shuffle is skewed with tens of thousands of map partitions and reduce partitions in

[jira] [Created] (SPARK-38406) Improve perfermance of ShufflePartitionsUtil createSkewPartitionSpecs

2022-03-03 Thread XiDuo You (Jira)
XiDuo You created SPARK-38406: - Summary: Improve perfermance of ShufflePartitionsUtil createSkewPartitionSpecs Key: SPARK-38406 URL: https://issues.apache.org/jira/browse/SPARK-38406 Project: Spark

[jira] [Updated] (SPARK-38401) Unify get preferred locations for shuffle in AQE

2022-03-03 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-38401: -- Description: It has several issues with method `ShuffledRowRDD#getPreferredLocations`. * it does not

[jira] [Created] (SPARK-38401) Unify get preferred locations for shuffle in AQE

2022-03-02 Thread XiDuo You (Jira)
XiDuo You created SPARK-38401: - Summary: Unify get preferred locations for shuffle in AQE Key: SPARK-38401 URL: https://issues.apache.org/jira/browse/SPARK-38401 Project: Spark Issue Type:

[jira] [Updated] (SPARK-38322) Support query stage show runtime statistics in formatted explain mode

2022-02-24 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-38322: -- Description: The formatted explalin mode is the powerful explain mode to show the details of query

[jira] [Created] (SPARK-38322) Support query stage show runtime statistics in formatted explain mode

2022-02-24 Thread XiDuo You (Jira)
XiDuo You created SPARK-38322: - Summary: Support query stage show runtime statistics in formatted explain mode Key: SPARK-38322 URL: https://issues.apache.org/jira/browse/SPARK-38322 Project: Spark

[jira] [Commented] (SPARK-38172) Adaptive coalesce not working with df persist

2022-02-24 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38172?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17497870#comment-17497870 ] XiDuo You commented on SPARK-38172: --- thanks [~Naveenmts]  for the confirming ! > Adaptive coalesce

[jira] [Resolved] (SPARK-38172) Adaptive coalesce not working with df persist

2022-02-24 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You resolved SPARK-38172. --- Resolution: Won't Fix > Adaptive coalesce not working with df persist >

[jira] [Updated] (SPARK-38232) Explain formatted does not collect subqueries under query stage in AQE

2022-02-16 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-38232: -- Parent: SPARK-37063 Issue Type: Sub-task (was: Bug) > Explain formatted does not collect

[jira] [Updated] (SPARK-38232) Explain formatted does not collect subqueries under query stage in AQE

2022-02-16 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-38232: -- Description: ExplainUtils have not catched QueryStageExec during collecting subquries. So we can not

[jira] [Created] (SPARK-38232) Explain formatted does not collect subqueries under query stage in AQE

2022-02-16 Thread XiDuo You (Jira)
XiDuo You created SPARK-38232: - Summary: Explain formatted does not collect subqueries under query stage in AQE Key: SPARK-38232 URL: https://issues.apache.org/jira/browse/SPARK-38232 Project: Spark

[jira] [Commented] (SPARK-38172) Adaptive coalesce not working with df persist

2022-02-11 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38172?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17490773#comment-17490773 ] XiDuo You commented on SPARK-38172: --- hi [~Naveenmts]  have you tried enable this config ? {code:java}

[jira] [Updated] (SPARK-38185) Fix data incorrect if aggregate function is empty

2022-02-10 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-38185: -- Summary: Fix data incorrect if aggregate function is empty (was: Fix data incorrect if aggregate is

[jira] [Updated] (SPARK-38185) Fix data incorrect if aggregate is group only with empty function

2022-02-10 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-38185: -- Description: The group only condition should check if the aggregate expression is empty. In

[jira] [Created] (SPARK-38185) Fix data incorrect if aggregate is group only with empty function

2022-02-10 Thread XiDuo You (Jira)
XiDuo You created SPARK-38185: - Summary: Fix data incorrect if aggregate is group only with empty function Key: SPARK-38185 URL: https://issues.apache.org/jira/browse/SPARK-38185 Project: Spark

[jira] [Updated] (SPARK-38182) Fix NoSuchElementException if pushed filter does not contain any references

2022-02-10 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-38182: -- Description: reproduce: {code:java} CREATE TABLE t (c1 int) USING PARQUET; SET

[jira] [Created] (SPARK-38182) Fix NoSuchElementException if pushed filter does not contain any references

2022-02-10 Thread XiDuo You (Jira)
XiDuo You created SPARK-38182: - Summary: Fix NoSuchElementException if pushed filter does not contain any references Key: SPARK-38182 URL: https://issues.apache.org/jira/browse/SPARK-38182 Project: Spark

[jira] [Updated] (SPARK-38177) Fix wrong transformExpressions in Optimizer

2022-02-10 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-38177: -- Description: `transformExpressions` can only traverse all expressions in this current query plan, so

[jira] [Created] (SPARK-38177) Fix wrong transformExpressions in Optimizer

2022-02-10 Thread XiDuo You (Jira)
XiDuo You created SPARK-38177: - Summary: Fix wrong transformExpressions in Optimizer Key: SPARK-38177 URL: https://issues.apache.org/jira/browse/SPARK-38177 Project: Spark Issue Type:

[jira] [Updated] (SPARK-38162) Optimize one row plan in normal and AQE Optimizer

2022-02-10 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-38162: -- Description: Optimize the plan if its max row is equal to or less than 1 in these cases: - if the

[jira] [Updated] (SPARK-38162) Optimize one row plan in normal and AQE Optimizer

2022-02-09 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-38162: -- Summary: Optimize one row plan in normal and AQE Optimizer (was: Optimize one max row plan in normal

[jira] [Updated] (SPARK-38162) Optimize one max row plan in normal and AQE Optimizer

2022-02-09 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-38162: -- Description: Optimize the plan if its max row is equal to or less than 1 in these cases: * if sort

[jira] [Updated] (SPARK-38162) Optimize one max row plan in normal and AQE Optimizer

2022-02-09 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-38162: -- Summary: Optimize one max row plan in normal and AQE Optimizer (was: Remove distinct in aggregate if

[jira] [Created] (SPARK-38162) Remove distinct in aggregate if its child is empty

2022-02-09 Thread XiDuo You (Jira)
XiDuo You created SPARK-38162: - Summary: Remove distinct in aggregate if its child is empty Key: SPARK-38162 URL: https://issues.apache.org/jira/browse/SPARK-38162 Project: Spark Issue Type:

[jira] [Created] (SPARK-38148) Do not add dynamic partition pruning if there exists static partition pruning

2022-02-08 Thread XiDuo You (Jira)
XiDuo You created SPARK-38148: - Summary: Do not add dynamic partition pruning if there exists static partition pruning Key: SPARK-38148 URL: https://issues.apache.org/jira/browse/SPARK-38148 Project:

[jira] [Commented] (SPARK-33832) Add an option in AQE to mitigate skew even if it causes an new shuffle

2022-01-27 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17483604#comment-17483604 ] XiDuo You commented on SPARK-33832: --- thank you [~dongjoon] ! > Add an option in AQE to mitigate skew

[jira] [Updated] (SPARK-38013) AQE can change bhj to smj if no extra shuffle introduce

2022-01-27 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-38013: -- Parent: SPARK-37063 Issue Type: Sub-task (was: Task) > AQE can change bhj to smj if no extra

[jira] [Commented] (SPARK-38013) AQE can change bhj to smj if no extra shuffle introduce

2022-01-27 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17483598#comment-17483598 ] XiDuo You commented on SPARK-38013: --- Add a test to cover this behavior > AQE can change bhj to smj if

[jira] [Resolved] (SPARK-38013) AQE can change bhj to smj if no extra shuffle introduce

2022-01-25 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You resolved SPARK-38013. --- Resolution: Won't Fix > AQE can change bhj to smj if no extra shuffle introduce >

[jira] [Commented] (SPARK-38013) AQE can change bhj to smj if no extra shuffle introduce

2022-01-25 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17482209#comment-17482209 ] XiDuo You commented on SPARK-38013: --- seems it is allowed in AQE, not a bug otherwise .. > AQE can

[jira] [Updated] (SPARK-38013) AQE can change bhj to smj if no extra shuffle introduce

2022-01-25 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-38013: -- Issue Type: Task (was: Bug) > AQE can change bhj to smj if no extra shuffle introduce >

<    1   2   3   4   5   >