XiDuo You created SPARK-35853: --------------------------------- Summary: Remark the shuffle origin to ENSURE_REQUIREMENTS as far as possible Key: SPARK-35853 URL: https://issues.apache.org/jira/browse/SPARK-35853 Project: Spark Issue Type: Improvement Components: SQL Affects Versions: 3.2.0 Reporter: XiDuo You
In some queries, we might repartition by some column with a large partition number manually to make parallelism big enough. However if the output partitioning satisfies some other node (e.g. join/aggregate), this shuffle can not be optimized by AQE due to the shuffle origin. -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org