[ https://issues.apache.org/jira/browse/SPARK-44571?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
jiaan.geng updated SPARK-44571:
-------------------------------
    Summary: Eliminate the Join by combine multiple Aggregates  (was: Eliminate the Join by Combine multiple Aggregates)

> Eliminate the Join by combine multiple Aggregates
> -------------------------------------------------
>
>                 Key: SPARK-44571
>                 URL: https://issues.apache.org/jira/browse/SPARK-44571
>             Project: Spark
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 3.5.0
>            Reporter: jiaan.geng
>            Priority: Major
>
> Recently, I investigated test case q28, which belongs to the TPC-DS
> queries.
> The query contains multiple scalar subqueries with aggregation, connected
> by inner joins.
> If we can merge the filters and aggregates, we can scan the data source only
> once and eliminate the join, thereby avoiding a shuffle. This change should
> improve performance.
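
To illustrate the rewrite the description proposes, here is a minimal sketch. It is not Spark code and not the optimizer rule itself: it uses Python's built-in sqlite3 module only to check that the two query shapes return the same row. The table and column names (store_sales, ss_quantity, ss_list_price) follow the TPC-DS schema, but the two quantity buckets and the sample rows are made up for illustration, and the real q28 has more subqueries and more aggregates per subquery. The merge is safe here because each subquery is a global aggregate (no GROUP BY) and so always yields exactly one row.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE store_sales (ss_quantity INTEGER, ss_list_price REAL)")
# Illustrative sample data (an assumption, not taken from TPC-DS).
conn.executemany(
    "INSERT INTO store_sales VALUES (?, ?)",
    [(1, 10.0), (3, 20.0), (5, 30.0), (7, 40.0), (9, 50.0), (10, 60.0), (15, 70.0)],
)

# Original shape: one filtered aggregate per subquery, combined with a join.
# Each subquery scans store_sales separately.
joined = conn.execute("""
    SELECT b1.avg_price, b1.cnt, b2.avg_price, b2.cnt
    FROM (SELECT AVG(ss_list_price) AS avg_price, COUNT(ss_list_price) AS cnt
          FROM store_sales WHERE ss_quantity BETWEEN 0 AND 5) b1
    CROSS JOIN
         (SELECT AVG(ss_list_price) AS avg_price, COUNT(ss_list_price) AS cnt
          FROM store_sales WHERE ss_quantity BETWEEN 6 AND 10) b2
""").fetchone()

# Merged shape: a single scan. Each subquery's filter moves into its own
# aggregates (CASE without ELSE yields NULL, which AVG and COUNT ignore),
# and the remaining WHERE clause is the OR of the original filters.
merged = conn.execute("""
    SELECT AVG(CASE WHEN ss_quantity BETWEEN 0 AND 5 THEN ss_list_price END),
           COUNT(CASE WHEN ss_quantity BETWEEN 0 AND 5 THEN ss_list_price END),
           AVG(CASE WHEN ss_quantity BETWEEN 6 AND 10 THEN ss_list_price END),
           COUNT(CASE WHEN ss_quantity BETWEEN 6 AND 10 THEN ss_list_price END)
    FROM store_sales
    WHERE ss_quantity BETWEEN 0 AND 5 OR ss_quantity BETWEEN 6 AND 10
""").fetchone()

print(joined)
print(merged)
```

Both queries return the same single row, but the merged form reads store_sales once and needs no join. In a distributed engine, that is where the saved scan and shuffle would come from.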