[ https://issues.apache.org/jira/browse/SPARK-42513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17691896#comment-17691896 ]
Apache Spark commented on SPARK-42513: -------------------------------------- User 'wangyum' has created a pull request for this issue: https://github.com/apache/spark/pull/40114 > Push down topK through join > --------------------------- > > Key: SPARK-42513 > URL: https://issues.apache.org/jira/browse/SPARK-42513 > Project: Spark > Issue Type: Improvement > Components: SQL > Affects Versions: 3.4.0 > Reporter: Yuming Wang > Priority: Major > Attachments: after-UI.png, before-UI.png > > > {code:scala} > spark.range(100000000).selectExpr("id % 10000 as a", "id as > b").write.saveAsTable("t1") > spark.range(100000000).selectExpr("id % 10000 as x", "id as > y").write.saveAsTable("t2") > sql("select * from t1 left join t2 on a = x order by b limit 5").collect() > spark.sql("set > spark.sql.optimizer.excludedRules=org.apache.spark.sql.catalyst.optimizer.LimitPushDown") > sql("select * from t1 left join t2 on a = x order by b limit 5").collect() > {code} -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org