2010YOUY01 commented on code in PR #17467:
URL: https://github.com/apache/datafusion/pull/17467#discussion_r2329222755
##########
datafusion/common/src/config.rs:
##########
@@ -822,9 +822,34 @@ config_namespace! {
/// process to reorder the join keys
pub top_down_join_key_reordering: bool, default = true
- /// When set to true, the physical plan optimizer will prefer HashJoin
over SortMergeJoin.
+ /// When set to true, the physical plan optimizer will prefer HashJoin
when applicable.
+ /// If there are multiple preferred and applicable join types, the
optimizer
+ /// will choose one based on heuristics.
/// HashJoin can work more efficiently than SortMergeJoin but consumes
more memory
- pub prefer_hash_join: bool, default = true
+ pub prefer_hash_join: bool, default = false
Review Comment:
It's not a breaking change. By default there is no join types preferred, and
the planner will prioritize HJ according to heuristics, setting it to `true`
will just override the planner's default behavior.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]