Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/10162 I would not bake this logic into the logical operator, I think the general approach of doing it in Dataset is better. I just think that we need to do it in a way that does not change existing join functionality at all. It would be helpful to survey other systems and describe their APIs and make a proposal based on that.
--- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org