Martin Andersson created SEDONA-217:
---------------------------------------

             Summary: Automatically broadcast small datasets
                 Key: SEDONA-217
                 URL: https://issues.apache.org/jira/browse/SEDONA-217
             Project: Apache Sedona
          Issue Type: Improvement
            Reporter: Martin Andersson


Spark uses stats for each side of a join and automatically switches to a 
broadcast join if size is below threshold. Sedona could do the same.

This is where it's done in Spark.
https://github.com/apache/spark/blob/master/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/joins.scala



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to