Kristin Cowalcijk created SEDONA-408:
----------------------------------------

             Summary: Set a reasonable default size for RasterUDT
                 Key: SEDONA-408
                 URL: https://issues.apache.org/jira/browse/SEDONA-408
             Project: Apache Sedona
          Issue Type: Improvement
            Reporter: Kristin Cowalcijk
             Fix For: 1.5.0


The {{defaultSize}} method of {{UserDefinedType}} is used by Spark SQL query 
optimizer to decide whether to broadcast the DataFrame or not. For 
{{RasterUDT}}, the default value is 100 bytes, which is the default size of 
{{BinaryType}}. This is almost always too small for {{RasterUDT}} and will lead 
to large raster DataFrames being mistakenly broadcasted. We can override this 
method and set a better default size for {{RasterUDT}}. Maybe 512 KB is a 
reasonable default value.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to