It's purely for estimation, used when guessing whether it's safe to do a
broadcast join. We picked a somewhat arbitrary number that we thought was
larger than the common case (it's better to overestimate, to avoid OOM).
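
As a rough illustration of that kind of estimate, here is a minimal Python sketch. It is not Spark's code: the per-type sizes, the function names, and the threshold value are all assumptions made up for this example. The idea is only that a per-type default size lets you guess a table's size from its schema and row count, and compare that guess against a broadcast threshold (in Spark the corresponding setting is spark.sql.autoBroadcastJoinThreshold).

```python
# Illustrative per-type size guesses in bytes. These are assumptions for
# this sketch, not values copied from Spark's source.
ASSUMED_DEFAULT_SIZE = {
    "int": 4,
    "long": 8,
    "double": 8,
    "string": 4096,  # deliberately generous guess for a variable-length type
}

# Hypothetical threshold in bytes, chosen only for this example.
BROADCAST_THRESHOLD_BYTES = 10 * 1024 * 1024


def estimated_row_size(column_types):
    """Sum the assumed default size of every column in one row."""
    return sum(ASSUMED_DEFAULT_SIZE[t] for t in column_types)


def estimated_table_size(column_types, row_count):
    """Estimate total size as (estimated row size) * (number of rows)."""
    return estimated_row_size(column_types) * row_count


def safe_to_broadcast(column_types, row_count,
                      threshold=BROADCAST_THRESHOLD_BYTES):
    """Return True when the estimated size fits under the threshold."""
    return estimated_table_size(column_types, row_count) <= threshold
```

Because the guess for variable-length types is deliberately large, a table with string columns is more likely to be judged too big to broadcast, which errs on the side of avoiding OOM rather than on the side of the faster join.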
On Wed, Oct 7, 2015 at 10:11 PM, vivek bhaskar wrote:
I want to understand what the default size for a given data type is used for.
The following link mentions that it's for internal size estimation.
https://spark.apache.org/docs/latest/api/java/org/apache/spark/sql/types/DataType.html
The above behavior is also reflected in the code, where the default value seems
to be used