Can anyone help me understand why using coalesce causes my executors to crash with out-of-memory errors? What happens during coalesce that increases memory usage so much?

If I do:
hadoopFile -> sample -> cache -> map -> saveAsNewAPIHadoopFile

everything works fine, but if I do:
hadoopFile -> sample -> coalesce -> cache -> map -> saveAsNewAPIHadoopFile

my executors crash with out-of-memory exceptions.

Is there any documentation that explains what causes the increased memory requirements with coalesce? It seems to be less of a problem if I coalesce into a larger number of partitions, but I'm not sure why that is. How would I estimate how much additional memory coalesce requires?
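For context, my current guess (unverified, and the partition sizes below are made-up numbers, not from my actual job) is that coalesce without a shuffle merges several parent partitions into each output partition, so each cached partition grows by roughly that merge factor. This is the back-of-envelope estimate I've been attempting:

```python
import math

def coalesced_partition_bytes(input_partitions, partition_bytes, target_partitions):
    """Estimate the cached size of one partition after coalesce(target_partitions),
    assuming coalesce merges ~ceil(P / n) parent partitions per output partition."""
    merge_factor = math.ceil(input_partitions / target_partitions)
    return merge_factor * partition_bytes

# Hypothetical numbers: 1000 input partitions of ~128 MB each, cached.
# Coalescing to 10 partitions would give ~100 * 128 MB = 12800 MB per
# cached partition, which would easily exceed an executor's memory,
# while coalescing to 100 partitions gives a much smaller 1280 MB.
print(coalesced_partition_bytes(1000, 128, 10))
print(coalesced_partition_bytes(1000, 128, 100))
```

If that model is right, it would at least explain why a larger target partition count hurts less, but I'd appreciate confirmation that this is actually how coalesce behaves.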

Thanks.
