cache() VS cacheTable()

Srikanth Mon, 13 Jul 2015 08:55:19 -0700

Hello,

I was reading "learning spark" book and saw a tip in chapter 9 that read
   "In Spark 1.2, the regular cache() method on RDDs also results in a
cacheTable()"


Is that true? When I cache a RDD and cache same data as a dataframe I see
that memory usage for dataframe cache is way less than RDD cache. I thought
this difference is due to columnar format used by dataframe. As per the
statement in the book, cache size should be similar.

Srikanth

cache() VS cacheTable()

Reply via email to