If I have 2 RDDs which depend on the same RDD like the following:

val rdd1 = ...

val rdd2 = rdd1.groupBy()...

val rdd3 = rdd1.groupBy()...


If I don't cache rdd1, will it's lineage be calculated twice (one for rdd2
and one for rdd3)?

Reply via email to