If I have two RDDs that depend on the same RDD, like the following:

    val rdd1 = ...
    val rdd2 = rdd1.groupBy()...
    val rdd3 = rdd1.groupBy()...

If I don't cache rdd1, will its lineage be computed twice (once for rdd2 and once for rdd3)?