Without caching, each action is recomputed. So assuming rdd2 and rdd3 result in separate actions answer is yes.
On Mon, Dec 29, 2014 at 7:53 PM, Corey Nolet <cjno...@gmail.com> wrote: > If I have 2 RDDs which depend on the same RDD like the following: > > val rdd1 = ... > > val rdd2 = rdd1.groupBy()... > > val rdd3 = rdd1.groupBy()... > > > If I don't cache rdd1, will it's lineage be calculated twice (one for rdd2 > and one for rdd3)? >