Hi,

Is there a good way to materialize derivative RDDs from, say, a HadoopRDD
while reading in the data only once? One way to do so would be to cache
the HadoopRDD and then create the derivative RDDs, but that would require
enough RAM to cache the HadoopRDD, which is not an option in my case.

Thanks,
Akshat
