Have you looked at RDD#randomSplit() (as an example)?

Cheers
On Tue, May 5, 2015 at 2:42 PM, Bill Q <bill.q....@gmail.com> wrote:
> Hi all,
> I have a large RDD that I map a function over. Based on the nature of
> each record in the input RDD, I will generate two types of data. I would
> like to save each type into its own RDD. But I can't seem to find an
> efficient way to do it. Any suggestions?
>
> Many thanks.
>
> Bill
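One way to read the randomSplit() hint: it builds each output RDD as its own pass over the same parent, and the same pattern can be applied to a split by record type. Below is a minimal sketch of that idea, not a definitive answer. It assumes the per-record function can tag its result as one of two types (here with Either), and the names SplitByType, classify, texts and numbers, the sample data, and the local master setting are all hypothetical. The mapped RDD is persisted so the map runs once instead of once per output.

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.rdd.RDD
import org.apache.spark.storage.StorageLevel

import scala.util.Try

object SplitByType {
  // Hypothetical stand-in for the real per-record function:
  // records that parse as Int become Right, everything else Left.
  def classify(record: String): Either[String, Int] =
    Try(record.trim.toInt).toOption.toRight(record)

  def main(args: Array[String]): Unit = {
    // Local master is an assumption so the sketch runs standalone.
    val conf = new SparkConf().setAppName("split-by-type").setMaster("local[*]")
    val sc = new SparkContext(conf)
    try {
      val input: RDD[String] = sc.parallelize(Seq("12", "abc", "7", "xyz"))

      // Apply the map once and persist the tagged result, so the two
      // passes below reuse it rather than recomputing the map.
      val tagged: RDD[Either[String, Int]] =
        input.map(classify).persist(StorageLevel.MEMORY_AND_DISK)

      // One pass per output type over the same persisted parent.
      // RDD.collect(PartialFunction) filters and maps in a single step.
      val texts: RDD[String] = tagged.collect { case Left(s) => s }
      val numbers: RDD[Int] = tagged.collect { case Right(n) => n }

      println(texts.collect().toSeq)
      println(numbers.collect().toSeq)

      // Release the cached data once both outputs have been materialized.
      tagged.unpersist()
    } finally {
      sc.stop()
    }
  }
}
```

Each output still scans the tagged data once, so this costs two passes over the cached RDD rather than one. Whether that is efficient enough depends on how expensive the map is relative to a scan, and on whether the tagged RDD fits in memory or on local disk.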