Re: rdd split into new rdd

2015-12-23 Thread Ted Yu
bq. {a=1, b=1, c=2, d=2} Can you elaborate on your criteria a bit more? The above seems to be a Set, not a Map. Cheers On Wed, Dec 23, 2015 at 7:11 AM, Yasemin Kaya wrote: > Hi, > > I have data > *JavaPairRDD> *format. In example: > > *(1610,

rdd split into new rdd

2015-12-23 Thread Yasemin Kaya
Hi,

I have data in *JavaPairRDD>* format. For example:

*(1610, {a=1, b=1, c=2, d=2})*

I want to get a *JavaPairRDD*. For example:

*(1610, {a, b})*
*(1610, {c, d})*

Is there a way to solve this problem?

Best,
yasemin

-- hiç ender hiç

Re: rdd split into new rdd

2015-12-23 Thread Stéphane Verlet
You should be able to do that using mapPartitions On Wed, Dec 23, 2015 at 8:24 AM, Ted Yu wrote: > bq. {a=1, b=1, c=2, d=2} > > Can you elaborate your criteria a bit more ? The above seems to be a Set, > not a Map. > > Cheers > > On Wed, Dec 23, 2015 at 7:11 AM, Yasemin Kaya

Re: rdd split into new rdd

2015-12-23 Thread Yasemin Kaya
How can I use mapPartitions? Could you give me an example? 2015-12-23 17:26 GMT+02:00 Stéphane Verlet : > You should be able to do that using mapPartitions > > On Wed, Dec 23, 2015 at 8:24 AM, Ted Yu wrote: > >> bq. {a=1, b=1, c=2, d=2} >> >> Can you

Re: rdd split into new rdd

2015-12-23 Thread Stéphane Verlet
I use Scala, but I guess the Java code would look like this:

JavaPairRDD> rdd ...
JavaPairRDD rdd2 = rdd.mapPartitionsToPair(function, true)

where function implements PairFlatMapFunction>,String, List>
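
The thread never states the grouping rule, so what follows is only a sketch under one assumption: keys that share the same value in a record's map go into the same group, which matches the (1610, {a, b}) and (1610, {c, d}) example. It shows only the per-record logic in plain Java, with no Spark dependency. The class name SplitByValue, the method name groupKeysByValue, and the String/Integer types are hypothetical, since the generic type parameters did not survive in the messages above.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical helper: not part of Spark and not taken from the thread.
public class SplitByValue {

    // Groups the keys of one record's map by their value, so that
    // {a=1, b=1, c=2, d=2} becomes [[a, b], [c, d]].
    // ASSUMPTION: "same value" is the intended split criterion.
    public static List<List<String>> groupKeysByValue(Map<String, Integer> attrs) {
        // TreeMap orders the groups by value, which keeps the output deterministic.
        Map<Integer, List<String>> byValue = new TreeMap<>();
        for (Map.Entry<String, Integer> e : attrs.entrySet()) {
            byValue.computeIfAbsent(e.getValue(), v -> new ArrayList<>()).add(e.getKey());
        }
        return new ArrayList<>(byValue.values());
    }

    public static void main(String[] args) {
        // LinkedHashMap preserves insertion order, so keys stay in a, b, c, d order.
        Map<String, Integer> attrs = new LinkedHashMap<>();
        attrs.put("a", 1);
        attrs.put("b", 1);
        attrs.put("c", 2);
        attrs.put("d", 2);

        // Emit one (key, group) pair per distinct value, reusing the record key 1610.
        for (List<String> group : groupKeysByValue(attrs)) {
            System.out.println("(1610, " + group + ")");
        }
        // prints:
        // (1610, [a, b])
        // (1610, [c, d])
    }
}
```

In a Spark job, a helper like this could be called once per record from inside the function passed to mapPartitionsToPair, emitting one output pair per group. Because the record key is unchanged, JavaPairRDD.flatMapValues may be a simpler fit than working on whole partitions; the exact functional interface it expects has changed between Spark versions, so check the API documentation for the version in use.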