if you want the result as RDD of (key, 1)
new_rdd = rdd.filter(x => x._2 == 1)
if you want result as RDD of keys (since you know the values are 1), then
new_rdd = rdd.filter(x => x._2 == 1).map(x => x._1)
x._1 and x._2 are the way of scala to access the key and value from
key/value pair.
You don't. That's what filter or the partial function version of collect
are for:
val transformedRDD = yourRDD.collect { case (k, v) if k == 1 => v }
On Wed, Sep 17, 2014 at 3:24 AM, Deep Pradhan
wrote:
> Hi,
> I want to make the following changes in the RDD (create new RDD from the
> existing