If you want to take out apple and orange you might want to try
dataRDD.filter(_._2 !=apple).filter(_._2 !=orange) and so on.
...Manas
-
Manas Kar
--
View this message in context:
http://apache-spark-user-list.1001560.n3.nabble.com/equivalent-to-sql-in-tp20599p20616.html
Sent from
you filter using a set or a
list?
thanks
--
View this message in context:
http://apache-spark-user-list.1001560.n3.nabble.com/equivalent-to-sql-in-tp20599.html
Sent from the Apache Spark User List mailing list archive at Nabble.com
This is more a scala specific question. I would look at the List contains
implementation
--
View this message in context:
http://apache-spark-user-list.1001560.n3.nabble.com/equivalent-to-sql-in-tp20599p20600.html
Sent from the Apache Spark User List mailing list archive at Nabble.com
...@spark.incubator.apache.org
Subject: equivalent to sql in
i have and RDD i want to filter and for a single term all works good:
ie
dataRDD.filter(x=x._2 ==apple)
how can i use multiple values, for example if i wanted to filter my rdd to take
out apples and oranges and pears with out using . This could get