If you want to take out "apple" and "orange" you might want to try
dataRDD.filter(_._2 !="apple").filter(_._2 !="orange") and so on.
...Manas
-
Manas Kar
--
View this message in context:
http://apache-spark-user-list.1001560.n3.nabble.com/equivalen
mail.com]
Sent: Tuesday, December 9, 2014 2:16 PM
To: u...@spark.incubator.apache.org
Subject: equivalent to sql in
i have and RDD i want to filter and for a single term all works good:
ie
dataRDD.filter(x=>x._2 =="apple")
how can i use multiple values, for example if i wanted to f
This is more a scala specific question. I would look at the List contains
implementation
--
View this message in context:
http://apache-spark-user-list.1001560.n3.nabble.com/equivalent-to-sql-in-tp20599p20600.html
Sent from the Apache Spark User List mailing list archive at Nabble.com
This could
get long winded as there may be quite a few. Can you filter using a set or a
list?
thanks
--
View this message in context:
http://apache-spark-user-list.1001560.n3.nabble.com/equivalent-to-sql-in-tp20599.html
Sent from the Apache