Hi I have an RDD with elements as tuple ((key1,key2),value) where (key1,key2) is the partitioning key in my Cassandra table Now for each such element I have to do a read from Cassandra table. My Cassandra table and spark cluster are in different nodes and cant be co-located. Right now I am doing individual query using session.execute("...").* Should I prefer joinWithCassandraTable over individual queries? Do I get some performance benefit?*
As i understand joinWithCassandraTable is ultimately going to perform queries for each partitioningKey(or primary key not sure). Regards Vaibhav -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/JoinWithCassandraTable-over-individual-queries-tp26833.html Sent from the Apache Spark User List mailing list archive at Nabble.com. --------------------------------------------------------------------- To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org