I'm querying a single cassandra partition using sqlContext and Its temView 
which creates more than 2000 tasks on spark and took about 360 seconds:



sqlContext.read().format("org.apache.spark.sql.cassandra).options(ops).load.createOrReplaceTempView("tableName")



But using javaFunctions(sc).cassandraTable().where() it creates only one task 
which response in 200 ms!

I'm using exactly the same where clause for both scenarios.

Spark UI shows like 60 GB input for sqlcontext scenario and only a few KBs for 
javaFunctions scenario



Sent using Zoho Mail










Reply via email to