Hi Alaa, Partition when using CassandraRDD depends on your partition key in Cassandra table.
If you see only 1 partition in the RDD it means all the rows you have selected have same partition_key in C* Thanks Ankur On Thu, Sep 3, 2015 at 11:54 AM, Alaa Zubaidi (PDF) <alaa.zuba...@pdf.com> wrote: > Hi, > > I testing Spark and Cassandra, Spark 1.4, Cassandra 2.1.7 cassandra spark > connector 1.4, running in standalone mode. > > I am getting 4000 rows from Cassandra (4mb row), where the row keys are > random. > ...... sc.cassandraTable[RES](keyspace,res_name).where(res_where).cache > > I am expecting that it will generate few partitions. > However, I can ONLY see 1 partition. > I cached the CassandraRDD and in the UI storage tab it shows ONLY 1 > partition. > > Any idea, why I am getting 1 partition? > > Thanks, > Alaa > > > > *This message may contain confidential and privileged information. If it > has been sent to you in error, please reply to advise the sender of the > error and then immediately permanently delete it and all attachments to it > from your systems. If you are not the intended recipient, do not read, > copy, disclose or otherwise use this message or any attachments to it. The > sender disclaims any liability for such unauthorized use. PLEASE NOTE that > all incoming e-mails sent to PDF e-mail accounts will be archived and may > be scanned by us and/or by external service providers to detect and prevent > threats to our systems, investigate illegal or inappropriate behavior, > and/or eliminate unsolicited promotional e-mails (“spam”). If you have any > concerns about this process, please contact us at * > *legal.departm...@pdf.com* <legal.departm...@pdf.com>*.*