Hash Partitioning and Dataframes

Daniel, Ronald (ELS-SDG) Fri, 08 May 2015 14:51:07 -0700

Hi,

How can I ensure that a batch of DataFrames I make are all partitioned based on 
the value of one column common to them all?
For RDDs I would partitionBy a HashPartitioner, but I don't see that in the 
DataFrame API.
If I partition the RDDs that way, then do a toDF(), will the partitioning be 
preserved?


Thanks,
Ron

Hash Partitioning and Dataframes

Reply via email to