Hi,

How can I ensure that a batch of DataFrames I make are all partitioned based on 
the value of one column common to them all?
For RDDs I would partitionBy a HashPartitioner, but I don't see that in the 
DataFrame API.
If I partition the RDDs that way, then do a toDF(), will the partitioning be 
preserved?

Thanks,
Ron

Reply via email to