Short story: I want to write some parquet files so they are pre-partitioned by 
the same key. Then, when I read them
back in, joining the two tables on that key should be about as fast as things 
can be done.
Can I do that, and if so, how? I don't see how to control the partitioning of a 
SQL table, as opposed to PairRDDs.

Thanks,
Ron

Reply via email to