Hi,

After reducing by key, I'm trying to get data that is ordered both
across partitions (as with RangePartitioner) and within each partition
(as with sortByKey or repartitionAndSortWithinPartitions), pushing the
sorting down into the shuffle machinery of the reduce phase.
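
To make the target layout concrete, here is a minimal sketch in plain Python (deliberately not Spark code) of the result I'm after: values merged per key, keys assigned to partitions by range, and each partition sorted by key. The function names and the `bounds` list are hypothetical and only for illustration; a real range partitioner would pick its bounds by sampling the keys.

```python
from bisect import bisect_left
from operator import add


def reduce_by_key(records, func):
    """Merge all values that share a key, as a reduce-by-key step would."""
    merged = {}
    for key, value in records:
        merged[key] = func(merged[key], value) if key in merged else value
    return merged


def range_partition_sorted(merged, bounds):
    """Put each key into the range it falls in, then sort each partition by key.

    `bounds` (hypothetical) holds the inclusive upper bound of every
    partition except the last one, in ascending order.
    """
    partitions = [[] for _ in range(len(bounds) + 1)]
    for key, value in merged.items():
        partitions[bisect_left(bounds, key)].append((key, value))
    return [sorted(p, key=lambda kv: kv[0]) for p in partitions]


records = [(5, 1), (1, 2), (9, 3), (5, 4), (1, 5), (7, 6), (3, 7)]
merged = reduce_by_key(records, add)
partitions = range_partition_sorted(merged, bounds=[3, 6])
```

Reading the partitions in order then yields the keys in one global sort order, which is the property I would like to get out of a single shuffle.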

I think, but I may be wrong, that the correct way to do this is for
combineByKey to call setKeyOrdering on the ShuffledRDD that it returns.

Am I wrong? Can this be done with a combination of other
transformations at the same efficiency?

Thanks,
Marco
