Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/22112 Personally I don't want to talk about implementation until we decide what we want our semantics to be around the unordered operations because that affects any implementation. If we are saying we need to fix zip and any other unordered operation that means we don't really support unordered operations and everything needs to be sorted. I would propose we fix the things that are using the round robin type partitioning (repartition) but then unordered things like zip/MapPartitions (via user code) we document or perhaps give the user the option to sort. @mridulm you caught the issues with zip and others and have said they need to be fixed, what are your thoughts?
--- --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org