Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/22112 I only see 2 options: 1. force a sort on these operations 2. do nothing and require users to sort or handle someway (checkpoint) if they care. You can possibly make optimizations to the above and try to do the failure thing if you know the output format ahead of time and conditionally use save to temp location based on output format and if you do an operations like zip.
--- --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org