Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/21698 Still catching up on this and trying to understand all the cases. What happened to the other pr proposal of just using the hashPartitioner? Did we give up on that because of the skewed data issue or was there another issue there that I missed?
--- --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org