-shuffle-read-tp18648p18791.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.
-
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
For additional commands, e-mail: user-h...@spark.apache.org
running with
2 nodes, i verified that one partition is completely empty, and the other
contains all the records.
What is going wrong with the partitioning here?
--
View this message in context:
http://apache-spark-user-list.1001560.n3.nabble.com/Imbalanced-shuffle-read-tp18648p18790.html
Sent
with a
huge shuffle read and takes a long time to finish.
Can someone explain why the read is all on one node and how to parallelize
this better?
--
View this message in context:
http://apache-spark-user-list.1001560.n3.nabble.com/Imbalanced-shuffle-read-tp18648.html
Sent from the Apache Spark
why the read is all on one node and how to parallelize
this better?
--
View this message in context:
http://apache-spark-user-list.1001560.n3.nabble.com/Imbalanced-shuffle-read-tp18648.html
Sent from the Apache Spark User List mailing list archive at Nabble.com