Re: Imbalanced shuffle read

2014-11-12 Thread ankits
-shuffle-read-tp18648p18791.html Sent from the Apache Spark User List mailing list archive at Nabble.com.

Re: Imbalanced shuffle read

2014-11-12 Thread ankits
running with 2 nodes, I verified that one partition is completely empty and the other contains all the records. What is going wrong with the partitioning here? -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Imbalanced-shuffle-read-tp18648p18790.html
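
One way a two-partition shuffle can end up with an empty partition is hash partitioning over keys whose hashes all share the same remainder. The thread does not say which partitioner or keys are in use, so the following is only a sketch under that assumption, written in plain Python without Spark. The function names and the all-even key set are hypothetical and chosen purely for illustration.

```python
from collections import Counter

def hash_partition(key, num_partitions):
    # Assumed rule: partition index = non-negative modulo of the key's hash.
    # Python's % already returns a non-negative result for a positive modulus.
    return hash(key) % num_partitions

def partition_counts(keys, num_partitions):
    # Count how many records land in each partition.
    return dict(Counter(hash_partition(k, num_partitions) for k in keys))

# Hypothetical keys: every key is an even integer, so with 2 partitions
# they all share remainder 0 and one partition stays empty.
keys = [2 * i for i in range(1000)]

print(partition_counts(keys, 2))  # all records in partition 0
print(partition_counts(keys, 3))  # records spread over 3 partitions
```

If the real keys behave like this, changing the number of partitions or the key used for the shuffle would be the things to check first; whether that applies here depends on details not shown in the message.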

Imbalanced shuffle read

2014-11-11 Thread ankits
with a huge shuffle read and takes a long time to finish. Can someone explain why the read is all on one node and how to parallelize this better? -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Imbalanced-shuffle-read-tp18648.html

Re: Imbalanced shuffle read

2014-11-11 Thread Akhil Das
why the read is all on one node and how to parallelize this better? -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Imbalanced-shuffle-read-tp18648.html