It's really strange that the CPU load is so high while both disk and network
I/O load are so low. CLUSTER BY is just something similar to groupBy, so why
does it need so much CPU?



--
View this message in context: 
http://apache-spark-user-list.1001560.n3.nabble.com/Spark-1-0-1-SparkSQL-reduce-stage-of-shuffle-is-slow-tp10765p10851.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.
