groupBy cannot handle large RDDs
Could anyone have a look at this? It looks like a bug:
http://stackoverflow.com/questions/38106554/groupby-cannot-handle-large-rdds

Best regards,
Kaiyin ZHONG
Spark RDD aggregate action behaves strangely
Could anyone have a look at this?
http://stackoverflow.com/questions/38100918/spark-rdd-aggregate-action-behaves-strangely

Thanks!

Best regards,
Kaiyin ZHONG