groupBy cannot handle large RDDs

2016-06-29 Thread Kaiyin Zhong
Could anyone have a look at this? It looks like a bug:
http://stackoverflow.com/questions/38106554/groupby-cannot-handle-large-rdds
Best regards,
Kaiyin ZHONG

Spark RDD aggregate action behaves strangely

2016-06-29 Thread Kaiyin Zhong
Could anyone have a look at this?
http://stackoverflow.com/questions/38100918/spark-rdd-aggregate-action-behaves-strangely
Thanks!
Best regards,
Kaiyin ZHONG