GraphX: unbalanced computation and slow runtime on livejournal network

2015-04-19 Thread harenbergsd
-unbalanced-computation-and-slow-runtime-on-livejournal-network-tp22565.html Sent from the Apache Spark User List mailing list archive at Nabble.com. - To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e

Re: GraphX: unbalanced computation and slow runtime on livejournal network

2015-04-19 Thread hnahak
-- 1.2 3 -- 1.3 5 -- 1.6 10 -- 2.6 20 -- 3.9 -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/GraphX-unbalanced-computation-and-slow-runtime-on-livejournal-network-tp22565p22566.html Sent from the Apache Spark User

GraphX: unbalanced computation and slow runtime on livejournal network

2015-04-19 Thread Steven Harenberg
Hi all, I have been testing GraphX on the soc-LiveJournal1 network from the SNAP repository. Currently I am running on c3.8xlarge EC2 instances on Amazon. These instances have 32 cores and 60GB RAM per node, and so far I have run SSSP, PageRank, and WCC on a 1, 4, and 8 node cluster. The issues