How to measure communication between nodes in Spark Standalone Cluster?

2014-11-17 Thread Hlib Mykhailenko
Hello, I use a Spark Standalone cluster and I want to measure internode communication somehow. As I understand it, GraphX transfers only vertex values. Am I right? But I do not want to get the number of bytes which were transferred between any two nodes. So is there a way to measure how many values

Re: How to measure communication between nodes in Spark Standalone Cluster?

2014-11-17 Thread Akhil Das
You can use Ganglia to see the overall data transfer across the cluster/nodes. I don't think there's a direct way to get the vertices being transferred.
Thanks
Best Regards
On Mon, Nov 17, 2014 at 4:29 PM, Hlib Mykhailenko hlib.mykhaile...@inria.fr wrote: Hello, I use Spark Standalone

Re: How to measure communication between nodes in Spark Standalone Cluster?

2014-11-17 Thread Yifan LI
I am not sure there is a direct way (an API in GraphX, etc.) to measure the number of vertex values transferred among nodes during computation. It might depend on:
- the operations in your application, e.g. each vertex communicating only with its immediate neighbours.
- the partition strategy
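
To illustrate why the count depends on both the operation and the partitioning, here is a minimal sketch in plain Python. It is not a GraphX or Spark API: the function `count_remote_messages`, the sample graph, and the two vertex-to-node assignments are all hypothetical. It assumes a simplified model in which each vertex lives on exactly one node and, in one step, sends its value along every outgoing edge; GraphX itself partitions edges and replicates vertices, so real numbers would differ.

```python
from collections import defaultdict


def count_remote_messages(edges, vertex_owner):
    """Count vertex values that cross node boundaries in one step.

    Assumed model (hypothetical, not GraphX internals): every vertex sends
    its value along each outgoing edge, and a message is "remote" when the
    source and destination vertices are owned by different nodes.
    Returns a dict mapping (sending_node, receiving_node) to a count.
    """
    per_pair = defaultdict(int)
    for src, dst in edges:
        sender, receiver = vertex_owner[src], vertex_owner[dst]
        if sender != receiver:
            per_pair[(sender, receiver)] += 1
    return dict(per_pair)


# A small directed graph, given as (source, destination) pairs.
edges = [(1, 2), (2, 3), (3, 1), (3, 4), (4, 1)]

# Two made-up placements of the same four vertices on two nodes.
hash_owner = {v: v % 2 for v in (1, 2, 3, 4)}   # vertex id modulo node count
block_owner = {1: 0, 2: 0, 3: 1, 4: 1}          # contiguous id ranges

hash_counts = count_remote_messages(edges, hash_owner)
block_counts = count_remote_messages(edges, block_owner)

print("hash placement: ", hash_counts, "total", sum(hash_counts.values()))
print("block placement:", block_counts, "total", sum(block_counts.values()))
```

On this toy graph the same neighbour-to-neighbour operation sends four values across nodes under the modulo placement and three under the block placement, which is the kind of difference a byte-level tool such as Ganglia would not break down per value.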