Hi,

I am trying to leftJoin an other vertice RDD(e.g vB) with this one(vA).
vA.leftJoin(vB)(f)

- vA is the vertices RDD in graph G, and G is edge-partitioned using 
EdgePartition2D.

- vB is created using default partitioner(actually I am not sure...)

So, I am wondering, that if vB has same partitioner to vA, what will 
graphx(spark) do to handle this case?
for instance, as below
1) to check the partitioner of vB.
2) to do leftJoin operations, on each machine separately, for those co-located 
partitions of vA and vB.
right?

But, if vB’s partitioner is different, what will happen? how they communicate 
between partitions(and machines)?


Anyone has some points on this, or communication between RDDs? 

Thanks, :)

Best,
Yifan LI





Reply via email to