Re: GraphX triplets on 5-node graph

2014-05-28 Thread Reynold Xin
Take a look at this one: https://issues.apache.org/jira/browse/SPARK-1188

It was an optimization that added user inconvenience. We got rid of that
now in Spark 1.0.



On Wed, May 28, 2014 at 11:48 PM, Michael Malak wrote:

> Shouldn't I be seeing N2 and N4 in the output below? (Spark 0.9.0 REPL) Or
> am I missing something fundamental?
>
>
> val nodes = sc.parallelize(Array((1L, "N1"), (2L, "N2"), (3L, "N3"), (4L,
> "N4"), (5L, "N5")))
> val edges = sc.parallelize(Array(Edge(1L, 2L, "E1"), Edge(1L, 3L, "E2"),
> Edge(2L, 4L, "E3"), Edge(3L, 5L, "E4")))
> Graph(nodes, edges).triplets.collect
> res1: Array[org.apache.spark.graphx.EdgeTriplet[String,String]] =
> Array(((1,N1),(3,N3),E2), ((1,N1),(3,N3),E2), ((3,N3),(5,N5),E4),
> ((3,N3),(5,N5),E4))
>


GraphX triplets on 5-node graph

2014-05-28 Thread Michael Malak
Shouldn't I be seeing N2 and N4 in the output below? (Spark 0.9.0 REPL) Or am I 
missing something fundamental?


val nodes = sc.parallelize(Array((1L, "N1"), (2L, "N2"), (3L, "N3"), (4L, 
"N4"), (5L, "N5"))) 
val edges = sc.parallelize(Array(Edge(1L, 2L, "E1"), Edge(1L, 3L, "E2"), 
Edge(2L, 4L, "E3"), Edge(3L, 5L, "E4"))) 
Graph(nodes, edges).triplets.collect 
res1: Array[org.apache.spark.graphx.EdgeTriplet[String,String]] = 
Array(((1,N1),(3,N3),E2), ((1,N1),(3,N3),E2), ((3,N3),(5,N5),E4), 
((3,N3),(5,N5),E4))