RDD Indexes and how to fetch all edges with a given label

2014-10-14 Thread Soumitra Johri
Hi All,

I have a Graph with millions of edges. Edges are represented by
org.apache.spark.rdd.RDD[org.apache.spark.graphx.Edge[String]]
= MappedRDD[4] . I have two questions :

1)How can I fetch all the nodes with a given edge label ( all edges with a
given property )

2) Is it possible to create indexes on the RDDs or a specific column of the
RDD to make the look up faster?

Please excuse me for the triviality of the question, I am new to the
language and its taking me sometime to get used to it.
Warm Regards
Soumitra


Re: RDD Indexes and how to fetch all edges with a given label

2014-10-14 Thread Soumitra Johri
Hi,

With respect to the first issue, one possible way is to filter the graph
via 'graph.subgraph(epred = e = e.attr == edgeLabel)'  ,  but I am still
curious if we can index RDDs.


Warm Regards
Soumitra

On Tue, Oct 14, 2014 at 2:46 PM, Soumitra Johri 
soumitra.siddha...@gmail.com wrote:

 Hi All,

 I have a Graph with millions of edges. Edges are represented by 
 org.apache.spark.rdd.RDD[org.apache.spark.graphx.Edge[String]]
 = MappedRDD[4] . I have two questions :

 1)How can I fetch all the nodes with a given edge label ( all edges with a
 given property )

 2) Is it possible to create indexes on the RDDs or a specific column of
 the RDD to make the look up faster?

 Please excuse me for the triviality of the question, I am new to the
 language and its taking me sometime to get used to it.
 Warm Regards
 Soumitra