The one issue with using Neo4j is that you need to persist the whole graph on one single machine i.e you can not shard the graph. I am not sure what is the size of your graph but if it is huge one way to shard could be to use the Component Id to shard. You can generate Component Id by running ConnectedComponent on your Graph in GrpahX of GraphFrames.
But GraphX or GraphFrame expect the data in to Dataframes (RDD) vertices and edges and it really relies on the relational nature of these entities to run any algorithm. AFAIK same is the case with Giraph too so if you want to use GraphFrames as your processing engine you can chose to persist your data in Hive tables and not in native graph format. Hope this helps. Thanks Ankur On Sun, Jan 29, 2017 at 10:27 AM, Felix Cheung <felixcheun...@hotmail.com> wrote: > Which graph do you are thinking about? > Here's one for neo4j > > https://neo4j.com/blog/neo4j-3-0-apache-spark-connector/ > > ------------------------------ > *From:* Deepak Sharma <deepakmc...@gmail.com> > *Sent:* Sunday, January 29, 2017 4:28:19 AM > *To:* spark users > *Subject:* Examples in graphx > > Hi There, > Are there any examples of using GraphX along with any graph DB? > I am looking to persist the graph in graph based DB and then read it back > in spark , process using graphx. > > -- > Thanks > Deepak > www.bigdatabig.com > www.keosha.net >