Re: GraphX

Ankur Dave Sat, 02 Aug 2014 13:22:22 -0700

At 2014-08-02 21:29:33 +0530, Deep Pradhan <[email protected]> wrote:
> How should I run graphx codes?


At the moment it's a little more complicated to run the GraphX algorithms than 
the Spark examples due to SPARK-1986 [1]. There is a driver program in 
org.apache.spark.graphx.lib.Analytics which you can invoke using spark-submit:

    $SPARK_HOME/bin/spark-submit --master local[*] \
        --class org.apache.spark.graphx.lib.Analytics \
        $SPARK_HOME/assembly/target/scala-2.10/spark-assembly-*.jar \
        pagerank /edge-list-file.txt --numEPart=8 --numIter=10 \
        --partStrategy=EdgePartition2D

The Analytics driver supports running PageRank, connected components, and
triangle count. For the other algorithms, you can use the Spark shell:

    import org.apache.spark.graphx._
    val graph = (GraphLoader.edgeListFile(sc, "/edge-list-file.txt", minEdgePartitions = 8)
      .partitionBy(PartitionStrategy.EdgePartition2D))
    // Run algorithms on graph
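
As a rough sketch of what "run algorithms on graph" can look like in the
shell, the snippet below builds a small toy graph and calls a few of the
built-in GraphOps methods. The toy edge list, the tolerance 0.0001, and the
iteration count 10 are illustrative assumptions, not recommendations; `sc` is
the SparkContext that the Spark shell provides.

    import org.apache.spark.graphx._

    // Hypothetical toy graph: a triangle (1, 2, 3) plus a separate edge (4, 5).
    // Edges are written with srcId < dstId, which triangle counting expects.
    val edges = sc.parallelize(Seq((1L, 2L), (2L, 3L), (1L, 3L), (4L, 5L)))
    val toyGraph = (Graph.fromEdgeTuples(edges, defaultValue = 1)
      .partitionBy(PartitionStrategy.EdgePartition2D))

    // PageRank until convergence (tolerance is an assumed value)
    val ranks = toyGraph.pageRank(0.0001).vertices

    // PageRank for a fixed number of iterations (10 is an assumed value)
    val staticRanks = toyGraph.staticPageRank(10).vertices

    // Connected components: each vertex is labeled with the lowest
    // vertex id in its component
    val components = toyGraph.connectedComponents().vertices

    // Triangle count per vertex
    val triangles = toyGraph.triangleCount().vertices

    // Print (vertexId, value) pairs; fine for a toy graph, too much for a big one
    ranks.collect().foreach(println)
    components.collect().foreach(println)
    triangles.collect().foreach(println)

The same calls should work on the graph loaded from the edge list file above;
only the toy graph construction is specific to this sketch.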

Ankur

[1] https://issues.apache.org/jira/browse/SPARK-1986
