[jira] [Created] (TINKERPOP3-925) Use persisted SparkContext to persist an RDD across Spark jobs.

Marko A. Rodriguez (JIRA) Tue, 27 Oct 2015 14:03:06 -0700

Marko A. Rodriguez created TINKERPOP3-925:
---------------------------------------------


             Summary: Use persisted SparkContext to persist an RDD across Spark 
jobs.
                 Key: TINKERPOP3-925
                 URL: https://issues.apache.org/jira/browse/TINKERPOP3-925
             Project: TinkerPop 3
          Issue Type: Improvement
          Components: hadoop
    Affects Versions: 3.0.2-incubating
            Reporter: Marko A. Rodriguez
            Assignee: Marko A. Rodriguez
             Fix For: 3.1.0-incubating


If a provider is using Spark, they are currently forced to have HDFS be used to 
store intermediate RDD data. However, if they plan on using that data in a 
{{GraphComputer}} "job chain," then they should be able to lookup a 
{{.cached()}} RDD by name. 

Create a {{inputGraphRDD.name}} and {{outputGraphRDD.name}} to make it so that 
the configuration references {{SparkContext.getPersitedRDDs()}}.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (TINKERPOP3-925) Use persisted SparkContext to persist an RDD across Spark jobs.

Reply via email to