Reading an RDD from a checkpoint.

Daniel Imberman Mon, 14 Mar 2016 12:01:10 -0700

So I'm attempting to pre-compute my data such that I can pull an RDD from a
checkpoint. However, I'm finding that upon running the same job twice the
system is simply recreating the RDD from scratch.


Here is the code I'm implementing to create the checkpoint:

  def checkpointTeam(checkpointDir:String) ={
    sparkContext.setCheckpointDir(checkpointDir)
    val a = ObjectsTable.readAsRDD(sparkContext,
Some("5507424...")).persist() //read from HBase
    a.checkpoint()
    a.count()
  }

I've checked multiple sources but none of them seem to explicitly state how
to read the values. I'd like to be able to treat the checkpoint as a
snapshot so I can read it from other jobs.



--
View this message in context: 
http://apache-spark-user-list.1001560.n3.nabble.com/Reading-an-RDD-from-a-checkpoint-tp26486.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
For additional commands, e-mail: user-h...@spark.apache.org

Reading an RDD from a checkpoint.

Reply via email to