[jira] [Commented] (SPARK-1855) Provide memory-and-local-disk RDD checkpointing

Mridul Muralidharan (JIRA) Sun, 18 May 2014 21:23:25 -0700

    [ 
https://issues.apache.org/jira/browse/SPARK-1855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14001377#comment-14001377
 ]


Mridul Muralidharan commented on SPARK-1855:
--------------------------------------------

Did not realize that mail replies to JIRA mails did not get mirrored to JIRA ! 
Replicating my mail here :

– cut and paste –

We don't have 3x replication in spark :-)
And if we use replicated storagelevel, while decreasing odds of failure, it 
does not eliminate it (since we are not doing a great job with replication 
anyway from fault tolerance point of view).
Also it does take a nontrivial performance hit with replicated levels.

Regards,
Mridul

> Provide memory-and-local-disk RDD checkpointing
> -----------------------------------------------
>
>                 Key: SPARK-1855
>                 URL: https://issues.apache.org/jira/browse/SPARK-1855
>             Project: Spark
>          Issue Type: New Feature
>          Components: MLlib, Spark Core
>    Affects Versions: 1.0.0
>            Reporter: Xiangrui Meng
>
> Checkpointing is used to cut long lineage while maintaining fault tolerance. 
> The current implementation is HDFS-based. Using the BlockRDD we can create 
> in-memory-and-local-disk (with replication) checkpoints that are not as 
> reliable as HDFS-based solution but faster.
> It can help applications that require many iterations.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (SPARK-1855) Provide memory-and-local-disk RDD checkpointing

Reply via email to