[ https://issues.apache.org/jira/browse/MAHOUT-1529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13986959#comment-13986959 ]
Sebastian Schelter commented on MAHOUT-1529: -------------------------------------------- In Stratosphere the optimizer makes the decision what to put in memory or on disk at what point. So there is no explicit caching, but nevertheless programs could have that and Stratosphere would use it as a hint to the optimizer. I asked on the newly established stratosphere dev list for someone to have a look at this issue. > Finalize abstraction of distributed logical plans from backend operations > ------------------------------------------------------------------------- > > Key: MAHOUT-1529 > URL: https://issues.apache.org/jira/browse/MAHOUT-1529 > Project: Mahout > Issue Type: Improvement > Reporter: Dmitriy Lyubimov > Fix For: 1.0 > > > We have a few situations when algorithm-facing API has Spark dependencies > creeping in. > In particular, we know of the following cases: > (1) checkpoint() accepts Spark constant StorageLevel directly; > (2) certain things in CheckpointedDRM; > (3) drmParallelize etc. routines in the "drm" and "sparkbindings" package. > (5) drmBroadcast returns a Spark-specific Broadcast object -- This message was sent by Atlassian JIRA (v6.2#6252)