[ https://issues.apache.org/jira/browse/MAHOUT-1529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13988497#comment-13988497 ]
Dmitriy Lyubimov commented on MAHOUT-1529:
------------------------------------------

IMO this doesn't tolerate haste. This is very conceptual. I won't commit anything until we have finished the discussion on the things I outlined for Stratosphere. There are a few small items that I may commit soon (such as wrapping up context and cache manager).

I think the best way to work this issue is to create a github side branch and keep squash-committing to it little by little, while handling small additions via pull requests.

BTW I truly envy the Spark process, which is 100% handled by github pull requests. Does anybody know how they manage to push it back to Apache from there?

> Finalize abstraction of distributed logical plans from backend operations
> -------------------------------------------------------------------------
>
>                 Key: MAHOUT-1529
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1529
>             Project: Mahout
>          Issue Type: Improvement
>            Reporter: Dmitriy Lyubimov
>             Fix For: 1.0
>
>
> We have a few situations where the algorithm-facing API has Spark dependencies
> creeping in.
> In particular, we know of the following cases:
> (1) checkpoint() accepts Spark constant StorageLevel directly;
> (2) certain things in CheckpointedDRM;
> (3) drmParallelize etc. routines in the "drm" and "sparkbindings" package.
> (5) drmBroadcast returns a Spark-specific Broadcast object
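
To make cases (1) and (5) from the issue description concrete, here is a minimal sketch of what hiding the Spark types behind engine-neutral ones might look like. Every name in it (CacheHint, BCast, DistributedEngine, LocalEngine, Demo) is hypothetical and made up for illustration; none of it is taken from the Mahout code base, and the toy in-process backend merely stands in for a real Spark or Stratosphere binding.

```scala
// Hypothetical sketch only: these names are illustrative assumptions,
// not the actual Mahout API.

// Engine-neutral cache hint, so checkpoint()-style calls need not
// accept Spark's StorageLevel directly (case 1).
sealed trait CacheHint
object CacheHint {
  case object NoCache extends CacheHint
  case object MemoryOnly extends CacheHint
  case object MemoryAndDisk extends CacheHint
}

// Engine-neutral broadcast handle, so drmBroadcast-style calls need not
// return a Spark-specific Broadcast object (case 5).
trait BCast[T] {
  def value: T
}

// The only surface the algorithm-facing code is allowed to see.
trait DistributedEngine {
  def broadcast[T](v: T): BCast[T]
  // A real backend would translate the hint into its own storage constant;
  // this sketch just returns a description of it.
  def describeStorage(hint: CacheHint): String
}

// Toy in-process backend standing in for a real engine binding.
object LocalEngine extends DistributedEngine {
  def broadcast[T](v: T): BCast[T] = new BCast[T] { val value: T = v }

  def describeStorage(hint: CacheHint): String = hint match {
    case CacheHint.NoCache       => "none"
    case CacheHint.MemoryOnly    => "memory"
    case CacheHint.MemoryAndDisk => "memory+disk"
  }
}

object Demo {
  // Algorithm code depends on the neutral traits only, never on a backend type.
  def scaleAll(xs: Seq[Double], engine: DistributedEngine): Seq[Double] = {
    val factor = engine.broadcast(2.0)
    xs.map(_ * factor.value)
  }
}
```

With something of this shape, a Spark binding would map each CacheHint to a StorageLevel and wrap its own Broadcast inside a BCast implementation, so the Spark types stay inside the backend package.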