[ https://issues.apache.org/jira/browse/MAHOUT-1529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13988497#comment-13988497 ]
Dmitriy Lyubimov commented on MAHOUT-1529:
------------------------------------------
IMO this doesn't tolerate haste. This is very conceptual. I won't commit
anything until we have finished discussing the points I outlined for
Stratosphere.
There are small items that I may commit soon (such as wrapping up the context
and cache manager).
I think the best way to work on this issue is to create a GitHub side branch
and keep squash-committing to it little by little, while handling small
additions via pull requests.
BTW I truly envy the Spark process, which is handled 100% through GitHub pull
requests. Does anybody know how they manage to push it back to Apache from
there?
> Finalize abstraction of distributed logical plans from backend operations
> -------------------------------------------------------------------------
>
> Key: MAHOUT-1529
> URL: https://issues.apache.org/jira/browse/MAHOUT-1529
> Project: Mahout
> Issue Type: Improvement
> Reporter: Dmitriy Lyubimov
> Fix For: 1.0
>
>
> We have a few situations in which the algorithm-facing API has Spark
> dependencies creeping in.
> In particular, we know of the following cases:
> (1) checkpoint() accepts Spark constant StorageLevel directly;
> (2) certain things in CheckpointedDRM;
> (3) drmParallelize etc. routines in the "drm" and "sparkbindings" packages;
> (5) drmBroadcast returns a Spark-specific Broadcast object.
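One way to read the cases listed in the issue description is that the algorithm-facing code should see only engine-neutral types, with each backend translating them into its own. The sketch below is a minimal illustration of that idea in plain Java, not the project's design: every name in it (CacheHint, BCast, Backend, LocalBackend, scaledSum) is hypothetical, and the toy local backend merely stands in for a real engine such as Spark.

```java
// All names here are hypothetical illustrations; none are Mahout or Spark APIs.
public class BackendNeutralSketch {

    /** Engine-neutral caching hint that algorithm code could pass instead of a Spark StorageLevel. */
    public enum CacheHint { NONE, MEMORY_ONLY, MEMORY_AND_DISK, DISK_ONLY }

    /** Engine-neutral handle to a broadcast value, instead of a Spark-specific Broadcast. */
    public interface BCast<T> {
        T value();
    }

    /** The part each backend would implement. */
    public interface Backend {
        /** Maps the neutral hint to whatever the engine uses natively (a string in this toy). */
        String translate(CacheHint hint);

        /** Wraps a value in the engine's own broadcast mechanism. */
        <T> BCast<T> broadcast(T value);
    }

    /** Toy in-process backend standing in for a real distributed engine. */
    public static class LocalBackend implements Backend {
        @Override
        public String translate(CacheHint hint) {
            switch (hint) {
                case NONE:
                    return "local:no-cache";
                case DISK_ONLY:
                    return "local:disk";
                default:
                    return "local:heap";
            }
        }

        @Override
        public <T> BCast<T> broadcast(T value) {
            // Nothing to ship anywhere in-process; just hand the value back.
            return () -> value;
        }
    }

    /** Algorithm-facing code: depends only on Backend and BCast, never on an engine class. */
    public static double scaledSum(Backend backend, double[] data, double factor) {
        BCast<Double> f = backend.broadcast(factor);
        double sum = 0.0;
        for (double d : data) {
            sum += d * f.value();
        }
        return sum;
    }

    public static void main(String[] args) {
        Backend backend = new LocalBackend();
        System.out.println(backend.translate(CacheHint.MEMORY_ONLY));
        System.out.println(scaledSum(backend, new double[] {1.0, 2.0, 3.0}, 2.0));
    }
}
```

Under this assumption, swapping engines would mean supplying another Backend implementation, while scaledSum and similar algorithm code would stay unchanged.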