[ 
https://issues.apache.org/jira/browse/MAHOUT-1529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13988497#comment-13988497
 ] 

Dmitriy Lyubimov commented on MAHOUT-1529:
------------------------------------------

IMO this doesn't tolerate haste. This is very conceptual. I won't commit 
anything until we have finished discussion on the things i outlined for 
Stratosphere.

There are little items that i may commit soon (such as wrapping up context and 
cache manager). 

I think the best mode to work this issue is to create a github side branch and 
keep squash-committing it little by little while handling little additions via 
pull request. 

BTW I truly envy the Spark process which is 100% handled by github pull 
requests. Does anybody knows how they manage to push it back to Apache from 
there?

> Finalize abstraction of distributed logical plans from backend operations
> -------------------------------------------------------------------------
>
>                 Key: MAHOUT-1529
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1529
>             Project: Mahout
>          Issue Type: Improvement
>            Reporter: Dmitriy Lyubimov
>             Fix For: 1.0
>
>
> We have a few situations when algorithm-facing API has Spark dependencies 
> creeping in. 
> In particular, we know of the following cases:
> (1) checkpoint() accepts Spark constant StorageLevel directly;
> (2) certain things in CheckpointedDRM;
> (3) drmParallelize etc. routines in the "drm" and "sparkbindings" package. 
> (5) drmBroadcast returns a Spark-specific Broadcast object



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to