[ 
https://issues.apache.org/jira/browse/MAHOUT-1570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14973091#comment-14973091
 ] 

ASF GitHub Bot commented on MAHOUT-1570:
----------------------------------------

Github user smarthi commented on the pull request:

    https://github.com/apache/mahout/pull/137#issuecomment-150896852
  
    Nevertheless, we have pushed some changes to the mahout:flink-binding
    branch, which @alexey needs to pull from.
    
    On Sun, Oct 25, 2015 at 2:38 AM, Dmitriy Lyubimov <[email protected]>
    wrote:
    
    > Maybe. We need something to review the diff of the flink branch vs.
    > master. Can't create a PR since we do not own GitHub's mahout repo.
    >
    > On Sat, Oct 24, 2015 at 11:06 PM, Alexey Grigorev <
    > [email protected]>
    > wrote:
    >
    > > @dlyubimov <https://github.com/dlyubimov> just to clarify: you're
    > > suggesting I pull from mahout:flink-binding to my :flink-binding, so all
    > > new commits from that branch appear here?
    > >
    > > —
    > > Reply to this email directly or view it on GitHub
    > > <https://github.com/apache/mahout/pull/137#issuecomment-150895829>.
    >
    > —
    > Reply to this email directly or view it on GitHub
    > <https://github.com/apache/mahout/pull/137#issuecomment-150896721>.
    >



> Adding support for Apache Flink as a backend for the Mahout DSL
> ---------------------------------------------------------------
>
>                 Key: MAHOUT-1570
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1570
>             Project: Mahout
>          Issue Type: Improvement
>            Reporter: Till Rohrmann
>            Assignee: Alexey Grigorev
>              Labels: DSL, flink, scala
>             Fix For: 0.13.0
>
>
> With the finalized abstraction of the Mahout DSL plans from the backend 
> operations (MAHOUT-1529), it should be possible to integrate further backends 
> for the Mahout DSL. Apache Flink would be a suitable candidate to act as a 
> good execution backend. 
> With respect to the implementation, the biggest difference between Spark and 
> Flink at the moment is probably the incremental rollout of plans, which is 
> triggered by Spark's actions and which is not supported by Flink yet. 
> However, the Flink community is working on this issue. For the moment, it 
> should be possible to circumvent this problem by writing intermediate results 
> required by an action to HDFS and reading from there.



