[ https://issues.apache.org/jira/browse/MAHOUT-1570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14908217#comment-14908217 ]

ASF GitHub Bot commented on MAHOUT-1570:
----------------------------------------

Github user alexeygrigorev commented on the pull request:

    https://github.com/apache/mahout/pull/137#issuecomment-143259429

    I rebased and cleaned the commit history a bit to make sure every commit has a jira id.

    The documentation and the current status can be found at https://github.com/alexeygrigorev/mahout/wiki/Samsara-Flink-Bindings

> Adding support for Apache Flink as a backend for the Mahout DSL
> ---------------------------------------------------------------
>
>                 Key: MAHOUT-1570
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1570
>             Project: Mahout
>          Issue Type: Improvement
>            Reporter: Till Rohrmann
>            Assignee: Alexey Grigorev
>              Labels: DSL, flink, scala
>             Fix For: 0.11.1
>
>
> With the finalized abstraction of the Mahout DSL plans from the backend
> operations (MAHOUT-1529), it should be possible to integrate further backends
> for the Mahout DSL. Apache Flink would be a suitable candidate to act as a
> good execution backend.
> With respect to the implementation, the biggest difference between Spark and
> Flink at the moment is probably the incremental rollout of plans, which is
> triggered by Spark's actions and which is not supported by Flink yet.
> However, the Flink community is working on this issue. For the moment, it
> should be possible to circumvent this problem by writing intermediate results
> required by an action to HDFS and reading from there.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)