[
https://issues.apache.org/jira/browse/FLINK-5525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15834199#comment-15834199
]
Tao Meng commented on FLINK-5525:
---------------------------------
[~skonto] Yes, I agree it. In there days, I will investigate Spark and Samoa
and make some docs about it.
> Streaming Version of a Linear Regression model
> ----------------------------------------------
>
> Key: FLINK-5525
> URL: https://issues.apache.org/jira/browse/FLINK-5525
> Project: Flink
> Issue Type: New Feature
> Components: Machine Learning Library
> Reporter: Stavros Kontopoulos
>
> Given the nature of Flink we should have a streaming version of the
> algorithms when possible.
> Update of the model should be done on a per window basis.
> An extreme case is: https://en.wikipedia.org/wiki/Online_machine_learning
> Resources
> [1]
> http://scikit-learn.org/dev/modules/scaling_strategies.html#incremental-learning
> [2]
> http://stats.stackexchange.com/questions/6920/efficient-online-linear-regression
> [3] https://spark.apache.org/docs/1.1.0/mllib-linear-methods.html
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)