[
https://issues.apache.org/jira/browse/FLINK-5785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15943728#comment-15943728
]
ASF GitHub Bot commented on FLINK-5785:
---------------------------------------
Github user zentol commented on the issue:
https://github.com/apache/flink/pull/3625
Regarding the license: Every (non-binary) file in the flink repository must
have the apache license at the very top of the file. Simply take a look at an
existing scala class and you'll see what i mean.
Second: It is not required to open a new PR when making changes, you can
add commits to the branch of the PR. (note that force-pushes should only be
done if necessary).
Third, the file count in this PR is dramatically higher than in the last
one (4 vs 84), is this intended or a mistake?
> Add an Imputer for preparing data
> ---------------------------------
>
> Key: FLINK-5785
> URL: https://issues.apache.org/jira/browse/FLINK-5785
> Project: Flink
> Issue Type: New Feature
> Components: Machine Learning Library
> Reporter: Stavros Kontopoulos
> Assignee: Stavros Kontopoulos
>
> We need to add an Imputer as described in [1].
> "The Imputer class provides basic strategies for imputing missing values,
> either using the mean, the median or the most frequent value of the row or
> column in which the missing values are located. This class also allows for
> different missing values encodings."
> References
> 1. http://scikit-learn.org/stable/modules/preprocessing.html#preprocessing
> 2.
> http://scikit-learn.org/stable/auto_examples/missing_values.html#sphx-glr-auto-examples-missing-values-py
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)