[jira] [Commented] (FLINK-5785) Add an Imputer for preparing data

ASF GitHub Bot (JIRA) Mon, 27 Mar 2017 10:55:19 -0700

    [ 
https://issues.apache.org/jira/browse/FLINK-5785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15943728#comment-15943728
 ]


ASF GitHub Bot commented on FLINK-5785:
---------------------------------------

Github user zentol commented on the issue:

    https://github.com/apache/flink/pull/3625
  
    Regarding the license: Every (non-binary) file in the flink repository must 
have the apache license at the very top of the file. Simply take a look at an 
existing scala class and you'll see what i mean.
    
    Second: It is not required to open a new PR when making changes, you can 
add commits to the branch of the PR. (note that force-pushes should only be 
done if necessary).
    
    Third, the file count in this PR is dramatically higher than in the last 
one (4 vs 84), is this intended or a mistake?


> Add an Imputer for preparing data
> ---------------------------------
>
>                 Key: FLINK-5785
>                 URL: https://issues.apache.org/jira/browse/FLINK-5785
>             Project: Flink
>          Issue Type: New Feature
>          Components: Machine Learning Library
>            Reporter: Stavros Kontopoulos
>            Assignee: Stavros Kontopoulos
>
> We need to add an Imputer as described in [1].
> "The Imputer class provides basic strategies for imputing missing values, 
> either using the mean, the median or the most frequent value of the row or 
> column in which the missing values are located. This class also allows for 
> different missing values encodings."
> References
> 1. http://scikit-learn.org/stable/modules/preprocessing.html#preprocessing
> 2. 
> http://scikit-learn.org/stable/auto_examples/missing_values.html#sphx-glr-auto-examples-missing-values-py



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

[jira] [Commented] (FLINK-5785) Add an Imputer for preparing data

Reply via email to