[jira] [Updated] (FLINK-2379) Add methods to evaluate field wise statistics over DataSet of vectors.

2019-02-28 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/FLINK-2379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated FLINK-2379:
--
Labels: pull-request-available  (was: )

> Add methods to evaluate field wise statistics over DataSet of vectors.
> --
>
> Key: FLINK-2379
> URL: https://issues.apache.org/jira/browse/FLINK-2379
> Project: Flink
>  Issue Type: New Feature
>  Components: Library / Machine Learning
>Reporter: Sachin Goel
>Assignee: Sachin Goel
>Priority: Major
>  Labels: pull-request-available
>
> Design methods to evaluate statistics over dataset of vectors.
> For continuous fields, Minimum, maximum, mean, variance.
> For discrete fields, Class counts, Entropy, Gini Impurity.
> Further statistical measures can also be supported. For example, correlation 
> between two series, computing the covariance matrix, etc. 
> [These are currently the things Spark supports.]



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (FLINK-2379) Add methods to evaluate field wise statistics over DataSet of vectors.

2015-07-20 Thread Sachin Goel (JIRA)

 [ 
https://issues.apache.org/jira/browse/FLINK-2379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sachin Goel updated FLINK-2379:
---
Description: 
Design methods to evaluate statistics over dataset of vectors.
For continuous fields, Minimum, maximum, mean, variance.
For discrete fields, Class counts, Entropy, Gini Impurity.

Further statistical measures can also be supported. For example, correlation 
between two series, computing the covariance matrix, etc. 
[These are currently the things Spark supports.]

  was:
Design methods to evaluate statistics over dataset of vectors.
For continuous fields, Minimum, maximum, mean, variance.
For discrete fields, Class counts, Entropy, Gini Impurity.

 Issue Type: New Feature  (was: Bug)

 Add methods to evaluate field wise statistics over DataSet of vectors.
 --

 Key: FLINK-2379
 URL: https://issues.apache.org/jira/browse/FLINK-2379
 Project: Flink
  Issue Type: New Feature
  Components: Machine Learning Library
Reporter: Sachin Goel

 Design methods to evaluate statistics over dataset of vectors.
 For continuous fields, Minimum, maximum, mean, variance.
 For discrete fields, Class counts, Entropy, Gini Impurity.
 Further statistical measures can also be supported. For example, correlation 
 between two series, computing the covariance matrix, etc. 
 [These are currently the things Spark supports.]



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)