[ 
https://issues.apache.org/jira/browse/SPARK-8534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14726364#comment-14726364
 ] 

Ehsan Mohyedin Kermani commented on SPARK-8534:
-----------------------------------------------

I'd like to give it a shot, but first I think we need a distributed scan 
function for computing the cumulative sum of the sorted predictions. Would it 
be possible to add that to RegressionMetrics, or perhaps mllib.util, first? An 
implementation was suggested here: 
https://groups.google.com/forum/#!topic/spark-users/ts-FdB50ltY
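
To make the idea concrete, here is a minimal sketch of a two-pass scan in 
plain Python. It is not the implementation from the linked thread and it does 
not use any Spark API: partitions are simulated as a list of lists, and the 
function names (partition_totals, partition_offsets, distributed_cumsum) are 
hypothetical. It assumes the values are already sorted across partitions.

```python
from itertools import accumulate
from typing import List


def partition_totals(partitions: List[List[float]]) -> List[float]:
    """Pass 1: reduce each partition to one number (its local sum)."""
    return [sum(part) for part in partitions]


def partition_offsets(totals: List[float]) -> List[float]:
    """Exclusive prefix sum of the per-partition totals.

    offsets[i] is the sum of everything in partitions 0..i-1.
    """
    offsets = [0.0]
    for total in totals[:-1]:
        offsets.append(offsets[-1] + total)
    return offsets


def distributed_cumsum(partitions: List[List[float]]) -> List[List[float]]:
    """Pass 2: local inclusive scan in each partition, shifted by its offset."""
    totals = partition_totals(partitions)
    offsets = partition_offsets(totals)
    return [
        [offset + running for running in accumulate(part)]
        for part, offset in zip(partitions, offsets)
    ]


if __name__ == "__main__":
    # Three simulated partitions, one of them empty.
    print(distributed_cumsum([[1.0, 2.0], [], [3.0, 4.0, 5.0]]))
```

Only the per-partition totals need to be gathered in one place, so the data 
itself is traversed twice but never collected. On an RDD, the two passes 
would presumably map onto per-partition operations such as 
mapPartitionsWithIndex, though that mapping is an assumption here rather 
than something tested against Spark.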

> Gini for regression metrics and evaluator
> -----------------------------------------
>
>                 Key: SPARK-8534
>                 URL: https://issues.apache.org/jira/browse/SPARK-8534
>             Project: Spark
>          Issue Type: New Feature
>          Components: ML, MLlib
>            Reporter: Joseph K. Bradley
>            Priority: Minor
>
> One common metric we do not have in RegressionMetrics or RegressionEvaluator 
> is Gini: [https://www.kaggle.com/wiki/Gini]
> Implementing (normalized) Gini would be nice.  However, it might be 
> expensive; I believe it would require sorting the labels.



