[ 
https://issues.apache.org/jira/browse/SPARK-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14063133#comment-14063133
 ] 

Xiangrui Meng commented on SPARK-2329:
--------------------------------------

PR: https://github.com/apache/spark/pull/1270

> Add multi-label evaluation metrics
> ----------------------------------
>
>                 Key: SPARK-2329
>                 URL: https://issues.apache.org/jira/browse/SPARK-2329
>             Project: Spark
>          Issue Type: New Feature
>          Components: MLlib
>            Reporter: Alexander Ulanov
>            Assignee: Alexander Ulanov
>   Original Estimate: 72h
>  Remaining Estimate: 72h
>
> There is no class in Spark MLlib for measuring the performance of multi-label 
>  classifiers. Multilabel classification is when the document is labeled with 
> several labels (classes).
> This task involves adding the class for multilabel evaluation and unit tests. 
> The following measures are to be implemented: Precision, Recall and 
> F1-measure (1) based on documents averaged by the number of documents; (2) 
> per label; (3) based on labels micro and macro averaged; (4) Hamming loss. 
> Reference: Tsoumakas, Grigorios, Ioannis Katakis, and Ioannis Vlahavas. 
> "Mining multi-label data." Data mining and knowledge discovery handbook. 
> Springer US, 2010. 667-685.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to