[ 
https://issues.apache.org/jira/browse/SPARK-23216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hyukjin Kwon resolved SPARK-23216.
----------------------------------
    Resolution: Incomplete

> Multiclass LogisticRegression could have methods like NCE, NEG, Hierarchical 
> SoftMax, Blackout or IS
> ----------------------------------------------------------------------------------------------------
>
>                 Key: SPARK-23216
>                 URL: https://issues.apache.org/jira/browse/SPARK-23216
>             Project: Spark
>          Issue Type: Improvement
>          Components: ML, MLlib
>    Affects Versions: 2.2.1
>            Reporter: Michel Lemay
>            Priority: Minor
>              Labels: bulk-closed
>
> When training a classifier with large number of classes, performance sink. 
> This is expected when using regular (log)SoftMax methods to compute the loss 
> since it needs to normalize current class score with the sum of all other 
> classes score.
> I think this would be helpful to have approximate methods like Hierarchical 
> SoftMax, NCE, NEG, IS to speedup training.
> A paper comparing different methods for approximate normalization over all 
> classes:
> [http://web4.cs.ucl.ac.uk/staff/D.Barber/publications/AISTATS2017.pdf]



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to