Marco Gaido created SPARK-22119:
-----------------------------------

             Summary: Add cosine distance to KMeans
                 Key: SPARK-22119
                 URL: https://issues.apache.org/jira/browse/SPARK-22119
             Project: Spark
          Issue Type: New Feature
          Components: ML, MLlib
    Affects Versions: 2.2.0
            Reporter: Marco Gaido
            Priority: Minor


Currently, KMeans assumes the only possible distance measure to be used is the 
Euclidean.

In some use cases, eg. text mining, other distance measures like the cosine 
distance are widely used. Thus, for such use cases, it would be good to support 
multiple distance measures.

This ticket is to support the cosine distance measure on KMeans. Later, other 
algorithms can be extended to support several distance measures and other 
distance measures can be added.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to