yuhao yang created SPARK-22195:
----------------------------------

             Summary: Add cosine similarity to org.apache.spark.ml.linalg.Vectors
                 Key: SPARK-22195
                 URL: https://issues.apache.org/jira/browse/SPARK-22195
             Project: Spark
          Issue Type: New Feature
          Components: ML
    Affects Versions: 2.2.0
            Reporter: yuhao yang
            Priority: Minor


Cosine similarity (https://en.wikipedia.org/wiki/Cosine_similarity) is one of 
the most widely used measures of similarity. In my experience it is quite 
useful in image and NLP applications.

I suggest adding a cosine similarity function to 
org.apache.spark.ml.linalg.Vectors.

Interface:

  def cosineSimilarity(v1: Vector, v2: Vector): Double = ...
  def cosineSimilarity(v1: Vector, v2: Vector, norm1: Double, norm2: Double): Double = ...
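
To make the proposal concrete, here is a minimal sketch of how the two 
overloads could behave. It is only an illustration, not the proposed patch: 
the object name CosineSimilaritySketch is hypothetical, the dot product is 
computed over dense arrays for simplicity (a real implementation would 
probably want to exploit sparsity), and returning 0.0 when either norm is 
zero is my assumption, since the interface above does not say what should 
happen in that case. It assumes spark-mllib-local is on the classpath and 
uses only the public Vector.toArray, Vector.size and Vectors.norm.

```scala
import org.apache.spark.ml.linalg.{Vector, Vectors}

// Hypothetical helper object for illustration; not part of Spark.
object CosineSimilaritySketch {

  // Overload that takes precomputed L2 norms, so callers comparing one
  // vector against many others can avoid recomputing its norm.
  def cosineSimilarity(v1: Vector, v2: Vector, norm1: Double, norm2: Double): Double = {
    require(v1.size == v2.size, s"Vector sizes differ: ${v1.size} vs ${v2.size}")
    // Assumption: similarity with a zero vector is defined as 0.0 here.
    if (norm1 == 0.0 || norm2 == 0.0) {
      0.0
    } else {
      // Dense dot product; simple but ignores sparsity.
      val a = v1.toArray
      val b = v2.toArray
      var dot = 0.0
      var i = 0
      while (i < a.length) {
        dot += a(i) * b(i)
        i += 1
      }
      dot / (norm1 * norm2)
    }
  }

  // Convenience overload that computes both L2 norms itself.
  def cosineSimilarity(v1: Vector, v2: Vector): Double =
    cosineSimilarity(v1, v2, Vectors.norm(v1, 2.0), Vectors.norm(v2, 2.0))
}
```

With this sketch, orthogonal vectors such as Vectors.dense(1.0, 0.0) and 
Vectors.dense(0.0, 1.0) give 0.0, and a dense vector compared with an 
equal sparse vector gives a value of 1.0 up to floating-point rounding.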

I would appreciate suggestions, and I need a green light from the committers.


