[ https://issues.apache.org/jira/browse/SPARK-5992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14575557#comment-14575557 ]
Debasish Das commented on SPARK-5992:
-------------------------------------

LSH is optimized for cosine similarity anyway. I think we should use Algebird's LSH for quality comparisons, but in MLlib add something like a randomized k-d tree or k-means tree. Hopefully it can be generic over distance functions.

> Locality Sensitive Hashing (LSH) for MLlib
> ------------------------------------------
>
>                 Key: SPARK-5992
>                 URL: https://issues.apache.org/jira/browse/SPARK-5992
>             Project: Spark
>          Issue Type: New Feature
>          Components: MLlib
>    Affects Versions: 1.4.0
>            Reporter: Joseph K. Bradley
>
> Locality Sensitive Hashing (LSH) would be very useful for ML. It would be
> great to discuss some possible algorithms here, choose an API, and make a PR
> for an initial algorithm.

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
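
For readers following the remark that LSH is geared toward cosine similarity, here is a minimal sketch of one common scheme, random-hyperplane hashing, in plain Python. It is an illustration only and is not the Algebird or MLlib API; the names `make_hyperplanes`, `signature`, and `hamming` are hypothetical helpers introduced for this example. Each bit records which side of a random hyperplane a vector falls on, so the signature depends only on the vector's direction, which is why the scheme suits cosine similarity rather than arbitrary distance functions.

```python
import random


def make_hyperplanes(dim, num_bits, seed=0):
    """Draw num_bits random Gaussian normal vectors of length dim."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_bits)]


def signature(vec, hyperplanes):
    """One bit per hyperplane: 1 if vec lies on its non-negative side, else 0."""
    return tuple(
        1 if sum(h_i * v_i for h_i, v_i in zip(h, vec)) >= 0 else 0
        for h in hyperplanes
    )


def hamming(sig_a, sig_b):
    """Number of differing bits; grows with the angle between the vectors."""
    return sum(a != b for a, b in zip(sig_a, sig_b))


planes = make_hyperplanes(dim=3, num_bits=16, seed=42)
v = [1.0, 2.0, 3.0]
scaled = [2.0 * x for x in v]      # same direction, different length
opposite = [-x for x in v]         # opposite direction

sig_v = signature(v, planes)
sig_scaled = signature(scaled, planes)
sig_opposite = signature(opposite, planes)
```

Vectors pointing the same way get the same signature regardless of magnitude, while opposite vectors disagree on every bit. A tree-based index of the kind suggested in the comment would not have this built-in tie to a single similarity measure.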