[ https://issues.apache.org/jira/browse/SPARK-5992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14575079#comment-14575079 ]
Joseph K. Bradley commented on SPARK-5992: ------------------------------------------ I'm just noting here that [~yuu.ishik...@gmail.com] and I discussed the design doc. The current plan is to have these implemented under ml.feature, with 2 types of UnaryTransformers: * Vector -> T: hash feature vector to some other type (Vector, Int, etc.) * Vector -> Double: compare feature vector with a fixed base Vector, and return similarity > Locality Sensitive Hashing (LSH) for MLlib > ------------------------------------------ > > Key: SPARK-5992 > URL: https://issues.apache.org/jira/browse/SPARK-5992 > Project: Spark > Issue Type: New Feature > Components: MLlib > Affects Versions: 1.4.0 > Reporter: Joseph K. Bradley > > Locality Sensitive Hashing (LSH) would be very useful for ML. It would be > great to discuss some possible algorithms here, choose an API, and make a PR > for an initial algorithm. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org