[ https://issues.apache.org/jira/browse/DATAFU-37?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13986752#comment-13986752 ]
Matthew Hayes commented on DATAFU-37: ------------------------------------- I think we could do that as a follow on item. Let's think about it some more. If there's a way we could implement that so that the parameters could be estimated using a Pig script that would make it really convenient for people to use. We wouldn't be able to use these UDFs but maybe we could have alternative versions that take a parameter range. Another thing that would be great is a blog post on the datafu website that demonstrates using this for a real application from start to finish, including parameter estimation. This would help people get started using it. > Add Locality Sensitive Hashing UDFs > ----------------------------------- > > Key: DATAFU-37 > URL: https://issues.apache.org/jira/browse/DATAFU-37 > Project: DataFu > Issue Type: New Feature > Reporter: Casey Stella > Assignee: Casey Stella > Attachments: DATAFU-37.patch > > Original Estimate: 168h > Remaining Estimate: 168h > > Create a set of UDFs to implement [Locality Sensitive > Hashing|http://en.wikipedia.org/wiki/Locality-sensitive_hashing] in support > of finding k-near neighbors. Initially, hashes associated with L1, L2 and > Cosine similarity should be supported. -- This message was sent by Atlassian JIRA (v6.2#6252)