[
https://issues.apache.org/jira/browse/DATAFU-37?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13986752#comment-13986752
]
Matthew Hayes commented on DATAFU-37:
-------------------------------------
I think we could do that as a follow on item. Let's think about it some more.
If there's a way we could implement that so that the parameters could be
estimated using a Pig script that would make it really convenient for people to
use. We wouldn't be able to use these UDFs but maybe we could have alternative
versions that take a parameter range. Another thing that would be great is a
blog post on the datafu website that demonstrates using this for a real
application from start to finish, including parameter estimation. This would
help people get started using it.
> Add Locality Sensitive Hashing UDFs
> -----------------------------------
>
> Key: DATAFU-37
> URL: https://issues.apache.org/jira/browse/DATAFU-37
> Project: DataFu
> Issue Type: New Feature
> Reporter: Casey Stella
> Assignee: Casey Stella
> Attachments: DATAFU-37.patch
>
> Original Estimate: 168h
> Remaining Estimate: 168h
>
> Create a set of UDFs to implement [Locality Sensitive
> Hashing|http://en.wikipedia.org/wiki/Locality-sensitive_hashing] in support
> of finding k-near neighbors. Initially, hashes associated with L1, L2 and
> Cosine similarity should be supported.
--
This message was sent by Atlassian JIRA
(v6.2#6252)