[ 
https://issues.apache.org/jira/browse/DATAFU-37?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13986752#comment-13986752
 ] 

Matthew Hayes commented on DATAFU-37:
-------------------------------------

I think we could do that as a follow on item.  Let's think about it some more.  
If there's a way we could implement that so that the parameters could be 
estimated using a Pig script that would make it really convenient for people to 
use.  We wouldn't be able to use these UDFs but maybe we could have alternative 
versions that take a parameter range.  Another thing that would be great is a 
blog post on the datafu website that demonstrates using this for a real 
application from start to finish, including parameter estimation.  This would 
help people get started using it.

> Add Locality Sensitive Hashing UDFs
> -----------------------------------
>
>                 Key: DATAFU-37
>                 URL: https://issues.apache.org/jira/browse/DATAFU-37
>             Project: DataFu
>          Issue Type: New Feature
>            Reporter: Casey Stella
>            Assignee: Casey Stella
>         Attachments: DATAFU-37.patch
>
>   Original Estimate: 168h
>  Remaining Estimate: 168h
>
> Create a set of UDFs to implement [Locality Sensitive 
> Hashing|http://en.wikipedia.org/wiki/Locality-sensitive_hashing] in support 
> of finding k-near neighbors.   Initially, hashes associated with L1, L2 and 
> Cosine similarity should be supported.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to