----------------------------------------------------------- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/20608/ -----------------------------------------------------------
(Updated May 13, 2014, 12:49 p.m.) Review request for DataFu. Changes ------- As we discussed in JIRA, this is the suggested implementation to support random seeds. Give it a look-over, please. I also added a unit test to verify that the seed is taking. Repository: datafu Description ------- >From DATAFU-37: Create a set of UDFs to implement Locality Sensitive Hashing >in support of finding k-near neighbors. Initially, hashes associated with L1, >L2 and Cosine similarity should be supported. Diffs (updated) ----- datafu-pig/src/main/java/datafu/pig/hash/lsh/CosineDistanceHash.java PRE-CREATION datafu-pig/src/main/java/datafu/pig/hash/lsh/L1PStableHash.java PRE-CREATION datafu-pig/src/main/java/datafu/pig/hash/lsh/L2PStableHash.java PRE-CREATION datafu-pig/src/main/java/datafu/pig/hash/lsh/LSHFamily.java PRE-CREATION datafu-pig/src/main/java/datafu/pig/hash/lsh/LSHFunc.java PRE-CREATION datafu-pig/src/main/java/datafu/pig/hash/lsh/RepeatingLSH.java PRE-CREATION datafu-pig/src/main/java/datafu/pig/hash/lsh/cosine/HyperplaneLSH.java PRE-CREATION datafu-pig/src/main/java/datafu/pig/hash/lsh/cosine/package-info.java PRE-CREATION datafu-pig/src/main/java/datafu/pig/hash/lsh/interfaces/LSH.java PRE-CREATION datafu-pig/src/main/java/datafu/pig/hash/lsh/interfaces/LSHCreator.java PRE-CREATION datafu-pig/src/main/java/datafu/pig/hash/lsh/interfaces/Sampler.java PRE-CREATION datafu-pig/src/main/java/datafu/pig/hash/lsh/interfaces/package-info.java PRE-CREATION datafu-pig/src/main/java/datafu/pig/hash/lsh/metric/Cosine.java PRE-CREATION datafu-pig/src/main/java/datafu/pig/hash/lsh/metric/L1.java PRE-CREATION datafu-pig/src/main/java/datafu/pig/hash/lsh/metric/L2.java PRE-CREATION datafu-pig/src/main/java/datafu/pig/hash/lsh/metric/MetricUDF.java PRE-CREATION datafu-pig/src/main/java/datafu/pig/hash/lsh/metric/package-info.java PRE-CREATION datafu-pig/src/main/java/datafu/pig/hash/lsh/p_stable/AbstractStableDistributionFunction.java PRE-CREATION datafu-pig/src/main/java/datafu/pig/hash/lsh/p_stable/L1LSH.java PRE-CREATION datafu-pig/src/main/java/datafu/pig/hash/lsh/p_stable/L2LSH.java PRE-CREATION datafu-pig/src/main/java/datafu/pig/hash/lsh/p_stable/package-info.java PRE-CREATION datafu-pig/src/main/java/datafu/pig/hash/lsh/package-info.java PRE-CREATION datafu-pig/src/main/java/datafu/pig/hash/lsh/util/DataTypeUtil.java PRE-CREATION datafu-pig/src/main/java/datafu/pig/hash/lsh/util/package-info.java PRE-CREATION datafu-pig/src/test/java/datafu/test/pig/hash/lsh/LSHPigTest.java PRE-CREATION datafu-pig/src/test/java/datafu/test/pig/hash/lsh/LSHTest.java PRE-CREATION Diff: https://reviews.apache.org/r/20608/diff/ Testing ------- 2 unit tests. One pigunit for the UDFs and one regular JUnit test to test functionality. Thanks, Casey Stella