-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/25049/
-----------------------------------------------------------
(Updated Sept. 3, 2014, 8:55 p.m.)


Review request for DataFu.


Changes
-------

Added Generic NGram instead of 3-gram


Repository: datafu


Description
-------

DATAFU-67. Adding Simple SimHash to compute near duplicates.
https://issues.apache.org/jira/browse/DATAFU-67


Diffs (updated)
-----

  datafu-pig/src/main/java/datafu/pig/hash/SimHash.java PRE-CREATION
  datafu-pig/src/test/java/datafu/test/pig/hash/HashTests.java 7ff8fb9

Diff: https://reviews.apache.org/r/25049/diff/


Testing
-------

Unit tests passed.


Thanks,

Mohammad Amin