Makoto Yui created HIVEMALL-18: ---------------------------------- Summary: Support approx_count UDAF using HyperLogLog Key: HIVEMALL-18 URL: https://issues.apache.org/jira/browse/HIVEMALL-18 Project: Hivemall Issue Type: Sub-task Reporter: Makoto Yui Priority: Minor
https://github.com/addthis/stream-lib could be used for underlying library. http://www.slideshare.net/bzamecnik/hyperloglog-in-hive-how-to-count-sheep-efficiently https://databricks.com/blog/2016/05/19/approximate-algorithms-in-apache-spark-hyperloglog-and-quantiles.html There exist several HLL implementation as Hive UDAF. https://github.com/MLnick/hive-udf/wiki https://github.com/klout/brickhouse -- This message was sent by Atlassian JIRA (v6.3.4#6332)