Sounds great! Would you mind writing a proposal about this?

Jihoon

On Tue, Mar 19, 2019 at 3:54 PM Samarth Jain <[email protected]> wrote:

> Hi,
>
> The T-Digest (https://github.com/tdunning/t-digest) data structure is
> another way of computing sketches, rank-based statistics, and trimmed
> means over numeric data. At my day job, we have been using a
> T-Digest-backed Druid aggregator module, which has generally worked well
> for the use cases of the respective teams. I think it would be valuable
> to have T-Digest-backed aggregators in Druid alongside other sketch
> algorithms such as moments and Yahoo quantile sketches.
>
> T-Digest has also been adopted by other projects, including:
>
> Elasticsearch -
> https://www.elastic.co/guide/en/elasticsearch/reference/current/search-aggregations-metrics-percentile-aggregation.html#search-aggregations-metrics-percentile-aggregation-approximation
>
> stream-lib -
> https://github.com/addthis/stream-lib/blob/master/src/main/java/com/clearspring/analytics/stream/quantile/TDigest.java
>
> Apache Mahout -
> https://archive.cloudera.com/cdh5/cdh/5/mahout/mahout-math/org/apache/mahout/math/stats/TDigest.html
>
> I have been working on cleaning up the module and improving its
> performance, and I would like to contribute it. I would like to hear
> what the community thinks about it.
>
> Thanks,
> Samarth
