[ https://issues.apache.org/jira/browse/CASSANDRA-4914?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14323933#comment-14323933 ]
Ahmet AKYOL commented on CASSANDRA-4914: ---------------------------------------- As [~jbellis] said it's a different yet interesting case. IMHO, some probabilistic data structures can be implemented like bloom filter. A library like [stream-lib|https://github.com/addthis/stream-lib] can be used (by the way, they say that, [~jbellis]'s blog post about bloom filters has inspired them ). Hyperloglog can also be useful,just look [Redis's implementation|http://redis.io/commands#hyperloglog]. > Aggregation functions in CQL > ---------------------------- > > Key: CASSANDRA-4914 > URL: https://issues.apache.org/jira/browse/CASSANDRA-4914 > Project: Cassandra > Issue Type: New Feature > Reporter: Vijay > Assignee: Benjamin Lerer > Labels: cql, docs > Fix For: 3.0 > > Attachments: CASSANDRA-4914-V2.txt, CASSANDRA-4914-V3.txt, > CASSANDRA-4914-V4.txt, CASSANDRA-4914-V5.txt, CASSANDRA-4914.txt > > > The requirement is to do aggregation of data in Cassandra (Wide row of column > values of int, double, float etc). > With some basic agree gate functions like AVG, SUM, Mean, Min, Max, etc (for > the columns within a row). > Example: > SELECT * FROM emp WHERE empID IN (130) ORDER BY deptID DESC; > > empid | deptid | first_name | last_name | salary > -------+--------+------------+-----------+-------- > 130 | 3 | joe | doe | 10.1 > 130 | 2 | joe | doe | 100 > 130 | 1 | joe | doe | 1e+03 > > SELECT sum(salary), empid FROM emp WHERE empID IN (130); > > sum(salary) | empid > -------------+-------- > 1110.1 | 130 -- This message was sent by Atlassian JIRA (v6.3.4#6332)