Re: anyone using Cassandra as an analytics/data warehouse?

2011-01-09 Thread Peter Harrison
On Wed, Jan 5, 2011 at 4:09 PM, Dave Viner wrote: > > "a Column Family with the row key being the Unix time divided by 60x60 and > a column key of... pretty much anything unique" > LogCF[hour-day-in-epoch-seconds][timeuuid] = 1 > where 'hour-day-in-epoch-seconds' is something like the first s

Re: anyone using Cassandra as an analytics/data warehouse?

2011-01-04 Thread Jake Luciani
Some relevant information here: https://www.cloudkick.com/blog/2010/mar/02/4_months_with_cassandra/ On Tue, Jan 4, 2011 at 10:09 PM, Dave Viner wrote: > Hi Peter, > > Thanks. These are great ideas. One comment tho. I'm actually not as > worried about the "logging into the system" performance

Re: anyone using Cassandra as an analytics/data warehouse?

2011-01-04 Thread Dave Viner
Hi Peter, Thanks. These are great ideas. One comment tho. I'm actually not as worried about the "logging into the system" performance and more speculating/imagining the querying out of the system. Most traditional data warehouses have a cube or a star schema or something similar. I'm trying t

Re: anyone using Cassandra as an analytics/data warehouse?

2011-01-04 Thread Peter Harrison
Okay, here is two ways to handle this, both are quite different from each other. A) This approach does not depend on counters. You simply have a Column Family with the row key being the Unix time divided by 60x60 and a column key of... pretty much anything unique. Then have another process look

anyone using Cassandra as an analytics/data warehouse?

2011-01-04 Thread Dave Viner
Does anyone use Cassandra to power an analytics or data warehouse implementation? As a concrete example, one could imagine Cassandra storing data for something that reports on page-views on a website. The basic notions might be simple (url as row-key and columns as timeuuids of viewers). But, ho