Hi, I'm using Cassandra 0.4.2 at my current client to persist URL graphs for Spam detection. The crawling and page classification is done in Hadoop/Bixo/Cascading, which persists URL classification results into Cassandra. The incoming production traffic is using Cassandra for the real-time spam score lookup to determine the spammyness of a URL.
It started out as a prototype and is currently in production with 4 Cassandra nodes (for the last >3 weeks). Sometimes Cassandra is a little rough on the edges, but in general it works. Wishes: - data rebalancing - proper MapReduce support (ideally supporting the same API HBase uses, so one could use the same eco-system) - node decommissioning On Fri, Nov 20, 2009 at 1:17 PM, Jonathan Ellis <jbel...@gmail.com> wrote: > Hi all, > > I'd love to get a better feel for who is using Cassandra and what kind > of applications it is seeing. If you are using Cassandra, could you > share what you're using it for and what stage you are at with it > (evaluation / testing / production)? Also, what alternatives you > evaluated/are evaluating would be useful. Finally, feel free to throw > in "I'd love to use Cassandra if only it did X" wishes. :) > > I can start: Rackspace is using Cassandra for stats collection > (testing, almost production) and as a backend for the Mail & Apps > division (early testing). We evaluated HBase, Hypertable, dynomite, > and Voldemort as well. > > Thanks, > > -Jonathan > > (If you're in stealth mode or don't want to say anything in public, > feel free to reply to me privately and I will keep it off the record.) >