Re: Effective allocation of multiple disks

2010-03-12 Thread Ryan King
D 0? > I would think software RAID 0 would be fine since there is no actual > computation being done... > Thanks! > > -Eric > > On Thu, Mar 11, 2010 at 1:16 PM, Ryan King wrote: >> >> Even without major compaction, you can get significant imbalances in >>

Re: Effective allocation of multiple disks

2010-03-11 Thread Ryan King
On Thu, Mar 11, 2010 at 10:45 AM, Jonathan Ellis wrote: > Except that for a major compaction the whole thing gets put in one > directory.  That's the problem w/ the JBOD approach. Even without major compaction, you can get significant imbalances in how much data is on each disk which will bottlen

Re: Hackathon?!?

2010-03-09 Thread Ryan King
I'm already committed to talking about cassandra that day at our company's developer conference (chirp.twitter.com). -ryan On Tue, Mar 9, 2010 at 6:26 PM, Jeff Hodges wrote: > I'm down. > -- > Jeff > > On Tue, Mar 9, 2010 at 6:18 PM, Jonathan Ellis wrote: >> I can make it. \o/ >> >> On Tue, Mar

Re: finding Cassandra servers

2010-03-03 Thread Ryan King
On Wed, Mar 3, 2010 at 9:27 AM, Eric Evans wrote: > On Wed, 2010-03-03 at 16:49 +, Christopher Brind wrote: >> So is the current general practice to connect to a known node, e.g. by >> ip address? > > There are so many ways you could tackle this but... > > If you're talking about provisioning/

Re: finding Cassandra servers

2010-03-03 Thread Ryan King
2010/3/3 Ted Zlatanov : > On Wed, 3 Mar 2010 09:04:37 -0800 Ryan King wrote: > > RK> Something like RRDNS is no more complex that managing a list of seed > nodes. > > How do your clients at Twitter find server nodes?  Do you just run them > local to each node? RRDNS +

Re: finding Cassandra servers

2010-03-03 Thread Ryan King
2010/3/3 Ted Zlatanov : > On Wed, 03 Mar 2010 10:43:19 -0600 Eric Evans wrote: > > EE> It's entirely possible that you've identified a problem that others > EE> can't see, or haven't yet encountered. I don't see it, but then maybe > EE> I'm just thick. > > Getting back to my original question, how

Re: Looking for work

2010-03-02 Thread Ryan King
I guess I should take this opportunity to say that we at Twitter are hiring aggressively for people to work on and around cassandra. Feel free to email me your resume. :) -ryan On Tue, Mar 2, 2010 at 6:01 PM, Peter Halliday wrote: > I'm looking for work.  My previous employer was a non-profit th

Re: Change partitioner from Random to OrderPreserving

2010-02-21 Thread Ryan King
On Sun, Feb 21, 2010 at 7:07 PM, JKnight JKnight wrote: > Dear all, > > I want to change partitioner from Random to OrderPreserving. How can I do > with old data? There's no easy way to do this. You'll likely have to reload your data. -ryan

Re: Hackathon?!?

2010-01-27 Thread Ryan King
I'd be interested in another hackathon and I'm sure we could help out in some way. An intro would be good, and I think we have a lot more people qualified to do so. -ryan On Wed, Jan 27, 2010 at 9:00 AM, Jonathan Ellis wrote: > At the hackathon Twitter hosted last year, a lot of people showed u

Re: Problems with FreeBSD

2010-01-16 Thread Ryan King
'^]'. >> >> >> This is the last part of the log: >> >> INFO - Starting up server gossip >> DEBUG - Starting to listen on localhost/127.0.0.1 >> DEBUG - Binding thrift service to /127.0.0.1:9160 >> INFO - Cassandra starting up... >> DEBUG - Dissemina

Re: Problems with FreeBSD

2010-01-16 Thread Ryan King
On Fri, Jan 15, 2010 at 5:18 PM, Andrew wrote: > Hi there, > > I'm trying to get Cassandra server up and going on a FreeBSD 7.1 box and I'm > experiencing some challenges. > > I would like to use the Ruby client, so I have followed the instructions > from the Up and Running with Cassandra post: >

cassandra gem 0.7 released

2010-01-08 Thread Ryan King
I just pushed an updated version of our gem that's compatible with 0.5. http://gemcutter.org/gems/cassandra -ryan

Re: Graduation

2010-01-08 Thread Ryan King
+1 from me, for sure. -ryan On Thu, Jan 7, 2010 at 10:56 AM, Eric Evans wrote: > > Greetings, > > It seems hard to believe, but it's been a little over a year now since > Cassandra entered the Apache Incubator. The project has made an > impressive amount of progress in that time, from expanding

Re: Images store in Cassandra

2009-12-12 Thread Ryan King
On Sat, Dec 12, 2009 at 12:05 PM, Ran Tavory wrote: > As we're designing our systems for a move from mysql to Cassandra we're > considering moving our file storage to Cassandra as well. Is this wise? > We're currently using mogilefs to store media items (images) of average size > of 30Mb (400k ima

Re: Map Reduce on Cassandra Store

2009-12-04 Thread Ryan King
On Fri, Dec 4, 2009 at 8:44 AM, Mark Vigeant wrote: > Hello! > > > > Has anyone tried to run MapReduce analytics on data stored in Cassandra? I > feel like I saw a patch once to get hadoop working on top of Cassandra, but > I can’t find it now. I know that Hadoop integration is big on people’s > w

Re: Cassandra users survey

2009-11-20 Thread Ryan King
At twitter we're working on using Cassandra to replace our currents storage for all tweets. We have a cluster in production that's being populated outside the the user-critical path (ie, the cassandra writing is async). Additionally, we're testing and evaluating for basically everything else in ou

Re: [VOTE] Website

2009-11-11 Thread Ryan King
Looks great. +1 -ryan On Wed, Nov 11, 2009 at 2:22 PM, Johan Oskarsson wrote: > +1. A great step forward from the current version and a good base to improve > upon. > > /Johan > > Eric Evans wrote: >> >> The current website is quite ugly, and I don't know about you, but I'm >> itching to put the

Re: data distribution among DataFileDirectories

2009-09-29 Thread Ryan King
On Tue, Sep 29, 2009 at 12:46 PM, Jonathan Ellis wrote: > On Tue, Sep 29, 2009 at 2:22 PM, Igor Katkov wrote: >> Does cassandra distributes keys evenly among DataFileDirectories? > > No, but it should distribute sstables evenly (which, on average, > should be distributing keys evenly, but there w

Re: Visual representation of Cassandra data model

2009-08-12 Thread Ryan King
A few quick comments: * its not clear what column family the super column you're using is in. * it might be useful to include the timestamps in the columns (since they're user-supplied) * given that the colon-delimited api has been removed, it might be easier to explain the data model without such