> A.) store ALL the data associated with the user onto a single users row-key. 
> Some user keys may be small, others may get larger over time depending upon 
> activity.
I would go with this.
The important thing is supporting the read queries. 

Cheers
Aaron

-----------------
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com

On 28/02/2012, at 7:40 PM, Blake Starkenburg wrote:

> Using a user/member as an example I am curious which of the data models would 
> be the best fit for performance and longevity of data in Cassandra? 
> 
> Consider the simple staples of user/member details like 
> username,email,address,state,preferences,etc. Fairly simple, storing this 
> data into a row key users->username[email], etc.
> 
> Now as time goes on more data such as snapshot changes like 
> users->username['change:123456] = 'changed email', etc. columns compound onto 
> the users row-key. Perhaps more preferences are added onto the row-key or 
> login information. I wouldn't expect the amount of columns to grow hugely, 
> but I've also learned to plan for the un-expected... 
> 
> Simplicity would tell me to:
> 
> A.) store ALL the data associated with the user onto a single users row-key. 
> Some user keys may be small, others may get larger over time depending upon 
> activity.
> 
> but would B be a better performance model
> 
> B.) Split out user data into seperate row-keys such as 
> users->changes_username['change123456] = 'changed email' AND 
> users->preferences_username['fav_color] = 'blue'. This would add a level of 
> complexity and in some cases tiny row-keys along with multiple fetches for 
> all user/member data?
> 
> Curious what your opinions are?
> 
> Thanks!

Reply via email to