Re: Dropping Creating Column Families Never Returns

2011-02-15 Thread William R Speirs
What would/could take so long for the nodes to agree? It's a small cluster (7 nodes) all on local LAN and not being used by anything else. I think a delete refresh might be in order... Thanks! Bill- On 02/15/2011 09:13 PM, Jonathan Ellis wrote: command never returns means it's waiting for

Re: postgis cassandra?

2011-02-05 Thread William R Speirs
I know nothing about postgis and little about spacial data, but if you're simply talking about data that relates to some latitude longitude pair, you could have your row key simply be the concatenation of the two: lat:long. Can you provide more details about the type of data you're looking to

Re: Schema Design Question : Supercolumn family or just a Standard column family with columns containing serialized aggregate data?

2011-02-02 Thread William R Speirs
To reiterate, so I know we're both on the same page, your schema would be something like this: - A column family (as you describe) to store the details of a reminder. One reminder per row. The row key would be a TimeUUID. - A super column family to store the reminders for each user, for each

Re: Schema Design Question : Supercolumn family or just a Standard column family with columns containing serialized aggregate data?

2011-02-02 Thread William R Speirs
Any time I see/hear a single row containing all ... I get nervous. That single row is going to reside on a single node. That is potentially a lot of load (don't know the system) for that single node. Why wouldn't you split it by at least user? If it won't be a lot of load, then why are you

Re: Schema Design Question : Supercolumn family or just a Standard column family with columns containing serialized aggregate data?

2011-02-02 Thread William R Speirs
I did not understand before... sorry. Again, depending upon how many reminders you have for a single user, this could be a long/wide row. Again, it really comes down to how many reminders are we talking about and how often will they be read/written. While a single row can contain millions

Re: cassandra as session store

2011-02-01 Thread William R Speirs
I'm still very new to Cassandra, but when I started reading about it the first thing I thought about was a session store. It's based (in part from what I understand) on Dynamo which is (again, I could be wrong) used at Amazon as the session store for your shopping cart. So I would certainly

Re: Is it recommended to store two types of data (not related to each other but need to be retrieved together) in one super column family ?

2011-01-29 Thread William R Speirs
I'm very new to Cassandra, but I'll pitch in my $0.02. Row look-ups are super fast, why do you think it would be more efficient to store these two rows together in the super column method you describe? Why would you not just look-up the rows, one after the other? If I understand correctly,

Re: Schema Design

2011-01-26 Thread William R Speirs
It makes sense that the single row for a system (with a growing number of columns) will reside on a single machine. With that in mind, here is my updated schema: - A single column family for all the messages. The row keys will be the TimeUUID of the message with the following columns:

Re: Schema Design

2011-01-26 Thread William R Speirs
Ah, sweet... thanks for the link! Bill- On 01/26/2011 08:20 PM, buddhasystem wrote: Bill, it's all explained here: http://wiki.apache.org/cassandra/MemtableThresholds#JVM_Heap_Size,the Watch the number of CFs and the memtable sizes. In my experience, this all matters.