Creating namespace and column family from multiple nodes concurrently

2013-05-22 Thread Emalayan Vairavanathan
Hi all,



I am implementing a distributed application which runs on 100s of machines 
concurrently. This application is going to use Cassandra as underlaying storage.

The application creates the schema (name space and column families) during 
initialization phase.  It seems I have two options to create the schema.

Option - 1 : Using a single node for schema creation.
        Option - 2: Having all the nodes ( 100) to run the same schema 
creation logic (First, nodes will check whether the schema is already available 
and then try to create the schema if it is not available already).  

To keep the initialization phase simple, I prefer to go for Option - 2. However 
I am not sure how Cassandra is going to behave if multiple nodes try to create 
the same schema (namespace and column families) concurrently. It would be nice 
if someone can tell me about the implications of Option - 2 with Cassandra 
version 1.2.2.

Please let me know if you have question.

Thank you
VE

Re: Creating namespace and column family from multiple nodes concurrently

2013-05-22 Thread Michael Kjellman
You will most likely get better answers to your if you try the Cassandra
users group instead for your question.

Best,
Michael

On 5/22/13 5:52 PM, Emalayan Vairavanathan svemala...@yahoo.com wrote:

Hi all,



I am implementing a distributed application which runs on 100s of
machines concurrently. This application is going to use Cassandra as
underlaying storage.

The application creates the schema (name space and column families)
during initialization phase.  It seems I have two options to create the
schema.

Option - 1 : Using a single node for schema creation.
Option - 2: Having all the nodes ( 100) to run the same schema
creation logic (First, nodes will check whether the schema is already
available and then try to create the schema if it is not available
already).  

To keep the initialization phase simple, I prefer to go for Option - 2.
However I am not sure how Cassandra is going to behave if multiple nodes
try to create the same schema (namespace and column families)
concurrently. It would be nice if someone can tell me about the
implications of Option - 2 with Cassandra version 1.2.2.

Please let me know if you have question.

Thank you
VE