Re: Problem in getting data from a 2 node cluster

2012-06-08 Thread aaron morton

DataStax has some good documentation here:
http://www.datastax.com/docs/1.1/index

Cheers


-
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com

On 8/06/2012, at 2:50 PM, Prakrati Agrawal wrote:

> What is the default replication factor? I did not set any replication factor.
> 
> Prakrati Agrawal | Developer - Big Data(I&D)| 9731648376 | www.mu-sigma.com
> 
> 
> This email message may contain proprietary, private and confidential 
> information. The information transmitted is intended only for the person(s) 
> or entities to which it is addressed. Any review, retransmission, 
> dissemination or other use of, or taking of any action in reliance upon, this 
> information by persons or entities other than the intended recipient is 
> prohibited and may be illegal. If you received this in error, please contact 
> the sender and delete the message from your system.
> 
> Mu Sigma takes all reasonable steps to ensure that its electronic 
> communications are free from viruses. However, given Internet accessibility, 
> the Company cannot accept liability for any virus introduced by this e-mail 
> or any attachment and you are advised to use up-to-date virus checking 
> software.



Re: Problem in getting data from a 2 node cluster of Cassandra

2012-06-08 Thread rohit bhatia
Run nodetool -h localhost cfstats on each node. It reports node-specific,
per-column-family statistics.
Run it on both nodes and compare the results.

On Fri, Jun 8, 2012 at 12:46 PM, Prakrati Agrawal
 wrote:
> Yes, the code is the same for both the 1-node and 2-node clusters. It is Hector 
> code. How do I get the number of rows and columns from the Cassandra CLI, given 
> that the data is very large?
>
> Thanks and Regards
> Prakrati
>
>


RE: Problem in getting data from a 2 node cluster of Cassandra

2012-06-08 Thread Prakrati Agrawal
Yes, the code is the same for both the 1-node and 2-node clusters. It is Hector 
code. How do I get the number of rows and columns from the Cassandra CLI, given 
that the data is very large?

Thanks and Regards
Prakrati




Re: Problem in getting data from a 2 node cluster of Cassandra

2012-06-08 Thread Roshni Rajagopal
Hi Prakrati,

In an ideal situation, no data should be lost when a node is added. How are 
you getting the statistics below?
The output below looks like it's from some code using Hector or Thrift. Is the 
code that gets the statistics exactly the same for the 1-node and 2-node 
clusters, with the only change being a node being added or removed?
Could you verify the number of rows and columns in the column family using the 
CLI or CQL?

Regards,
Roshni





This email and any files transmitted with it are confidential and intended 
solely for the individual or entity to whom they are addressed. If you have 
received this email in error destroy it immediately. *** Walmart Confidential 
***


Problem in getting data from a 2 node cluster of Cassandra

2012-06-07 Thread Prakrati Agrawal
Dear all

I originally had a 1-node cluster. Then I added one more node to it, with the 
initial token configured appropriately. Now when I run my queries I do not get 
all my data, i.e. not all the columns come back.

Output on 2 nodes:
Time taken to retrieve columns 43707 of key range is 1276
Time taken to retrieve columns 2084199 of all tickers is 54334
Time taken to count is 230776
Total number of rows in the database are 183
Total number of columns in the database are 7903753

Output on 1 node:
Time taken to retrieve columns 43707 of key range is 767
Time taken to retrieve columns 382 of all tickers is 52793
Time taken to count is 268135
Total number of rows in the database are 396
Total number of columns in the database are 16316426

Please help me. Where is my data going, and how should I retrieve it? I have 
the consistency level specified as ONE and I did not specify any replication 
factor.



Prakrati Agrawal | Developer - Big Data(I&D)| 9731648376 | www.mu-sigma.com
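
A quick sanity check on the totals in the post above (a sketch added for illustration, not part of the original message): the 2-node run returns a little under half of the rows and columns that the 1-node run returned. That is roughly what one might expect if the key range now assigned to the new node came back empty, but this is only an inference from the counts; the thread does not confirm it. The variable and function names are made up for this example.

```python
# Totals copied from the program output in the post above.
one_node = {"rows": 396, "columns": 16316426}
two_node = {"rows": 183, "columns": 7903753}


def fraction_returned(after, before):
    """Return, per metric, the fraction of the 1-node total still returned."""
    return {name: after[name] / before[name] for name in before}


ratios = fraction_returned(two_node, one_node)
for name, ratio in ratios.items():
    # Both ratios come out slightly below one half.
    print(f"{name}: {ratio:.1%} of the single-node count")
```

Comparing these ratios with the per-node figures from nodetool cfstats would show whether the missing half is still sitting on the original node.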





RE: Problem in getting data from a 2 node cluster

2012-06-07 Thread Prakrati Agrawal
What is the default replication factor? I did not set any replication factor.

Prakrati Agrawal | Developer - Big Data(I&D)| 9731648376 | www.mu-sigma.com






RE: Problem in getting data from a 2 node cluster

2012-06-06 Thread Tim Wintle
On Wed, 2012-06-06 at 06:54 -0500, Prakrati Agrawal wrote:
> This node will not auto bootstrap because it is configured to be a
> seed node

This means the cassandra.yaml on that node references itself as a seed
node.


After you decommission the second node, can you still access the entire
dataset in the single-node cluster, or has it been lost along the way?

What is the replication factor for your data?


Tim Wintle
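
To make the point about the seed list concrete, here is a minimal Python sketch (not Cassandra code) of the check described above: a node whose own address appears in its seeds list does not auto bootstrap. The function name and the example seed strings are hypothetical. The two addresses are the ones shown in the nodetool ring output elsewhere in this thread; the thread does not say what the seed lists actually contained.

```python
def skips_auto_bootstrap(seeds, own_address):
    """Return True if own_address is listed in the comma-separated seeds string.

    `seeds` stands in for the value of the seeds setting in cassandra.yaml;
    `own_address` stands in for the address the node announces to the cluster.
    """
    seed_set = {entry.strip() for entry in seeds.split(",") if entry.strip()}
    return own_address in seed_set


# Hypothetical configuration: the new node lists itself as a seed.
print(skips_auto_bootstrap("162.192.100.16, 162.192.100.48", "162.192.100.48"))

# Hypothetical configuration: only the original node is a seed.
print(skips_auto_bootstrap("162.192.100.16", "162.192.100.48"))
```

In the first case the new node would log the "will not auto bootstrap" message; in the second it would be free to bootstrap from the existing node.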




RE: Problem in getting data from a 2 node cluster

2012-06-06 Thread Prakrati Agrawal
I will repeat my query once again:
I had a 1-node cluster. Then I added 1 more node to it.
When I ran my query on the 1-node cluster I got all my data, but when I run the 
same query on the 2-node cluster (Hector code) I do not get the same data.
How do I ensure that my Hector code retrieves data from all the nodes?

Also, when I decommission my node and then add it again, I get the following 
message:
This node will not auto bootstrap because it is configured to be a seed node
Please tell me what it means as well.
The things I have already tried are:

1. Used CassandraHostConfigurator - still the same issue
2. Ran nodetool repair on both nodes - still the same issue

Please help me out. I am badly stuck.

Thanks and Regards

Prakrati Agrawal | Developer - Big Data(I&D)| 9731648376 | www.mu-sigma.com

From: Prakrati Agrawal [mailto:prakrati.agra...@mu-sigma.com]
Sent: Wednesday, June 06, 2012 4:41 PM
To: user@cassandra.apache.org
Subject: RE: Problem in getting data from a 2 node cluster

I even used CassandraHostConfigurator and added a string of hosts, but I still 
have the same issue. Could someone please help me?

Thanks and Regards

Prakrati Agrawal | Developer - Big Data(I&D)| 9731648376 | www.mu-sigma.com

From: Prakrati Agrawal [mailto:prakrati.agra...@mu-sigma.com]
Sent: Wednesday, June 06, 2012 4:04 PM
To: user@cassandra.apache.org
Subject: RE: Problem in getting data from a 2 node cluster

Yes, I ran nodetool repair as well. I still have the same problem: I get less 
data when using my code on the 2-node cluster. Please help me.

Prakrati Agrawal | Developer - Big Data(I&D)| 9731648376 | www.mu-sigma.com

From: R. Verlangen [mailto:ro...@us2.nl]
Sent: Wednesday, June 06, 2012 4:01 PM
To: user@cassandra.apache.org
Subject: Re: Problem in getting data from a 2 node cluster

Repair ensures that all data is consistent and available on the node.
2012/6/6 Prakrati Agrawal <prakrati.agra...@mu-sigma.com>
When I run the nodetool command I get the following information:
./nodetool -h localhost ring
Address         DC          Rack   Status  State   Load       Effective-Owership  Token
                                                                                  85070591730234615865843651857942052864
162.192.100.16  datacenter1 rack1  Up      Normal  238.22 MB  50.00%              0
162.192.100.48  datacenter1 rack1  Up      Normal  115.6 MB   50.00%              85070591730234615865843651857942052864

Please help me

Thanks and Regards

Prakrati Agrawal | Developer - Big Data(I&D)| 9731648376 | www.mu-sigma.com

From: Prakrati Agrawal [mailto:prakrati.agra...@mu-sigma.com]
Sent: Wednesday, June 06, 2012 3:55 PM
To: user@cassandra.apache.org
Subject: RE: Problem in getting data from a 2 node cluster

What does repair do?

Prakrati Agrawal | Developer - Big Data(I&D)| 9731648376 | www.mu-sigma.com

From: R. Verlangen [mailto:ro...@us2.nl]
Sent: Wednesday, June 06, 2012 3:56 PM
To: user@cassandra.apache.org
Subject: Re: Problem in getting data from a 2 node cluster

Did you run repair on the new node?
2012/6/6 Prakrati Agrawal <prakrati.agra...@mu-sigma.com>
Dear all,

I had a 1-node cluster. Then I added 1 more node to it.
When I ran my query on the 1-node cluster I got all my data, but when I run the 
same query on the 2-node cluster (Hector code) I do not get the same data.
How do I ensure that my Hector code retrieves data from all the nodes?

Also, when I decommission my node and then add it again, I get the following 
message:
This node will not auto bootstrap because it is configured to be a seed node
Please tell me what it means as well.

Thanks and Regards
Prakrati

Prakrati Agrawal | Developer - Big Data(I&D)| 9731648376 | www.mu-sigma.com






--
With kind regards,

Robin Verlangen
Software engineer

W http://www.robinverlangen.nl
E ro...@us2.nl
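
The two tokens in the nodetool ring output quoted above are what an even split of the ring gives for two nodes. A minimal Python sketch, assuming the cluster uses RandomPartitioner (the thread does not say so), whose token space runs from 0 to 2**127; the function name is made up for illustration:

```python
def balanced_tokens(node_count):
    """Return evenly spaced initial tokens for a ring of node_count nodes.

    Assumes a RandomPartitioner-style token space of size 2**127.
    """
    ring_size = 2 ** 127
    return [i * ring_size // node_count for i in range(node_count)]


# For two nodes this gives the same two tokens as the ring output above.
for index, token in enumerate(balanced_tokens(2)):
    print(f"node {index}: initial_token = {token}")
```

So the tokens themselves look balanced, which matches the 50.00% ownership reported for each node; the difference in Load between the two nodes is a separate question.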

RE: Problem in getting data from a 2 node cluster

2012-06-06 Thread Prakrati Agrawal
I even used CassandraHostConfigurator and added a string of hosts, but I still 
have the same issue. Could someone please help me?

Thanks and Regards

Prakrati Agrawal | Developer - Big Data(I&D)| 9731648376 | www.mu-sigma.com


Disclaimer: The information contained in this message and attachments is 
intended solely for the attention and use of the named addressee and may be 
confidential. If you are not the intended recipient, you are reminded that the 
information remains the property of the sender. You must not use, disclose, 
distribute, copy, print or rely on this e-mail. If you have received this 
message in error, please contact the sender immediately and irrevocably delete 
this message and any copies.




RE: Problem in getting data from a 2 node cluster

2012-06-06 Thread Prakrati Agrawal
Yes, I ran nodetool repair as well. I still have the same problem: I get less 
data when using my code on the 2-node cluster. Please help me.

Prakrati Agrawal | Developer - Big Data(I&D)| 9731648376 | www.mu-sigma.com


Re: Problem in getting data from a 2 node cluster

2012-06-06 Thread R. Verlangen
Repair ensures that all data is consistent and available on the node.

2012/6/6 Prakrati Agrawal 

> When I run the nodetool command I get the following information
>
> ./nodetool -h localhost ring
>
> Address         DC          Rack   Status State   Load       Effective-Owership  Token
>                                                                                  85070591730234615865843651857942052864
> 162.192.100.16  datacenter1 rack1  Up     Normal  238.22 MB  50.00%              0
> 162.192.100.48  datacenter1 rack1  Up     Normal  115.6 MB   50.00%              85070591730234615865843651857942052864
>
> Please help me
>
> Thanks and Regards
>
> Prakrati Agrawal | Developer - Big Data(I&D)| 9731648376 |
> www.mu-sigma.com
>
> *From:* Prakrati Agrawal [mailto:prakrati.agra...@mu-sigma.com]
> *Sent:* Wednesday, June 06, 2012 3:55 PM
> *To:* user@cassandra.apache.org
> *Subject:* RE: Problem in getting data from a 2 node cluster
>
> What does repair do?
>
> Prakrati Agrawal | Developer - Big Data(I&D)| 9731648376 |
> www.mu-sigma.com
>
> *From:* R. Verlangen [mailto:ro...@us2.nl]
> *Sent:* Wednesday, June 06, 2012 3:56 PM
> *To:* user@cassandra.apache.org
> *Subject:* Re: Problem in getting data from a 2 node cluster
>
> Did you run repair on the new node?
>
> 2012/6/6 Prakrati Agrawal
>
> Dear all,
>
> I had a 1 node cluster. Then I added 1 more node to it.
>
> When I ran my query on 1 node cluster I got all my data but when I ran my
> query on the 2 node cluster (Hector code) I am not getting the same data.
>
> How do I ensure that my Hector code retrieves data from all the nodes.
>
> Also when I decommission my node and then add it again I get the following
> message.
>
> This node will not auto bootstrap because it is configured to be a seed
> node
>
> Please tell me the meaning of it also
>
> Thanks and Regards
>
> Prakrati
>
> Prakrati Agrawal | Developer - Big Data(I&D)| 9731648376 |
> www.mu-sigma.com

RE: Problem in getting data from a 2 node cluster

2012-06-06 Thread Prakrati Agrawal
When I run the nodetool command I get the following information

./nodetool -h localhost ring

Address         DC          Rack   Status State   Load       Effective-Owership  Token
                                                                                 85070591730234615865843651857942052864
162.192.100.16  datacenter1 rack1  Up     Normal  238.22 MB  50.00%              0
162.192.100.48  datacenter1 rack1  Up     Normal  115.6 MB   50.00%              85070591730234615865843651857942052864
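
The two tokens and the 50.00% figures in the ring output above are consistent with evenly spaced tokens on a RandomPartitioner ring. The following is a minimal sketch of that arithmetic, assuming a token space of roughly 0 to 2**127; the helper names `initial_tokens` and `ownership` are illustrative and are not part of nodetool.

```python
RING_SIZE = 2 ** 127  # assumed size of the RandomPartitioner token space


def initial_tokens(node_count: int):
    """Return one evenly spaced token per node."""
    return [i * RING_SIZE // node_count for i in range(node_count)]


def ownership(tokens):
    """Map each token to the fraction of the ring it owns.

    A token owns the range (previous token, token], wrapping around the ring.
    """
    ordered = sorted(tokens)
    fractions = {}
    for index, token in enumerate(ordered):
        previous = ordered[index - 1]  # index -1 wraps to the last token
        # A single-node ring yields a span of 0, which means the whole ring.
        span = (token - previous) % RING_SIZE or RING_SIZE
        fractions[token] = span / RING_SIZE
    return fractions


tokens = initial_tokens(2)
print(tokens)
print(ownership(tokens))
```

For two nodes this gives the tokens 0 and 85070591730234615865843651857942052864, each owning half of the ring, which matches the output above. Equal ownership does not by itself say anything about how much data each node currently holds, as the differing Load values show.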

Please help me

Thanks and Regards

Prakrati Agrawal | Developer - Big Data(I&D)| 9731648376 | www.mu-sigma.com

From: Prakrati Agrawal [mailto:prakrati.agra...@mu-sigma.com]
Sent: Wednesday, June 06, 2012 3:55 PM
To: user@cassandra.apache.org
Subject: RE: Problem in getting data from a 2 node cluster

What does repair do?

Prakrati Agrawal | Developer - Big Data(I&D)| 9731648376 | www.mu-sigma.com

From: R. Verlangen [mailto:ro...@us2.nl]
Sent: Wednesday, June 06, 2012 3:56 PM
To: user@cassandra.apache.org
Subject: Re: Problem in getting data from a 2 node cluster

Did you run repair on the new node?
2012/6/6 Prakrati Agrawal <prakrati.agra...@mu-sigma.com>
Dear all,

I had a 1 node cluster. Then I added 1 more node to it.
When I ran my query on 1 node cluster I got all my data but when I ran my query 
on the 2 node cluster (Hector code) I am not getting the same data.
How do I ensure that my Hector code retrieves data from all the nodes.

Also when I decommission my node and then add it again I get the following 
message.
This node will not auto bootstrap because it is configured to be a seed node
Please tell me the meaning of it also

Thanks and Regards
Prakrati

Prakrati Agrawal | Developer - Big Data(I&D)| 9731648376 | www.mu-sigma.com




RE: Problem in getting data from a 2 node cluster

2012-06-06 Thread Prakrati Agrawal
What does repair do?

Prakrati Agrawal | Developer - Big Data(I&D)| 9731648376 | www.mu-sigma.com

From: R. Verlangen [mailto:ro...@us2.nl]
Sent: Wednesday, June 06, 2012 3:56 PM
To: user@cassandra.apache.org
Subject: Re: Problem in getting data from a 2 node cluster

Did you run repair on the new node?
2012/6/6 Prakrati Agrawal <prakrati.agra...@mu-sigma.com>
Dear all,

I had a 1 node cluster. Then I added 1 more node to it.
When I ran my query on 1 node cluster I got all my data but when I ran my query 
on the 2 node cluster (Hector code) I am not getting the same data.
How do I ensure that my Hector code retrieves data from all the nodes.

Also when I decommission my node and then add it again I get the following 
message.
This node will not auto bootstrap because it is configured to be a seed node
Please tell me the meaning of it also

Thanks and Regards
Prakrati

Prakrati Agrawal | Developer - Big Data(I&D)| 9731648376 | www.mu-sigma.com




Re: Problem in getting data from a 2 node cluster

2012-06-06 Thread R. Verlangen
Did you run repair on the new node?

2012/6/6 Prakrati Agrawal 

> Dear all,
>
> I had a 1 node cluster. Then I added 1 more node to it.
>
> When I ran my query on 1 node cluster I got all my data but when I ran my
> query on the 2 node cluster (Hector code) I am not getting the same data.
>
> How do I ensure that my Hector code retrieves data from all the nodes.
>
> Also when I decommission my node and then add it again I get the following
> message.
>
> This node will not auto bootstrap because it is configured to be a seed
> node
>
> Please tell me the meaning of it also
>
> Thanks and Regards
>
> Prakrati
>
> Prakrati Agrawal | Developer - Big Data(I&D)| 9731648376 |
> www.mu-sigma.com



-- 
With kind regards,

Robin Verlangen
Software engineer

W http://www.robinverlangen.nl
E ro...@us2.nl


Problem in getting data from a 2 node cluster

2012-06-06 Thread Prakrati Agrawal
Dear all,

I had a 1-node cluster. Then I added 1 more node to it.
When I ran my query on the 1-node cluster I got all my data, but when I ran my 
query on the 2-node cluster (Hector code) I did not get the same data.
How do I ensure that my Hector code retrieves data from all the nodes?

Also, when I decommission my node and then add it again, I get the following 
message:
This node will not auto bootstrap because it is configured to be a seed node
Please also tell me what this message means.

Thanks and Regards
Prakrati

Prakrati Agrawal | Developer - Big Data(I&D)| 9731648376 | www.mu-sigma.com
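
As explained in the replies to this message, the seed-node message means the cassandra.yaml on that node lists the node itself as a seed. A small sketch of that check follows, assuming the seeds are given as a comma-separated string of addresses, as in cassandra.yaml. The function name `is_seed` and the sample values are illustrative only and are not taken from the poster's configuration.

```python
def is_seed(listen_address: str, seeds: str) -> bool:
    """Return True when the node's own address appears in the seeds string."""
    seed_list = [seed.strip() for seed in seeds.split(",") if seed.strip()]
    return listen_address.strip() in seed_list


# Hypothetical values: the second node lists both nodes, including itself.
print(is_seed("162.192.100.48", "162.192.100.16, 162.192.100.48"))

# Hypothetical values: the second node lists only the first node as a seed.
print(is_seed("162.192.100.48", "162.192.100.16"))
```

A node for which this check is true is the kind of node the message refers to. Whether removing the node from its own seed list is the right fix for this cluster is not settled in the thread.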


