Re: Compaction before Decommission and Bootstrapping

Jack Krupansky Sun, 17 Aug 2014 08:32:50 -0700

If you are migrating all nodes, you might want to consider creating a new data center, bringing up all the new nodes (bootstrap) in that new data center, and then decommissioning all the nodes in the old data center.

That way, the existing nodes remain fully operational during the process, and the new nodes are not available until the new data center is completely ready. And if something goes wrong, no harm to the existing nodes.


-- Jack Krupansky

-----Original Message-----
From: Robert Stupp
Sent: Sunday, August 17, 2014 11:17 AM
To: user@cassandra.apache.org
Subject: Re: Compaction before Decommission and Bootstrapping

In a few words:
Bootstrap one node at a time
Wait for bootstrap to complete
Next node
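
The "wait for bootstrap to complete" step can be checked from the output of `nodetool status`, where each node row starts with a two-letter code (`UN` for Up/Normal, `UJ` for Up/Joining). A minimal sketch in Python, assuming that row layout; the function names and the sample output (addresses, loads, host IDs) are made up for illustration:

```python
def node_states(status_text):
    """Map each node address to its two-letter code from `nodetool status`."""
    states = {}
    for line in status_text.splitlines():
        fields = line.split()
        # Node rows start with U/D (up/down) followed by N/L/J/M (state);
        # header and separator lines do not match this pattern.
        if (len(fields) >= 2 and len(fields[0]) == 2
                and fields[0][0] in "UD" and fields[0][1] in "NLJM"):
            states[fields[1]] = fields[0]
    return states


def safe_to_add_next_node(status_text):
    """True only when every listed node is Up/Normal (no join in progress)."""
    states = node_states(status_text)
    return bool(states) and all(code == "UN" for code in states.values())


# Illustrative output only, not captured from a real cluster.
SAMPLE = """\
Datacenter: datacenter1
=======================
Status=Up/Down
|/ State=Normal/Leaving/Joining/Moving
--  Address    Load       Tokens  Owns   Host ID    Rack
UN  10.0.0.1   1.2 GB     256     50.0%  host-id-1  rack1
UJ  10.0.0.2   300.5 MB   256     ?      host-id-2  rack1
"""

if __name__ == "__main__":
    print(node_states(SAMPLE))
    print(safe_to_add_next_node(SAMPLE))
```

With the sample above, the second node is still joining, so the check says not to start the next bootstrap yet.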

More details: datastax.com/docs (C* 2.0)

Before decommissioning: nodetool cleanup

Don't forget to do repairs (one node at a time) - this should be a regular admin task
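
One way to keep the order straight is to build the list of `nodetool` invocations up front and run them strictly one after another. A rough sketch in Python, assuming cleanup is run on each old node before any decommission starts; the helper names and host addresses are hypothetical, and only the `cleanup`, `decommission` and `repair` subcommands with the `-h` host option are used:

```python
import subprocess


def decommission_plan(old_hosts):
    """Commands, in order: cleanup on every old node, then decommission each."""
    plan = [["nodetool", "-h", host, "cleanup"] for host in old_hosts]
    plan += [["nodetool", "-h", host, "decommission"] for host in old_hosts]
    return plan


def repair_plan(hosts):
    """One repair command per node, meant to be run one node at a time."""
    return [["nodetool", "-h", host, "repair"] for host in hosts]


def run_plan(plan):
    """Run each command to completion before starting the next one."""
    for command in plan:
        # check=True stops the whole plan at the first failing command.
        subprocess.run(command, check=True)


if __name__ == "__main__":
    # Hypothetical addresses; print the plan instead of running it.
    for command in decommission_plan(["10.0.0.1", "10.0.0.2"]):
        print(" ".join(command))
```

Printing the plan first makes it easy to review before handing it to `run_plan`.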



--
Sent from my iPhone

On 17.08.2014 at 15:46, Maxime <maxim...@gmail.com> wrote:
Is there some unwritten wisdom with regard to the use of 'nodetool compact' before bootstrapping new nodes and decommissioning old ones?
TL;DR:
I've been spending the last few days trying to move a cluster on DigitalOcean 2GB machines to 4GB machines (same provider). To do so I wanted to create the new nodes, bootstrap them, then decommission the old ones (one by one seems to be the only available option).
The bootstrapping was failing; eventually I figured out it was somehow related to the TombstoneOverwhelmingException on the new nodes. I issued a 'nodetool compact' on the entire cluster to try to minimize the number of tombstones. Once that was done I was able to bootstrap all my new nodes.
Now is the time to decommission. From the very first node I tried to decommission I've been getting 1 node dying after an almost endless loop of "GC for ConcurrentMarkSweep" showing the heap getting fuller and fuller until the node dies. On one node I've been able to bump the MAX_HEAP_SIZE by 400MB and get it to work (it was a 4GB node), but now I'm getting the same symptoms on a 2GB node where the heap is as big as it can be before the OS runs out of RAM itself, so I can't expand the MAX_HEAP_SIZE. It would seem I have really painted myself into a scrap-the-cluster kind of corner.
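
For context on the heap numbers: when MAX_HEAP_SIZE is not set explicitly, the cassandra-env.sh script shipped with C* 2.0 is understood to pick it as max(min(1/2 RAM, 1024MB), min(1/4 RAM, 8GB)), and the young generation as min(100MB per core, 1/4 of the heap). A small sketch of that arithmetic in Python; the function names are made up here, and the formula should be checked against the cassandra-env.sh actually installed on the nodes:

```python
def default_max_heap_mb(system_memory_mb):
    """Assumed default: max(min(1/2 RAM, 1024MB), min(1/4 RAM, 8192MB))."""
    half_capped = min(system_memory_mb // 2, 1024)
    quarter_capped = min(system_memory_mb // 4, 8192)
    return max(half_capped, quarter_capped)


def default_heap_newsize_mb(max_heap_mb, cpu_cores):
    """Assumed default: min(100MB per core, 1/4 of the max heap)."""
    return min(100 * cpu_cores, max_heap_mb // 4)


if __name__ == "__main__":
    for ram_mb in (2048, 4096, 8192):
        print(ram_mb, default_max_heap_mb(ram_mb))
```

Under this formula, a machine reporting exactly 2048MB and one reporting 4096MB both end up with a 1024MB default heap, so moving from 2GB to 4GB nodes does not by itself give Cassandra more heap unless MAX_HEAP_SIZE is raised.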
Not knowing the inner workings of Cassandra's bootstrap and decommission mechanisms means all I can do is make an educated guess that perhaps doing another 'nodetool compact' on the nodes I'm about to decommission might help. However, I have not found any wisdom or documentation on anything relating to this, which I find surprising as I can't be the first to have had this problem.
BOTTOM LINE:
Does anyone have a real-world production process for efficiently and reliably bootstrapping and decommissioning nodes in a cluster? Seems it might look like <compact all>, <bootstrap one-by-one>, <compact all>, <decommission one-by-one (really?!?)>. Or are all my problems due to me running on "hardware" that doesn't have resources (RAM, CPU) to spare in the first place?
Thanks
