Re: Data storage space unbalance issue

Elliott Sims Mon, 03 Dec 2018 22:30:57 -0800

It depends on the type of repair, but you'll want to make sure all the data
is where it should be before running cleanup.  Somewhat related, if you're
not running regular repairs already, you should be.  You can do it via
cron, but I strongly suggest checking out Reaper.
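
As a rough illustration of the ordering (repair everywhere first, cleanup only afterwards), here is a minimal Python sketch. The host list is taken from the datacenter2 section of the nodetool status output quoted below. The helper names `maintenance_plan` and `run_plan` are hypothetical, and running the commands over passwordless ssh is an assumption, not something from this thread. By default it only prints the plan.

```python
import subprocess

# datacenter2 addresses, copied from the quoted "nodetool status" output.
DC2_HOSTS = ["10.42.6.120", "10.42.5.207", "10.42.6.47", "10.42.6.48", "10.42.5.208"]


def maintenance_plan(hosts):
    """Return (host, command) steps: repair on every host, then cleanup on every host."""
    plan = [(host, ["nodetool", "repair"]) for host in hosts]
    # Cleanup comes only after all repairs, so data is in place before anything is dropped.
    plan += [(host, ["nodetool", "cleanup"]) for host in hosts]
    return plan


def run_plan(plan, dry_run=True):
    """Print each step; with dry_run=False, also run it over ssh (assumed to be set up)."""
    for host, cmd in plan:
        print(host, " ".join(cmd))
        if not dry_run:
            subprocess.run(["ssh", host] + cmd, check=True)


if __name__ == "__main__":
    run_plan(maintenance_plan(DC2_HOSTS))  # dry run: prints the ten steps in order
```

Whether a plain "nodetool repair" is the right invocation depends on the type of repair you want, so treat the command itself as a placeholder.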


On Wed, Nov 28, 2018, 8:05 PM Eunsu Kim <eunsu.bil...@gmail.com> wrote:

> Thank you for your response.
>
> Following your advice, I will run repair on datacenter2. Do I have to run
> repair on every node in datacenter2?
>
> No snapshots are listed when I check with nodetool listsnapshots.
>
> Thank you.
>
> On 29 Nov 2018, at 4:31 AM, Elliott Sims <elli...@backblaze.com> wrote:
>
> I think you answered your own question, sort of.
>
> When you expand a cluster, it copies the appropriate rows to the new
> node(s) but doesn't automatically remove them from the old nodes.  When you
> ran cleanup on datacenter1, it cleared out those old extra copies.  I would
> suggest running a repair first for safety on datacenter2, then a "nodetool
> cleanup" on those hosts.
>
> Also run "nodetool listsnapshots" to make sure you don't have any old
> snapshots sitting around taking up space.
>
> On Wed, Nov 28, 2018 at 5:29 AM Eunsu Kim <eunsu.bil...@gmail.com> wrote:
>
>> (I am sending the previous mail again because it seems that it has not
>> been sent properly.)
>>
>> Hi experts,
>>
>> I am running 2 datacenters each containing five nodes. (total 10 nodes,
>> all 3.11.3)
>>
>> One replica of my data is stored in each datacenter. (REPLICATION = { 'class' :
>> 'org.apache.cassandra.locator.NetworkTopologyStrategy', 'datacenter1':
>> '1', 'datacenter2': '1' })
>>
>> Most of my data has a short TTL (14 days). The gc_grace_seconds value for
>> all tables is 600 seconds.
>>
>> I expect the two datacenters to use the same amount of disk, but
>> datacenter2 is using more. It seems that the data in datacenter2 is rarely
>> deleted. While the disk usage for datacenter1 remains constant, the disk
>> usage for datacenter2 continues to grow.
>>
>> ——————
>> Datacenter: datacenter1
>> =======================
>> Status=Up/Down
>> |/ State=Normal/Leaving/Joining/Moving
>> --  Address       Load         Tokens  Owns (effective)  Host ID                               Rack
>> UN  10.61.58.228  925.48 GiB   256     21.5%             60d1bac8-b4d6-4e02-a05f-badee0bb36f5  rack1
>> UN  10.61.58.167  840 GiB      256     20.0%             a04fc77a-907f-490c-971c-4e1f964c7b14  rack1
>> UN  10.61.75.86   1.13 TiB     256     19.3%             618c101b-036d-42e7-bf9f-2bcbd429cbd1  rack1
>> UN  10.61.59.22   844.19 GiB   256     20.0%             d8a4a165-13f0-4f4a-9278-4024730b8116  rack1
>> UN  10.61.59.82   737.88 GiB   256     19.2%             054a4eb5-6d1c-46fa-b550-34da610da4e0  rack1
>> Datacenter: datacenter2
>> =======================
>> Status=Up/Down
>> |/ State=Normal/Leaving/Joining/Moving
>> --  Address       Load         Tokens  Owns (effective)  Host ID                               Rack
>> UN  10.42.6.120   1.11 TiB     256     18.6%             69f15be0-e5a1-474e-87cf-b063e6854402  rack1
>> UN  10.42.5.207   1.17 TiB     256     20.0%             f78bdce5-cb01-47e0-90b9-fcc31568e49e  rack1
>> UN  10.42.6.47    1.01 TiB     256     20.1%             3ff93b47-2c15-4e1a-a4ea-2596f26b4281  rack1
>> UN  10.42.6.48    1007.67 GiB  256     20.4%             8cbbe76d-6496-403a-8b09-fe6812c9dea2  rack1
>> UN  10.42.5.208   1.29 TiB     256     20.9%             4aa96c6a-6083-417f-a58a-ec847bcbfc7e  rack1
>> ------------------
>>
>> A few days ago, one node of datacenter1 broke down. I replaced it and then
>> ran rebuild, repair, and cleanup.
>>
>>
>> What else can I do?
>>
>> Thank you in advance.
>>
>
>
