Re: Switching to Incremental Repair

Bowen Song via user Sat, 03 Feb 2024 08:41:08 -0800

Full repair running for an entire week sounds excessively long. Even ifyou've got 1 TB of data per node, 1 week means the repair speed is lessthan 2 MB/s, that's very slow. Perhaps you should focus on finding thebottleneck of the full repair speed and work on that instead.


On 03/02/2024 16:18, Sebastian Marsching wrote:

Hi,
2. use an orchestration tool, such as Cassandra Reaper, to take careof that for you. You will still need monitor and alert to ensure therepairs are run successfully, but fixing a stuck or failed repair isnot very time sensitive, you can usually leave it till Monday morningif it happens at Friday night.
Does anyone know how such a schedule can be created in Cassandra Reaper?
I recently learned the hard way that running both a full and anincremental repair for the same keyspace and table in parallel is nota good idea (it caused a very unpleasant overload situation on one ofour clusters).
At the moment, we have one schedule for the full repairs (every 90days) and another schedule for the incremental repairs (daily). But asfull repairs take much longer than a day (about a week, in our case),the two schedules collide. So, Cassandra Reaper starts an incrementalrepair while the full repair is still in process.
Does anyone know how to avoid this? Optimally, the full repair wouldbe paused (no new segments started) for the duration of theincremental repair. The second best option would be inhibiting theincremental repair while a full repair is in progress.
Best regards,
Sebastian

Re: Switching to Incremental Repair

Reply via email to