[ https://issues.apache.org/jira/browse/CASSANDRA-13885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16173049#comment-16173049 ]
Thomas Steinmaurer commented on CASSANDRA-13885: ------------------------------------------------ It is about ease the operational side and that 2.2+ is a major shift towards behaving differently and being much more complex when I simply want to run a full repair across my 9 node cluster on 2 small volume CFs on a daily basis (grace period = 72hr) and being used to so by running the following with 2.1 kicked off in parallel on all nodes: {code} nodetool repair -pr mykeyspace mycf1 mycf2 {code} Ok, I learned incremental repair being the default since 2.2+, so I need to additionally apply the -full option. Ok, not a big deal, but when running the following with 3.0.14, again kicked off in parallel on all nodes: {code} nodetool repair -full -pr mykeyspace mycf1 mycf2 {code} I start to see basically the following nodetool output: {code} ... [2017-09-20 11:34:49,968] Some repair failed [2017-09-20 11:34:49,968] Repair command #8 finished in 0 seconds error: Repair job has failed with the error message: [2017-09-20 11:34:49,968] Some repair failed -- StackTrace -- java.lang.RuntimeException: Repair job has failed with the error message: [2017-09-20 11:34:49,968] Some repair failed at org.apache.cassandra.tools.RepairRunner.progress(RepairRunner.java:115) at org.apache.cassandra.utils.progress.jmx.JMXNotificationProgressListener.handleNotification(JMXNotificationProgressListener.java:77) at com.sun.jmx.remote.internal.ClientNotifForwarder$NotifFetcher.dispatchNotification(ClientNotifForwarder.java:583) at com.sun.jmx.remote.internal.ClientNotifForwarder$NotifFetcher.doRun(ClientNotifForwarder.java:533) at com.sun.jmx.remote.internal.ClientNotifForwarder$NotifFetcher.run(ClientNotifForwarder.java:452) at com.sun.jmx.remote.internal.ClientNotifForwarder$LinearExecutor$1.run(ClientNotifForwarder.java:108) {code} > Allow to run full repairs in 3.0 without additional cost of anti-compaction > --------------------------------------------------------------------------- > > Key: CASSANDRA-13885 > URL: https://issues.apache.org/jira/browse/CASSANDRA-13885 > Project: Cassandra > Issue Type: Bug > Reporter: Thomas Steinmaurer > > This ticket is basically the result of the discussion in Cassandra user list: > https://www.mail-archive.com/user@cassandra.apache.org/msg53562.html > I was asked to open a ticket by Paulo Motta to think about back-porting > running full repairs without the additional cost of anti-compaction. > Basically there is no way in 3.0 to run full repairs from several nodes > concurrently without troubles caused by (overlapping?) anti-compactions. > Coming from 2.1 this is a major change from an operational POV, basically > breaking any e.g. cron job based solution kicking off -pr based repairs on > several nodes concurrently. -- This message was sent by Atlassian JIRA (v6.4.14#64029) --------------------------------------------------------------------- To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org