[jira] [Assigned] (KUDU-3346) Rebalance fails when trying to decommission tserver on a rack-aware cluster
[ https://issues.apache.org/jira/browse/KUDU-3346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] YifanZhang reassigned KUDU-3346: Assignee: YifanZhang > Rebalance fails when trying to decommission tserver on a rack-aware cluster > --- > > Key: KUDU-3346 > URL: https://issues.apache.org/jira/browse/KUDU-3346 > Project: Kudu > Issue Type: Bug >Affects Versions: 1.15.0 >Reporter: Georgiana Ogrean >Assignee: YifanZhang >Priority: Major > Attachments: rebalance_ignored_tserver_1c.log.Z, rebalance_v1.log.Z > > > When following the steps [in the > docs|https://docs.cloudera.com/runtime/7.2.0/administering-kudu/topics/kudu-decommissioning-or-permanently-removing-tablet-server-from-cluster.html] > for decommissioning a tserver, the rebalance job fails with: > {code:java} > Invalid argument: ignored tserver is not reported among know > tservers > {code} > Steps followed: > 1. Checked that ksck passes. > 2. Put the tserver to be decommissioned in maintenance mode. > {code:java} > sudo -u kudu kudu tserver state enter_maintenance $MASTER_ADDRESSES > 5ae499b1b870419daabb0e8da90ef233 {code} > 3. Ran rebalance with {{-ignored_tservers}} and > {{-move_replicas_from_ignored_tservers}} flags. > {code:java} > sudo -u kudu kudu cluster rebalance $MASTER_ADDRESSES > -move_replicas_from_ignored_tservers > -ignored_tservers=5ae499b1b870419daabb0e8da90ef233 -v=1{code} > The logs for the rebalace command are attached. > -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Commented] (KUDU-3346) Rebalance fails when trying to decommission tserver on a rack-aware cluster
[ https://issues.apache.org/jira/browse/KUDU-3346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17464941#comment-17464941 ] YifanZhang commented on KUDU-3346: -- I think there is something wrong when populating `ClusterInfo::tservers_to_empty`, because sometimes the `ClusterRawInfo` only contains tservers/tablets info of a specific location. I plan to fix it. > Rebalance fails when trying to decommission tserver on a rack-aware cluster > --- > > Key: KUDU-3346 > URL: https://issues.apache.org/jira/browse/KUDU-3346 > Project: Kudu > Issue Type: Bug >Affects Versions: 1.15.0 >Reporter: Georgiana Ogrean >Priority: Major > Attachments: rebalance_ignored_tserver_1c.log.Z, rebalance_v1.log.Z > > > When following the steps [in the > docs|https://docs.cloudera.com/runtime/7.2.0/administering-kudu/topics/kudu-decommissioning-or-permanently-removing-tablet-server-from-cluster.html] > for decommissioning a tserver, the rebalance job fails with: > {code:java} > Invalid argument: ignored tserver is not reported among know > tservers > {code} > Steps followed: > 1. Checked that ksck passes. > 2. Put the tserver to be decommissioned in maintenance mode. > {code:java} > sudo -u kudu kudu tserver state enter_maintenance $MASTER_ADDRESSES > 5ae499b1b870419daabb0e8da90ef233 {code} > 3. Ran rebalance with {{-ignored_tservers}} and > {{-move_replicas_from_ignored_tservers}} flags. > {code:java} > sudo -u kudu kudu cluster rebalance $MASTER_ADDRESSES > -move_replicas_from_ignored_tservers > -ignored_tservers=5ae499b1b870419daabb0e8da90ef233 -v=1{code} > The logs for the rebalace command are attached. > -- This message was sent by Atlassian Jira (v8.20.1#820001)