[jira] [Assigned] (KUDU-3346) Rebalance fails when trying to decommission tserver on a rack-aware cluster

2021-12-24 Thread YifanZhang (Jira)


 [ 
https://issues.apache.org/jira/browse/KUDU-3346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

YifanZhang reassigned KUDU-3346:


Assignee: YifanZhang

> Rebalance fails when trying to decommission tserver on a rack-aware cluster
> ---
>
> Key: KUDU-3346
> URL: https://issues.apache.org/jira/browse/KUDU-3346
> Project: Kudu
>  Issue Type: Bug
>Affects Versions: 1.15.0
>Reporter: Georgiana Ogrean
>Assignee: YifanZhang
>Priority: Major
> Attachments: rebalance_ignored_tserver_1c.log.Z, rebalance_v1.log.Z
>
>
> When following the steps [in the 
> docs|https://docs.cloudera.com/runtime/7.2.0/administering-kudu/topics/kudu-decommissioning-or-permanently-removing-tablet-server-from-cluster.html]
>  for decommissioning a tserver, the rebalance job fails with:
> {code:java}
> Invalid argument: ignored tserver  is not reported among know 
> tservers 
> {code}
> Steps followed:
> 1. Checked that ksck passes.
> 2. Put the tserver to be decommissioned in maintenance mode.
> {code:java}
> sudo -u kudu kudu tserver state enter_maintenance $MASTER_ADDRESSES 
> 5ae499b1b870419daabb0e8da90ef233 {code}
> 3. Ran rebalance with {{-ignored_tservers}} and 
> {{-move_replicas_from_ignored_tservers}} flags.
> {code:java}
> sudo -u kudu kudu cluster rebalance $MASTER_ADDRESSES 
> -move_replicas_from_ignored_tservers 
> -ignored_tservers=5ae499b1b870419daabb0e8da90ef233 -v=1{code}
> The logs for the rebalace command are attached.
>  



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Commented] (KUDU-3346) Rebalance fails when trying to decommission tserver on a rack-aware cluster

2021-12-24 Thread YifanZhang (Jira)


[ 
https://issues.apache.org/jira/browse/KUDU-3346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17464941#comment-17464941
 ] 

YifanZhang commented on KUDU-3346:
--

I think there is something wrong when populating 
`ClusterInfo::tservers_to_empty`, because sometimes the `ClusterRawInfo` only 
contains  tservers/tablets info of a specific location.  I plan to fix it.

> Rebalance fails when trying to decommission tserver on a rack-aware cluster
> ---
>
> Key: KUDU-3346
> URL: https://issues.apache.org/jira/browse/KUDU-3346
> Project: Kudu
>  Issue Type: Bug
>Affects Versions: 1.15.0
>Reporter: Georgiana Ogrean
>Priority: Major
> Attachments: rebalance_ignored_tserver_1c.log.Z, rebalance_v1.log.Z
>
>
> When following the steps [in the 
> docs|https://docs.cloudera.com/runtime/7.2.0/administering-kudu/topics/kudu-decommissioning-or-permanently-removing-tablet-server-from-cluster.html]
>  for decommissioning a tserver, the rebalance job fails with:
> {code:java}
> Invalid argument: ignored tserver  is not reported among know 
> tservers 
> {code}
> Steps followed:
> 1. Checked that ksck passes.
> 2. Put the tserver to be decommissioned in maintenance mode.
> {code:java}
> sudo -u kudu kudu tserver state enter_maintenance $MASTER_ADDRESSES 
> 5ae499b1b870419daabb0e8da90ef233 {code}
> 3. Ran rebalance with {{-ignored_tservers}} and 
> {{-move_replicas_from_ignored_tservers}} flags.
> {code:java}
> sudo -u kudu kudu cluster rebalance $MASTER_ADDRESSES 
> -move_replicas_from_ignored_tservers 
> -ignored_tservers=5ae499b1b870419daabb0e8da90ef233 -v=1{code}
> The logs for the rebalace command are attached.
>  



--
This message was sent by Atlassian Jira
(v8.20.1#820001)