[ 
https://issues.apache.org/jira/browse/HBASE-25007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17194274#comment-17194274
 ] 

Duo Zhang commented on HBASE-25007:
-----------------------------------

I think after a master restart, we will reload meta so we will know the regions 
on the 'unknown servers', as we will create ServerStateNode for them.

And the problem is that, we will not schedule SCP for them if something goes 
wrong. So I think a normal SCP is enough? Or at least, we do not need to scan 
meta to find out the regions on a 'unknown server'?

At least, if meta is on an 'unknown server', the only way to fix is to restart 
master, and then schedule a SCP manually for the region server. I do not think 
schedule a HBCKSCP without restarting master could help here, as we will not 
consider the region server as 'carryingMeta', so we will not try to assign meta.

In general, I think the HBCKChore should have two levels of scanning. First it 
should always scan the location of meta(fetch zk for now, or scan root after 
splittable meta, if we choose to use root table then there will be 3 
levels...), and if there are inconsistencies, stop and report it. And once meta 
is fine, we will scan meta to report other inconsistencies.

And if we find out that meta is already in inconsistent state, we schedule a 
SCP for the region server, to bring meta online. And then the HBCKChore could 
continue to show other inconsistencies, then we start a schedule other SCPs. 
Notice that, we should restart master first to let the master load the newest 
data in meta.

Thanks.

> Make HBCK2 work for 'root table'
> --------------------------------
>
>                 Key: HBASE-25007
>                 URL: https://issues.apache.org/jira/browse/HBASE-25007
>             Project: HBase
>          Issue Type: Sub-task
>          Components: hbck2
>            Reporter: Duo Zhang
>            Assignee: Duo Zhang
>            Priority: Major
>
> We  will also scan catalog table and fix them in HBCK2, we should add support 
> for root too.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to