[jira] [Commented] (HBASE-26884) Find unavailable regions by the startcode checking on hmaster start up and reassign them

2022-03-28 Thread Zheng Wang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-26884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17513293#comment-17513293
 ] 

Zheng Wang commented on HBASE-26884:


Seen this in 2.0(cdh6.0.1, at about 1 years ago) and 2.2.0 rencently, not sure 
it could happen without misoperation by user.   [~anoop.hbase] 

 

> Find unavailable regions by the startcode checking on hmaster start up and 
> reassign them
> 
>
> Key: HBASE-26884
> URL: https://issues.apache.org/jira/browse/HBASE-26884
> Project: HBase
>  Issue Type: Improvement
>  Components: master
>Reporter: Zheng Wang
>Assignee: Zheng Wang
>Priority: Major
>
> Sometimes we have seen there are regions in open or opening state, but does 
> not deployed on any rs and without procs for them, and afting  checking the 
> meta table, we find these startcode are expired. 
> It is no easy to reproduce, may be caused by corner bug or user misoperation.
> My approach is add some checking on hmaster start up, if the startcode of the 
> regionLocation expired, and neither TRSP on region nor SCP on regionserver, 
> then we should reassign the region, then we can resovle it easily just by 
> restart hmaster. 
> Hbck2 maybe also useful for some of them cases, but not easily for common 
> user to use, especially the number of these regions not small and need to be 
> recovery quickly.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Commented] (HBASE-26884) Find unavailable regions by the startcode checking on hmaster start up and reassign them

2022-03-28 Thread Anoop Sam John (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-26884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17513231#comment-17513231
 ] 

Anoop Sam John commented on HBASE-26884:


Exact which version you have seen this?

> Find unavailable regions by the startcode checking on hmaster start up and 
> reassign them
> 
>
> Key: HBASE-26884
> URL: https://issues.apache.org/jira/browse/HBASE-26884
> Project: HBase
>  Issue Type: Improvement
>  Components: master
>Reporter: Zheng Wang
>Assignee: Zheng Wang
>Priority: Major
>
> Sometimes we have seen there are regions in open or opening state, but does 
> not deployed on any rs and without procs for them, and afting  checking the 
> meta table, we find these startcode are expired. 
> It is no easy to reproduce, may be caused by corner bug or user misoperation.
> My approach is add some checking on hmaster start up, if the startcode of the 
> regionLocation expired, and neither TRSP on region nor SCP on regionserver, 
> then we should reassign the region, then we can resovle it easily just by 
> restart hmaster. 
> Hbck2 maybe also useful for some of them cases, but not easily for common 
> user to use, especially the number of these regions not small and need to be 
> recovery quickly.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)