[ https://issues.apache.org/jira/browse/HBASE-21745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16888302#comment-16888302 ]
stack commented on HBASE-21745: ------------------------------- On... bq. Fix assignment errors (undeployed regions, double assignments (yes, should not be possible), etc) HBASE-22527 adds display on master UI of 'problematic regions' which are one of the following: * Master thought this region opened, but no regionserver reported it. * Master thought this region opened on Server1, but regionserver reported Server2 * More than one regionservers reported opened this region All above should be fixable with HBCK2 currently; what combination depends on the particular problem. For example, HBASE-22527 has case #1 above where a meta had a region assigned to a server no longer a member of the cluster (for whatever reason...). A recipe in HBASE-22527 shows one fix (I think there a more compact solution but in the heat of the moment... whatever works). For #2 and #3, Master used to tell disagreeing regionserver to kill itself because it was in disagreement with the Master's view of the world (but I think this killing was later undone). I think 'Fix assignment errors (undeployed regions, double assignments (yes, should not be possible), etc)' covered. Let me strike it out in the list above. > Make HBCK2 be able to fix issues other than region assignment > ------------------------------------------------------------- > > Key: HBASE-21745 > URL: https://issues.apache.org/jira/browse/HBASE-21745 > Project: HBase > Issue Type: Umbrella > Components: hbase-operator-tools, hbck2 > Reporter: Duo Zhang > Assignee: stack > Priority: Critical > > This is what [~apurtell] posted on mailing-list, HBCK2 should support > * -Rebuild meta from region metadata in the filesystem, aka offline meta > rebuild.- > * Fix assignment errors (undeployed regions, double assignments (yes, should > not be possible), etc) > * Fix region holes, overlaps, and other errors in the region chain > * Fix failed split and merge transactions that have failed to roll back due > to some bug (related to previous) > * -Enumerate store files to determine file level corruption and sideline > corrupt files- > * -Fix hfile link problems (dangling / broken)- -- This message was sent by Atlassian JIRA (v7.6.14#76016)