Hi all, The RIT message shows the following:
Owner procedure: { ID => '73827', PARENT_ID => '73587', STATE => 'WAITING_TIMEOUT', OWNER => 'hbase', TYPE => 'UnassignProcedure table=hds2_md5, region=f25fe93e24b34cb2f7fffddee1d89eec, server=ba-hbase25.jumbo.hq. com,16020,1604475904456', START_TIME => 'Thu Feb 18 06:31:06 CET 2021', LAST_UPDATE => 'Fri Feb 19 10:49:20 CET 2021', PARAMETERS => [ { transitionState => 'REGION_TRANSITION_DISPATCH', regionInfo => { regionId => '1535957697205', tableName => { namespace => 'ZGVmYXVsdA==', qualifier => 'aGRzMl9tZDU=' }, startKey => 'QkRGRkVFRg==', endKey => 'QkVBQTgyMUQy', offline => 'false', split => 'false', replicaId => '0' }, hostingServer => { hostName => 'ba-hbase25.jumbo.hq.eset.com', port => '16020', startCode => '1604475904456' }, attempt => '179' } ] } HBASE master UI->Table details Shows region 'hds2_md5,BDFFEEF,1535957697205.f25fe93e24b34cb2f7fffddee1d89eec' as being on region server ba-hbase18.jumbo.hq.com So, is the table hosted on server hbase25 and being moved TO hbase18? For some reason the table is not enabled at this time. Table hds2_md5 Table Attributes Attribute Name Value Description Enabled false Is the table enabled Compaction NONE Is the table compacting The table has be online to perform these kinds of moves, yes? A RIT is not going to occur if the table is disabled, surely. There was a network issue where net traffic went up on some paths as other paths went down. So one question could be: was the table taken offline during this unassign - but then with more than 30000 regions it is likely that other assign/unassigns were being carried out on this and other tables. Or was the table disabled with a view to performing some fix on this RIT. (currently, data 'owners' are unavailable for comment). Table has been offline for (at least) one day. One of the techies stopped the regionserver instance on the hbase25 mode to try and force some movement. Thanks in advance.