Hi all,
The RIT message shows the following:
Owner procedure: { ID => '73827', PARENT_ID => '73587', STATE =>
'WAITING_TIMEOUT', OWNER => 'hbase', TYPE => 'UnassignProcedure table=hds2_md5,
region=f25fe93e24b34cb2f7fffddee1d89eec, server=ba-hbase25.jumbo.hq.
com,16020,1604475904456', START_TIME => 'Thu Feb 18 06:31:06 CET 2021',
LAST_UPDATE => 'Fri Feb 19 10:49:20 CET 2021', PARAMETERS => [ {
transitionState => 'REGION_TRANSITION_DISPATCH', regionInfo => { regionId =>
'1535957697205', tableName => { namespace => 'ZGVmYXVsdA==', qualifier =>
'aGRzMl9tZDU=' }, startKey => 'QkRGRkVFRg==', endKey => 'QkVBQTgyMUQy', offline
=> 'false', split => 'false', replicaId => '0' }, hostingServer => { hostName
=> 'ba-hbase25.jumbo.hq.eset.com', port => '16020', startCode =>
'1604475904456' }, attempt => '179' } ] }
HBASE master UI->Table details
Shows region 'hds2_md5,BDFFEEF,1535957697205.f25fe93e24b34cb2f7fffddee1d89eec'
as being on region server ba-hbase18.jumbo.hq.com
So, is the table hosted on server hbase25 and being moved TO hbase18?
For some reason the table is not enabled at this time.
Table hds2_md5
Table Attributes
Attribute Name
Value
Description
Enabled
false
Is the table enabled
Compaction
NONE
Is the table compacting
The table has be online to perform these kinds of moves, yes? A RIT is not
going to occur if the table is disabled, surely.
There was a network issue where net traffic went up on some paths as other
paths went down.
So one question could be: was the table taken offline during this unassign -
but then with more than 30000 regions it is likely that other assign/unassigns
were being carried out on this and other tables.
Or was the table disabled with a view to performing some fix on this RIT.
(currently, data 'owners' are unavailable for comment). Table has been offline
for (at least) one day.
One of the techies stopped the regionserver instance on the hbase25 mode to try
and force some movement.
Thanks in advance.