Prathyusha created HBASE-29256:
----------------------------------
Summary: Multiple Split Procedures on same region stuck
indefinitely waiting for Exclusive Lock
Key: HBASE-29256
URL: https://issues.apache.org/jira/browse/HBASE-29256
Project: HBase
Issue Type: Improvement
Reporter: Prathyusha
Multiple Split Procedures on same region got stuck indefinitely waiting for
Exclusive Lock help by the first Split Procedure created on the region and that
procedure wasnt scheduled for almost a week till HMaster restart happened.
First SplitProcedure created failed to update procedure storeĀ
{color:#4c9aff}_ERROR [PEWorker-25] region.RegionProcedureStore - Failed to
update proc pid=966118, state=RUNNABLE:SPLIT_TABLE_REGION_PREPARE, locked=true;
SplitTableRegionProcedure table=_tablename_, parent=_parent-XXX_,
daughterA=_daughter1-xxx_, daughterB=_daughter2-xxx_
java.io.InterruptedIOException: No ack received after 25s and a timeout of 25s
at org.apache.hadoop.hdfs.DataStreamer.waitForAckedSeqno(DataStreamer.java:938)
at org.apache.hadoop.hdfs.DFSOutputStream.flushOrSync(DFSOutputStream.java:692)
at
org.apache.hadoop.hdfs.DFSOutputStream.hflush(DFSOutputStream.java:580)_{color}
All the rest of the SplitProcedures were waiting on the Exclusive lock held by
above pid, and the first one never got rescheduled till a HMaster restart.
{color:#4c9aff}_assignment.SplitTableRegionProcedure - LOCK_EVENT_WAIT
serverLocks={}, namespaceLocks={{default=exclusiveLockOwner=NONE,
sharedLockCount=1, waitingProcCount=0}},
tableLocks={{tsdb=exclusiveLockOwner=NONE, sharedLockCount=1,
waitingProcCount=0}}, regionLocks={{parent-XXX=exclusiveLockOwner=966118,
sharedLockCount=0, waitingProcCount=8043}}, peerLocks={}, metaLocks={}_{color}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)