Sun Xin created HBASE-27476: ------------------------------- Summary: Recovered replication may be blocked if enabled hbase.separate.oldlogdir.by.regionserver Key: HBASE-27476 URL: https://issues.apache.org/jira/browse/HBASE-27476 Project: HBase Issue Type: Bug Components: Replication Affects Versions: 2.4.15, 3.0.0-alpha-3 Reporter: Sun Xin Assignee: Sun Xin
In other PR, I got a failed UT {code:java} [ERROR] Failures: [ERROR] org.apache.hadoop.hbase.replication.TestReplicationKillMasterRSWithSeparateOldWALs.killOneMasterRS [ERROR] Run 1: TestReplicationKillMasterRSWithSeparateOldWALs>TestReplicationKillMasterRS.killOneMasterRS:47->TestReplicationKillRS.loadTableAndKillRS:84 Waited too much time for queueFailover replication. Waited 61065ms. [ERROR] Run 2: TestReplicationKillMasterRSWithSeparateOldWALs>TestReplicationKillMasterRS.killOneMasterRS:47->TestReplicationKillRS.loadTableAndKillRS:84 Waited too much time for queueFailover replication. Waited 58864ms. [ERROR] Run 3: TestReplicationKillMasterRSWithSeparateOldWALs>TestReplicationKillMasterRS.killOneMasterRS:47->TestReplicationKillRS.loadTableAndKillRS:84 Waited too much time for queueFailover replication. Waited 57103ms. {code} This should be caused by a bug. If enabled {_}hbase.separate.oldlogdir.by.regionserver{_}, old wals will be moved into different dir by regionserver name like root/oldWALs/server1/wal1 . For recovered replication, can't convert wal path(like root/oldWALs/wal1) into such paths, and throws FileNotFoundException. -- This message was sent by Atlassian Jira (v8.20.10#820010)