Sun Xin created HBASE-27476:
-------------------------------

             Summary: Recovered replication may be blocked if enabled 
hbase.separate.oldlogdir.by.regionserver
                 Key: HBASE-27476
                 URL: https://issues.apache.org/jira/browse/HBASE-27476
             Project: HBase
          Issue Type: Bug
          Components: Replication
    Affects Versions: 2.4.15, 3.0.0-alpha-3
            Reporter: Sun Xin
            Assignee: Sun Xin


In other PR, I got a failed UT
{code:java}
[ERROR] Failures: 
[ERROR] 
org.apache.hadoop.hbase.replication.TestReplicationKillMasterRSWithSeparateOldWALs.killOneMasterRS
[ERROR]   Run 1: 
TestReplicationKillMasterRSWithSeparateOldWALs>TestReplicationKillMasterRS.killOneMasterRS:47->TestReplicationKillRS.loadTableAndKillRS:84
 Waited too much time for queueFailover replication. Waited 61065ms.
[ERROR]   Run 2: 
TestReplicationKillMasterRSWithSeparateOldWALs>TestReplicationKillMasterRS.killOneMasterRS:47->TestReplicationKillRS.loadTableAndKillRS:84
 Waited too much time for queueFailover replication. Waited 58864ms.
[ERROR]   Run 3: 
TestReplicationKillMasterRSWithSeparateOldWALs>TestReplicationKillMasterRS.killOneMasterRS:47->TestReplicationKillRS.loadTableAndKillRS:84
 Waited too much time for queueFailover replication. Waited 57103ms. {code}
This should be caused by a bug.

If enabled {_}hbase.separate.oldlogdir.by.regionserver{_}, old wals will be 
moved into different dir by regionserver name like root/oldWALs/server1/wal1 . 
For recovered replication,  can't convert wal path(like root/oldWALs/wal1) into 
such paths, and throws FileNotFoundException.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to