Hernan Gelaf-Romer created HBASE-29149:
------------------------------------------

             Summary: WAL files can be archived during incremental backup 
process
                 Key: HBASE-29149
                 URL: https://issues.apache.org/jira/browse/HBASE-29149
             Project: HBase
          Issue Type: Bug
            Reporter: Hernan Gelaf-Romer
            Assignee: Hernan Gelaf-Romer


At my job, we've run into FNFE issues when WAL files are archived as they are 
being loaded to be converted into HFiles. When looking at the failure logs, we 
can see that the WAL was loaded just after the archive had occurred 
server-side. 

 
{quote}2025-02-24 17:10:34.333  [pool-124-thread-1] ERROR 
o.a.h.h.b.impl.TableBackupClient - Unexpected exception in incremental-backup: 
incremental copy backup_1740417014671File 
hdfs://nestor-hb2-a-qa:8020/hbase/WALs/na1-purple-dizzy-antelope.iad03.hubinternal.net,60020,1739996267893/na1-purple-dizzy-antelope.iad03.hubinternal.net%2C60020%2C1739996267893.1740412909549
 does not exist.
java.io.FileNotFoundException: File 
hdfs://nestor-hb2-a-qa:8020/hbase/WALs/na1-purple-dizzy-antelope.iad03.hubinternal.net,60020,1739996267893/na1-purple-dizzy-antelope.iad03.hubinternal.net%2C60020%2C1739996267893.1740412909549
 does not exist.
{quote}
 
{quote}2025-02-24 17:10:17.787 Archiving 
hdfs://nestor-hb2-a-qa:8020/hbase/WALs/na1-purple-dizzy-antelope.iad03.hubinternal.net,60020,1739996267893/na1-purple-dizzy-antelope.iad03.hubinternal.net%2C60020%2C1739996267893.1740412909549
 to 
hdfs://nestor-hb2-a-qa:8020/hbase/oldWALs/na1-purple-dizzy-antelope.iad03.hubinternal.net%2C60020%2C1739996267893.1740412909549
{quote}
 

We already handle a similar situation when loading bulkloads, and add a re-try 
mechanism that checks the archive directory. We should probably do a similar 
thing here



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to