[
https://issues.apache.org/jira/browse/HBASE-29797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18048231#comment-18048231
]
Duo Zhang commented on HBASE-29797:
-----------------------------------
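Checked the NameNode log. It seems all the sequence id files were written by the
same client, DFSClient_NONMAPREDUCE_836297033_1; only the recovered edits .temp
file was closed by a different one.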
{noformat}
hadoop-zhangduo-namenode-meta02.log.3:2025-12-29 10:58:50,407 INFO org.apache.hadoop.hdfs.StateChange: DIR* completeFile: /hbase/data/default/IntegrationTestBigLinkedList/aea23dbdd1f955e04dab89e5327ae553/recovered.edits/1212213.seqid is closed by DFSClient_NONMAPREDUCE_836297033_1
hadoop-zhangduo-namenode-meta02.log.3:2025-12-29 10:59:02,091 INFO org.apache.hadoop.hdfs.StateChange: DIR* completeFile: /hbase/data/default/IntegrationTestBigLinkedList/aea23dbdd1f955e04dab89e5327ae553/recovered.edits/1212217.seqid is closed by DFSClient_NONMAPREDUCE_836297033_1
hadoop-zhangduo-namenode-meta02.log.3:2025-12-29 11:00:32,047 INFO org.apache.hadoop.hdfs.StateChange: DIR* completeFile: /hbase/data/default/IntegrationTestBigLinkedList/aea23dbdd1f955e04dab89e5327ae553/recovered.edits/1212221.seqid is closed by DFSClient_NONMAPREDUCE_836297033_1
hadoop-zhangduo-namenode-meta02.log.3:2025-12-29 11:03:06,849 INFO org.apache.hadoop.hdfs.StateChange: BLOCK* allocate blk_1081620879_7887967, replicas=192.168.0.64:9866, 192.168.0.63:9866, 192.168.0.65:9866 for /hbase/data/default/IntegrationTestBigLinkedList/aea23dbdd1f955e04dab89e5327ae553/recovered.edits/0000000000001212216-data03%2C16020%2C1766977086762.1766977091448-data04%2C16020.temp
hadoop-zhangduo-namenode-meta02.log.3:2025-12-29 11:03:06,871 INFO org.apache.hadoop.hdfs.StateChange: DIR* completeFile: /hbase/data/default/IntegrationTestBigLinkedList/aea23dbdd1f955e04dab89e5327ae553/recovered.edits/0000000000001212216-data03%2C16020%2C1766977086762.1766977091448-data04%2C16020.temp is closed by DFSClient_NONMAPREDUCE_-57215219_1
hadoop-zhangduo-namenode-meta02.log.3:2025-12-29 11:03:28,431 INFO org.apache.hadoop.hdfs.StateChange: DIR* completeFile: /hbase/data/default/IntegrationTestBigLinkedList/aea23dbdd1f955e04dab89e5327ae553/recovered.edits/1212225.seqid is closed by DFSClient_NONMAPREDUCE_836297033_1
{noformat}
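For anyone who wants to repeat this check on their own NameNode log, here is a minimal sketch that groups the completeFile entries by the DFSClient that closed each file. It assumes the log line format shown in the excerpt above; `files_by_client` and `seqid_writers` are hypothetical helper names for illustration, not part of Hadoop or HBase, and the sample lines are copied from the excerpt.

```python
import re
from collections import defaultdict

# Three entries copied from the NameNode log excerpt above.
SAMPLE_LOG = """\
2025-12-29 10:58:50,407 INFO org.apache.hadoop.hdfs.StateChange: DIR* completeFile: /hbase/data/default/IntegrationTestBigLinkedList/aea23dbdd1f955e04dab89e5327ae553/recovered.edits/1212213.seqid is closed by DFSClient_NONMAPREDUCE_836297033_1
2025-12-29 11:03:06,871 INFO org.apache.hadoop.hdfs.StateChange: DIR* completeFile: /hbase/data/default/IntegrationTestBigLinkedList/aea23dbdd1f955e04dab89e5327ae553/recovered.edits/0000000000001212216-data03%2C16020%2C1766977086762.1766977091448-data04%2C16020.temp is closed by DFSClient_NONMAPREDUCE_-57215219_1
2025-12-29 11:03:28,431 INFO org.apache.hadoop.hdfs.StateChange: DIR* completeFile: /hbase/data/default/IntegrationTestBigLinkedList/aea23dbdd1f955e04dab89e5327ae553/recovered.edits/1212225.seqid is closed by DFSClient_NONMAPREDUCE_836297033_1
"""

# Matches the "completeFile" state-change message: path, then the closing client.
COMPLETE_FILE = re.compile(
    r"DIR\* completeFile: (?P<path>\S+) is closed by (?P<client>\S+)"
)


def files_by_client(log_text):
    """Map each DFSClient name to the file names it closed, in log order."""
    # Collapse all whitespace so entries wrapped across lines still match.
    normalized = " ".join(log_text.split())
    closed = defaultdict(list)
    for match in COMPLETE_FILE.finditer(normalized):
        file_name = match.group("path").rsplit("/", 1)[-1]
        closed[match.group("client")].append(file_name)
    return dict(closed)


def seqid_writers(log_text):
    """Return the set of clients that closed at least one .seqid file."""
    return {
        client
        for client, names in files_by_client(log_text).items()
        if any(name.endswith(".seqid") for name in names)
    }


if __name__ == "__main__":
    for client, names in files_by_client(SAMPLE_LOG).items():
        print(client, names)
    print("seqid writers:", seqid_writers(SAMPLE_LOG))
```

If `seqid_writers` returns more than one client for the same region directory, that would point at two writers touching the sequence id files; for the excerpt above it returns a single client.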
> RegionServer aborted because of invalid max sequence id
> -------------------------------------------------------
>
> Key: HBASE-29797
> URL: https://issues.apache.org/jira/browse/HBASE-29797
> Project: HBase
> Issue Type: Bug
> Components: Region Assignment
> Reporter: Duo Zhang
> Priority: Critical
>
> {noformat}
> 2025-12-29T11:03:32,429 WARN [RS_CLOSE_REGION-regionserver/data02:16020-0] handler.UnassignRegionHandler: Fatal error occurred while closing region 8d60369be1061570a2f6e47a1af7a797, aborting...
> java.io.IOException: The new max sequence id 1212630 is less than the old max sequence id 1212631
>     at org.apache.hadoop.hbase.wal.WALSplitUtil.writeRegionSequenceIdFile(WALSplitUtil.java:402)
>     at org.apache.hadoop.hbase.regionserver.HRegion.writeRegionCloseMarker(HRegion.java:1290)
>     at org.apache.hadoop.hbase.regionserver.HRegion.doClose(HRegion.java:1950)
>     at org.apache.hadoop.hbase.regionserver.HRegion.close(HRegion.java:1675)
>     at org.apache.hadoop.hbase.regionserver.HRegion.close(HRegion.java:1630)
>     at org.apache.hadoop.hbase.regionserver.HRegion.close(HRegion.java:1613)
>     at org.apache.hadoop.hbase.regionserver.handler.UnassignRegionHandler.process(UnassignRegionHandler.java:139)
>     at org.apache.hadoop.hbase.executor.EventHandler.run(EventHandler.java:104)
>     at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136)
>     at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635)
>     at java.base/java.lang.Thread.run(Thread.java:840)
> 2025-12-29T11:03:32,433 ERROR [RS_CLOSE_REGION-regionserver/data02:16020-0] regionserver.HRegionServer: ***** ABORTING region server data02,16020,1766977119966: Failed to close region 8d60369be1061570a2f6e47a1af7a797 and can not recover *****
> java.io.IOException: The new max sequence id 1212630 is less than the old max sequence id 1212631
>     at org.apache.hadoop.hbase.wal.WALSplitUtil.writeRegionSequenceIdFile(WALSplitUtil.java:402)
>     at org.apache.hadoop.hbase.regionserver.HRegion.writeRegionCloseMarker(HRegion.java:1290)
>     at org.apache.hadoop.hbase.regionserver.HRegion.doClose(HRegion.java:1950)
>     at org.apache.hadoop.hbase.regionserver.HRegion.close(HRegion.java:1675)
>     at org.apache.hadoop.hbase.regionserver.HRegion.close(HRegion.java:1630)
>     at org.apache.hadoop.hbase.regionserver.HRegion.close(HRegion.java:1613)
>     at org.apache.hadoop.hbase.regionserver.handler.UnassignRegionHandler.process(UnassignRegionHandler.java:139)
>     at org.apache.hadoop.hbase.executor.EventHandler.run(EventHandler.java:104)
>     at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136)
>     at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635)
>     at java.base/java.lang.Thread.run(Thread.java:840)
> {noformat}