[ https://issues.apache.org/jira/browse/HBASE-21490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16690535#comment-16690535 ]
Duo Zhang commented on HBASE-21490:
-----------------------------------

OK, I found this:
{noformat}
2018-11-16,21:06:04,667 INFO [master/c4-hadoop-tst-ct05:19100:becomeActiveMaster] org.apache.hadoop.hbase.procedure2.store.wal.WALProcedureStore: Remove the oldest log hdfs://c4tst-xiaomi/hbase/c4tst-sync1/MasterProcWALs/pv2-00000000000000000185.log
2018-11-16,21:06:04,667 INFO [master/c4-hadoop-tst-ct05:19100:becomeActiveMaster] org.apache.hadoop.hbase.procedure2.store.wal.ProcedureWALFile: Archiving hdfs://c4tst-xiaomi/hbase/c4tst-sync1/MasterProcWALs/pv2-00000000000000000185.log to hdfs://c4tst-xiaomi/hbase/c4tst-sync1/oldWALs/pv2-00000000000000000185.log
2018-11-16,21:06:04,672 DEBUG [master/c4-hadoop-tst-ct05:19100:becomeActiveMaster] org.apache.hadoop.hbase.procedure2.store.wal.WALProcedureStore: Removed log=hdfs://c4tst-xiaomi/hbase/c4tst-sync1/MasterProcWALs/pv2-00000000000000000185.log, activeLogs=[hdfs://c4tst-xiaomi/hbase/c4tst-sync1/MasterProcWALs/pv2-00000000000000000186.log, hdfs://c4tst-xiaomi/hbase/c4tst-sync1/MasterProcWALs/pv2-00000000000000000187.log]
{noformat}
I think there may be something wrong when building the holdingCleanupTracker in some special case. Let me dig.

> WALProcedure may remove proc wal files still with active procedures
> -------------------------------------------------------------------
>
>                 Key: HBASE-21490
>                 URL: https://issues.apache.org/jira/browse/HBASE-21490
>             Project: HBase
>          Issue Type: Sub-task
>          Components: proc-v2
>            Reporter: Duo Zhang
>            Priority: Major
>
> It happens for me several times. After master restart, all the procedures are
> gone.
> And the proc wal files were deleted before restarting, I see this in the
> master's log
> {noformat}
> 2018-11-16,20:57:40,177 INFO [WALProcedureStoreSyncThread]
> org.apache.hadoop.hbase.procedure2.store.wal.WALProcedureStore: Remove all
> state logs with ID less than 184, since all the active procedures are in the
> latest log
> 2018-11-16,20:57:40,177 INFO [WALProcedureStoreSyncThread]
> org.apache.hadoop.hbase.procedure2.store.wal.ProcedureWALFile: Archiving
> hdfs://c4tst-xiaomi/hbase/c4tst-sync1/MasterProcWALs/pv2-00000000000000000184.log
> to hdfs://c4tst-xiaomi/hbase/c4tst-sync1/oldWALs/pv2-00000000000000000184.log
> {noformat}
> Let me dig...

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)