[ https://issues.apache.org/jira/browse/HDFS-16557?focusedWorklogId=780186&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-780186 ]
ASF GitHub Bot logged work on HDFS-16557: ----------------------------------------- Author: ASF GitHub Bot Created on: 10/Jun/22 03:14 Start Date: 10/Jun/22 03:14 Worklog Time Spent: 10m Work Description: tomscut commented on PR #4219: URL: https://github.com/apache/hadoop/pull/4219#issuecomment-1151882369 > OK, back to BootstrapStandby GAP. Form this stack information, I got that it try to get streams from 1049842441 to 1050196644. But cannot get the txid 1049842441 from the result streams. So I think we should to trace the root cause, why can't we find txid 1049842441 in the return result of `selectInputStreams(streams, 1049842441, true, true)`? > > Please correct me if anything is wrong. Please refer to the discussion with @xkrogen above. The root cause is the` if` condition (`if(next == HdfsServerConstants.INVALID_TXID)`) that does not enter properly. Issue Time Tracking ------------------- Worklog Id: (was: 780186) Time Spent: 3.5h (was: 3h 20m) > BootstrapStandby failed because of checking gap for inprogress > EditLogInputStream > --------------------------------------------------------------------------------- > > Key: HDFS-16557 > URL: https://issues.apache.org/jira/browse/HDFS-16557 > Project: Hadoop HDFS > Issue Type: Bug > Reporter: Tao Li > Assignee: Tao Li > Priority: Major > Labels: pull-request-available > Attachments: image-2022-04-22-17-17-14-577.png, > image-2022-04-22-17-17-14-618.png, image-2022-04-22-17-17-23-113.png, > image-2022-04-22-17-17-32-487.png > > Time Spent: 3.5h > Remaining Estimate: 0h > > The lastTxId of an inprogress EditLogInputStream lastTxId isn't necessarily > HdfsServerConstants.INVALID_TXID. We can determine its status directly by > EditLogInputStream#isInProgress. > We introduced [SBN READ], and set > {color:#ff0000}{{dfs.ha.tail-edits.in-progress=true}}{color}. Then > bootstrapStandby, the EditLogInputStream of inProgress is misjudged, > resulting in a gap check failure, which causes bootstrapStandby to fail. > hdfs namenode -bootstrapStandby > !image-2022-04-22-17-17-32-487.png|width=766,height=161! > !image-2022-04-22-17-17-14-577.png|width=598,height=187! -- This message was sent by Atlassian Jira (v8.20.7#820007) --------------------------------------------------------------------- To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org