Boris Bondarenko created HDFS-16289:
---------------------------------------
Summary: Hadoop HA checkpointer issue
Key: HDFS-16289
URL: https://issues.apache.org/jira/browse/HDFS-16289
Project: Hadoop HDFS
Issue Type: Bug
Components: dfs
Affects Versions: 3.2.2
Reporter: Boris Bondarenko
In HA setup active namenode will reject fsimage sync from one of the two
standby namenodes all the time. This maybe an edge case, in our environment it
primarily affect standby cluster. What we experienced was memory problem on
standby namenodes in the scenario when the standby node was not able to
complete sync cycle for a long time.
It is my understanding that the break out from the loop will only happen when
doCheckpoint call succeeds otherwise it throws an exception and continues.
I can provide more details on my findings with code references if necessary.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]