zuston commented on issue #234: URL: https://github.com/apache/incubator-uniffle/issues/234#issuecomment-1254448844
Got your thought.

> How do the yarn resourcemanager to process this problem?

In an HA ResourceManager setup there is no such problem, because ZooKeeper handles failover to the standby RM. So let's consider a single ResourceManager or the Hadoop NameNode. As far as I know, on startup the NameNode enters safe mode and stays there until it has received enough block reports from the DataNodes. Refer to: https://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-hdfs/HdfsDesign.html

> I suggest that we should pend the requests instead of rejection when we start the coordinator.

Pending will slow down the apps. I think we should make the request fall back to another coordinator. Maybe waiting for one heartbeat interval at startup is a good tradeoff.

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
