Chackaravarthy created HDFS-10365: ------------------------------------- Summary: FullBlockReports retransmission delays NN startup time in large cluster. Key: HDFS-10365 URL: https://issues.apache.org/jira/browse/HDFS-10365 Project: Hadoop HDFS Issue Type: Bug Components: hdfs Affects Versions: 2.6.0 Environment: version - hadoop-2.6.0 DN - 1200 nodes Reporter: Chackaravarthy Priority: Critical
Whenever NN is restarted, it takes huge time for NN to come back to stable state. i.e. Last contact time remains more than 1 or 2 mins continuously for around 3 to 4 hours. This is mainly because most of the DN's getting timeout (60s) in blockReport (FBR) rpc call and then it keep sending FBR again. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: hdfs-dev-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-dev-h...@hadoop.apache.org