[ https://issues.apache.org/jira/browse/HDFS-9574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15088438#comment-15088438 ]
Kihwal Lee commented on HDFS-9574: ---------------------------------- The new patch addresses the review comments. - All relevant DataTransfer methods are now calling {{checkAccess()}} and the registration is checked from there. - The elapsed time is now tracked using {{StopWatch}}. - {{getReplicaVisibleLength()}} now throws {{RetriableException}}. - {{DFSInpuStream}} retries those nodes that threw {{RetriableException}} on {{getReplicaVisibleLength()}}, with a limit. The client read timeout is used for the retry timeout. - The test case was expanded to cover the {{getReplicaVisibleLength()}} case. > Reduce client failures during datanode restart > ---------------------------------------------- > > Key: HDFS-9574 > URL: https://issues.apache.org/jira/browse/HDFS-9574 > Project: Hadoop HDFS > Issue Type: Bug > Reporter: Kihwal Lee > Assignee: Kihwal Lee > Attachments: HDFS-9574.patch, HDFS-9574.v2.patch, HDFS-9574.v3.patch > > > Since DataXceiverServer is initialized before BP is fully up, client requests > will fail until the datanode registers. -- This message was sent by Atlassian JIRA (v6.3.4#6332)