[ 
https://issues.apache.org/jira/browse/HDFS-9574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15088438#comment-15088438
 ] 

Kihwal Lee commented on HDFS-9574:
----------------------------------

The new patch addresses the review comments. 
- All relevant DataTransfer methods are now calling {{checkAccess()}} and the 
registration is checked from there.
- The elapsed time is now tracked using {{StopWatch}}.
- {{getReplicaVisibleLength()}} now throws {{RetriableException}}. 
- {{DFSInpuStream}} retries those nodes that threw {{RetriableException}} on 
{{getReplicaVisibleLength()}}, with a limit. The client read timeout is used 
for the retry timeout.
- The test case was expanded to cover the {{getReplicaVisibleLength()}} case.

> Reduce client failures during datanode restart
> ----------------------------------------------
>
>                 Key: HDFS-9574
>                 URL: https://issues.apache.org/jira/browse/HDFS-9574
>             Project: Hadoop HDFS
>          Issue Type: Bug
>            Reporter: Kihwal Lee
>            Assignee: Kihwal Lee
>         Attachments: HDFS-9574.patch, HDFS-9574.v2.patch, HDFS-9574.v3.patch
>
>
> Since DataXceiverServer is initialized before BP is fully up, client requests 
> will fail until the datanode registers.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to