He Tianyi created HDFS-9665:
-------------------------------

             Summary: Clients are easily affected by standby namenode
                 Key: HDFS-9665
                 URL: https://issues.apache.org/jira/browse/HDFS-9665
             Project: Hadoop HDFS
          Issue Type: Wish
          Components: hdfs-client, namenode
    Affects Versions: 2.6.0
            Reporter: He Tianyi
            Assignee: He Tianyi
            Priority: Minor


In my case, while the standby NameNode is restarting, there is a chance that a 
{{hadoop fs}} command hangs until either the IPC timeout is reached or a 
{{StandbyException}} is received, and only then fails over to the active NameNode.

Normally, the duration of this hang is 
{{min(timeout_configuration, rpc_queue_time)}}. However, during this period the 
RPC queue of the standby NameNode is usually full of block reports, so client 
requests cannot be processed quickly.
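
To make the formula above concrete, here is a rough sketch of how the hang could be estimated. It assumes a simplified model (a single FIFO queue drained at a constant rate), and the function name, parameters and numbers are illustrative only; none of them come from Hadoop.

```python
def hang_seconds(ipc_timeout_s, queued_block_reports, seconds_per_report):
    """Estimate how long a client call waits on the standby NameNode.

    Assumed model: the client request sits behind `queued_block_reports`
    entries in a FIFO RPC queue, each taking `seconds_per_report` to process.
    """
    rpc_queue_time_s = queued_block_reports * seconds_per_report
    # The client gives up at the IPC timeout, or gets its StandbyException
    # once the request reaches the head of the queue, whichever comes first.
    return min(ipc_timeout_s, rpc_queue_time_s)


# Illustrative numbers only: a short queue versus a queue full of block reports.
print(hang_seconds(ipc_timeout_s=60, queued_block_reports=20, seconds_per_report=0.5))
print(hang_seconds(ipc_timeout_s=60, queued_block_reports=500, seconds_per_report=0.5))
```

With a long enough queue the result is capped by the IPC timeout, which is the case described above.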

I would like to avoid this in one of the following ways:
a) distinguish priorities in the RPC queue (may cause starvation);
b) speculate on the first request: send it to both NameNodes and take the 
valid response;
c) make the client aware of the HA state (probably by querying ZooKeeper, which 
may cause performance issues).
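
Option b) could look roughly like the sketch below. It is not the HDFS client API: {{StandbyError}}, {{first_valid_response}} and the two fake NameNode functions are hypothetical stand-ins, and a real change would live in the Java client's failover proxy layer.

```python
import concurrent.futures as cf
import time


class StandbyError(Exception):
    """Hypothetical stand-in for Hadoop's StandbyException."""


def first_valid_response(calls, timeout=None):
    """Run all calls concurrently and return the first result that did not raise."""
    if not calls:
        raise ValueError("need at least one call")
    pool = cf.ThreadPoolExecutor(max_workers=len(calls))
    try:
        futures = [pool.submit(call) for call in calls]
        last_error = None
        for fut in cf.as_completed(futures, timeout=timeout):
            try:
                return fut.result()
            except Exception as err:  # e.g. StandbyError from the standby
                last_error = err
        # Every call failed: surface the last failure to the caller.
        raise last_error
    finally:
        # Do not wait for the slower call; its result is simply discarded.
        pool.shutdown(wait=False)


def active_namenode():
    # Hypothetical fast path: the active NameNode answers right away.
    return "listing from active"


def standby_namenode():
    # Hypothetical slow path: the RPC queue is busy with block reports.
    time.sleep(0.5)
    raise StandbyError("operation not supported in standby state")


print(first_valid_response([standby_namenode, active_namenode]))
```

The client no longer waits for the standby to reject the call, at the cost of one extra RPC per speculated request on a NameNode that is already busy. That is why the proposal limits speculation to the first request.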

Any suggestions or comments?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
