Sahil Takiar created IMPALA-9300:
------------------------------------

             Summary: Add a limit on the number of nodes that can be blacklisted per query
                 Key: IMPALA-9300
                 URL: https://issues.apache.org/jira/browse/IMPALA-9300
             Project: IMPALA
          Issue Type: Sub-task
          Components: Backend
            Reporter: Sahil Takiar


We currently have no limit on the number of nodes that can be blacklisted if an 
Exec() RPC fails.

For data transfer (TransmitData()) RPC failures, we blacklist at most one node 
per status update (so typically one node per query).

We should add a global limit on the number of nodes a query can blacklist, so 
that a single query cannot blacklist a large part of the cluster. This would 
guard against intermittent, cluster-wide hardware issues that might only last a 
few seconds. Ideally, the maximum number of blacklist-able nodes would be a 
function of the cluster size (e.g. a query cannot blacklist more than a third 
of the nodes in the cluster).
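
As a rough illustration of the proposed limit, here is a minimal C++ sketch. 
It is not taken from the Impala code base: the class name QueryBlacklistBudget, 
its methods, the default ratio of one third, and the choice to always allow at 
least one node on a non-empty cluster are all assumptions made for the example.

```cpp
#include <algorithm>
#include <cstddef>
#include <string>
#include <unordered_set>

// Hypothetical helper (not an Impala class): tracks the nodes a single query
// has blacklisted and refuses new entries once a cluster-size-based cap is hit.
class QueryBlacklistBudget {
 public:
  // The default ratio of 1/3 mirrors the example in the issue text.
  // Integer numerator/denominator avoid floating-point rounding surprises.
  explicit QueryBlacklistBudget(std::size_t cluster_size,
                                std::size_t numerator = 1,
                                std::size_t denominator = 3)
      : limit_(ComputeLimit(cluster_size, numerator, denominator)) {}

  // Returns the maximum number of nodes one query may blacklist.
  // Assumption: a non-empty cluster always allows at least one node,
  // so small clusters can still blacklist a faulty node.
  static std::size_t ComputeLimit(std::size_t cluster_size,
                                  std::size_t numerator = 1,
                                  std::size_t denominator = 3) {
    if (cluster_size == 0 || denominator == 0) return 0;
    std::size_t limit = cluster_size * numerator / denominator;
    limit = std::min(limit, cluster_size);  // never exceed the cluster size
    return std::max<std::size_t>(limit, 1);
  }

  // Returns true if the node is (or already was) blacklisted by this query,
  // false if the per-query budget is exhausted and the request is rejected.
  bool TryBlacklist(const std::string& node) {
    if (blacklisted_.count(node) > 0) return true;  // already counted
    if (blacklisted_.size() >= limit_) return false;
    blacklisted_.insert(node);
    return true;
  }

  std::size_t limit() const { return limit_; }
  std::size_t blacklisted_count() const { return blacklisted_.size(); }

 private:
  std::size_t limit_;
  std::unordered_set<std::string> blacklisted_;
};
```

With this sketch, a query on a 6-node cluster could blacklist two distinct 
nodes, and a third request would be rejected. Whether the ratio should be a 
fixed constant or a startup flag is the open question noted below.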

TBD whether the value should be configurable.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
