Sahil Takiar created IMPALA-10139:
-------------------------------------

             Summary: Slow RPC logs can be misleading
                 Key: IMPALA-10139
                 URL: https://issues.apache.org/jira/browse/IMPALA-10139
             Project: IMPALA
          Issue Type: Improvement
            Reporter: Sahil Takiar


The slow RPC logs added in IMPALA-9128 are based on the total time taken to 
successfully complete a RPC. The issue is that there are many reasons why an 
RPC might take a long time to complete. An RPC is considered complete only when 
the receiver has processed that RPC. 

The problem is that due to client-driven back-pressure mechanism, it is 
entirely possible that the receiver RPC does not process a receiver RPC because 
{{KrpcDataStreamRecvr::SenderQueue::GetBatch}} just hasn't been called yet 
(indirectly called by {{ExchangeNode::GetNext}}).

This can lead to flood of slow RPC logs, even though the RPCs might not 
actually be slow themselves. What is worse is that the because of the 
back-pressure mechanism, slowness from the client (e.g. Hue users) will 
propagate across all nodes involved in the query.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org

Reply via email to