Jason Lowe created HADOOP-8942: ---------------------------------- Summary: Thundering herd of RPCs with large responses leads to OOM Key: HADOOP-8942 URL: https://issues.apache.org/jira/browse/HADOOP-8942 Project: Hadoop Common Issue Type: Bug Components: ipc Affects Versions: 0.23.3 Reporter: Jason Lowe
When a large number of clients are all making calls with large amounts of response data then the IPC server can exhaust memory. See MAPREDUCE-4730 for an example of this. There does not appear to be any flow control between the server's handler threads and the responder thread. If a handler thread cannot write out all of the response data without blocking, it queues up the remainder for the responder thread and goes back to the next call in the call queue. If there are enough clients, this can cause the handler threads to overwhelm the heap by queueing response data faster than it can be processed. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira