Hadoop has been used in quite varying cluster sizes (in the range 1-2000), so am strongly in favor of as much automatic configuration as possible.

Doug Cutting wrote:
Raghu Angadi wrote:
Right now Namenode does not know about the cluster size before starting IPC server.

Sounds like perhaps we should make the handler count, queue size, etc. dynamically adjustable, e.g., by adding Server methods for setHandlerCount(), setQueueSize(), etc. There's been talk of trying to automatically adjust these within Server.java, based on load, and that would be better yet, but short of that, we might adjust them heuristically based on cluster size.

The urgent thing, since we expect the best settings for large clusters to change, is to make it so that folks don't need to adjust these manually, even if the automation is an ill-understood heuristic. I think we can easily get some workable heuristics into 0.15, but we might not get be able to implement async responses or figure out how to adjust it automatically in Server.java or whatever in that timeframe. Perhaps we should just change the defaults to be big enough for 2000 nodes, but that seems like too big of a hammer.

Doug

Reply via email to