Hi Luca,
some fragmentary answers:
On 01.02.2016 at 10:17, Luca Toscano wrote:
...
- AsyncRequestWorkerFactor is used to regulate the number of requests
that a single process/thread block can handle, calculating the value
periodically using the available idle threads/workers. If the workers
are maxed out, keep-alive sockets/connections are closed to free some
space.
...
If my understanding is correct (I doubt it, but let's assume so), then I
have the following questions:
- Would it be worth adding more info to the "how it works" section? A
first read may lead the user to think that the listening thread is the
one doing the actual work rather than the workers, leaving them a bit
puzzled when they reach the AsyncRequestWorkerFactor section.
...
- The summary talks about "supporting threads", and given that
AsyncRequestWorkerFactor is added to ThreadsPerChild, it raises the
question of how many of them are created at startup. Or is it a way of
saying: the number of threads for each process is ThreadsPerChild, but
since they now also perform small bursts of work (like keep-alive
housekeeping and flushing data to clients), the total number of
connections allowed should be higher to make room for all these
connection/socket states?
The number of worker threads per process is constant over the lifetime
of the process, from its creation to its end. It equals ThreadsPerChild,
and that is the sole purpose of this configuration item.
AsyncRequestWorkerFactor comes into the picture because, in contrast to
the traditional MPMs prefork, worker and winnt, event does not have a
1:1 relation between worker threads and client connections. Event is
designed to scale better than those in terms of connections: it should
handle a lot more connections with the same number of threads. How can
this work? It works by freeing the worker threads from connection
handling in certain situations where a thread would otherwise simply
wait, either until it can write back the next part of the response or
until the next request on the same connection arrives. But this design
results in some over-commitment. What happens if, by bad luck, we accept
lots of mostly idle connections over time, and then suddenly many of
them start activity that needs a worker thread for each of them? Then it
might happen that we do not have enough worker threads in the process
that accepted the connections. We don't have a way to move such
connections to another child process: once a connection is accepted, it
stays with the same process.
To reduce the likelihood of such a shortage of free worker threads, the
number of connections we accept per process is limited. The limit is not
a fixed number or some fixed multiple of ThreadsPerChild; instead it is
calculated from the number of idle worker threads. If that number gets
small during run time, the number of additional connections we accept
decreases, until at some point the process won't accept any new
connections at all. This situation can be monitored via mod_status.
Let's have a look at the formula. The docs say:
ThreadsPerChild + (AsyncRequestWorkerFactor * number of idle workers)
is the number of new connections a process will still accept (not
counting connections in closing state). Ignoring the "closing state"
part for a moment, we can write:
max_connections = ThreadsPerChild + (AsyncRequestWorkerFactor *
idle_workers)
Let's replace ThreadsPerChild by (idle_workers + busy_workers):
max_connections = (idle_workers + busy_workers) +
(AsyncRequestWorkerFactor * idle_workers)
= busy_workers + (AsyncRequestWorkerFactor + 1) * idle_workers
Now splitting connections in busy and idle ones and defining a busy
connection as one needing a worker thread so that the number of busy
connections equals the number of busy workers, we get:
max_idle_connections + busy_workers = busy_workers +
(AsyncRequestWorkerFactor + 1) * idle_workers
and thus
max_idle_connections = (AsyncRequestWorkerFactor + 1) * idle_workers
So although we only have idle_workers threads left, we accept up to
(AsyncRequestWorkerFactor + 1) * idle_workers idle connections. Since
AsyncRequestWorkerFactor has the default value 2, this means that by
default we accept up to three times as many idle connections as we have
idle workers. That's a handy formulation of our type of over-commitment.
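To make the arithmetic concrete, here is a small Python sketch of the
accept limit, using the default values ThreadsPerChild 25 and
AsyncRequestWorkerFactor 2 (the numbers are purely illustrative, not
taken from a real server):

```python
# Accept limit of one event child process, per the documented formula
# (connections in closing state ignored). Assumed defaults:
THREADS_PER_CHILD = 25
ASYNC_REQUEST_WORKER_FACTOR = 2

def max_connections(idle_workers):
    # Number of connections the process will still accept.
    return THREADS_PER_CHILD + ASYNC_REQUEST_WORKER_FACTOR * idle_workers

def max_idle_connections(idle_workers):
    # As derived above: (AsyncRequestWorkerFactor + 1) * idle_workers.
    return (ASYNC_REQUEST_WORKER_FACTOR + 1) * idle_workers

print(max_connections(25))       # all workers idle -> 75
print(max_idle_connections(25))  # 3 * 25 = 75 idle connections
print(max_connections(0))        # workers maxed out -> 25
```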
Of course the formula is not always honored. If many connections turn
busy, then we might suddenly have more (old) idle connections than the
formula allows. E.g. if for a moment all workers are idle, we would
allow 3*ThreadsPerChild (idle) connections. Now assume ThreadsPerChild
of them become busy: we would then allow no idle connections, because no
idle workers are left, but we would still have 2*ThreadsPerChild old
idle connections. The formula only tells us how many we are willing to
accept, unless we are already over the limit because too many turned
busy.
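That scenario can be stepped through numerically; a sketch, again
assuming the default values ThreadsPerChild 25 and
AsyncRequestWorkerFactor 2:

```python
# Over-commitment scenario from the paragraph above, with assumed
# defaults ThreadsPerChild = 25 and AsyncRequestWorkerFactor = 2.
T, F = 25, 2

# Moment 1: all workers are idle, so up to (F + 1) * T idle
# connections may be accepted.
idle_workers = T
accepted_idle = (F + 1) * idle_workers        # 75

# Moment 2: T of those connections turn busy, consuming every worker.
busy_workers = T
idle_workers = 0
remaining_idle = accepted_idle - busy_workers  # 50 old idle connections

# The formula now permits zero idle connections ...
allowed_idle = (F + 1) * idle_workers          # 0

# ... so the process is over its limit by 2 * T idle connections.
print(remaining_idle - allowed_idle)           # 50
```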
I have not done the math for the case where we also bring "connections
in closing state" into the game.
There's no easy way to tune AsyncRequestWorkerFactor, since it depends
on the activity pattern of the connections, which is hard to estimate.
It is best to monitor the status via mod_status to get an idea of
whether one can save more threads with a higher value (more
over-commitment), or whether one needs to stay on the conservative side.
Sometimes it is better to work with a higher ThreadsPerChild, so that
the activity patterns averaged over more connections per process give a
better average behavior.
HTH a bit w.r.t. AsyncRequestWorkerFactor (and also hoping that I didn't
make mistakes in explaining).
Regards,
Rainer