TaskManager single thread bottleneck

Patricia Shanahan Sun, 18 Jul 2010 12:18:13 -0700

There is a serious bottleneck in the existing TaskManager implementation when, under load, the typical task has at least one runAfter dependency on an older task at the time it is added.

I plan to remove this bottleneck, but I think understanding it may help in some decisions that need to be made about the new implementation, so I'm going to describe it.

The constructor comment block says: "A new thread is created if the total number of runnable tasks (both active and pending) exceeds the number of threads times the loadFactor, and the maximum number of threads has not been reached."

Achieving that requires a check whenever the number of runnable tasks increases. Similarly, a thread that is waiting for a task to run needs to be notified when a runnable task becomes available.
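To make the quoted rule concrete, here is a minimal sketch of the check as I read the comment. The class, method, and parameter names are hypothetical and are not taken from the TaskManager source; the only thing it relies on is the wording of the constructor comment.

```java
// Hypothetical helper, not the TaskManager code: it restates the rule
// from the constructor comment as a single predicate.
class ThreadGrowthCheck {
    /**
     * Returns true when the number of runnable tasks (active plus pending)
     * exceeds threads * loadFactor and the thread limit has not been reached.
     */
    static boolean needsNewThread(int runnableTasks, int threads,
                                  int maxThreads, float loadFactor) {
        return threads < maxThreads && runnableTasks > threads * loadFactor;
    }
}
```

The point is that this predicate can only fire if it is evaluated at every place where runnableTasks can go up, not just on add.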


There are two ways a Task x can become runnable:

1. At the time x is added, it does not need to run after any older task.

2. Another task y has just completed, x did need to run after y, and x does not need to run after any other task that still exists.

Condition 1 is being handled correctly, with both a check for increasing the number of tasks and a notify on add of an immediately runnable task.


Condition 2 is completely ignored.
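As an illustration of the two conditions, here is a small sketch that tracks, for each task, how many older tasks it is still waiting on. It is not the existing TaskManager code, and it is not meant as the new design either. The class name, the String task identifiers, and the explicit runAfter list are all simplifications invented for this sketch, and task names are assumed to be unique.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical bookkeeping for the two ways a task can become runnable.
class RunnableTracker {
    // For each existing task, the number of older tasks it still waits on.
    private final Map<String, Integer> waitingOn = new HashMap<>();
    // For each existing task, the newer tasks that must run after it.
    private final Map<String, List<String>> dependents = new HashMap<>();
    private int runnable = 0;

    /** Condition 1: returns true if the task is runnable at the time it is added. */
    synchronized boolean add(String task, List<String> runAfter) {
        int count = 0;
        for (String older : runAfter) {
            List<String> list = dependents.get(older);
            if (list != null) {          // the older task still exists
                list.add(task);
                count++;
            }
        }
        dependents.put(task, new ArrayList<>());
        waitingOn.put(task, count);
        if (count == 0) {
            runnable++;                  // place for the thread check and notify
            return true;
        }
        return false;
    }

    /** Condition 2: returns the tasks that became runnable because task completed. */
    synchronized List<String> complete(String task) {
        List<String> released = new ArrayList<>();
        List<String> deps = dependents.remove(task);
        if (deps == null) {
            return released;             // unknown or already completed task
        }
        waitingOn.remove(task);
        runnable--;                      // the completed task no longer counts
        for (String d : deps) {
            int left = waitingOn.get(d) - 1;
            waitingOn.put(d, left);
            if (left == 0) {
                released.add(d);
                runnable++;              // same check and notify belong here
            }
        }
        return released;
    }

    synchronized int runnableCount() {
        return runnable;
    }
}
```

With y added first and x added to run after y, add("x", ...) returns false, and complete("y") returns a list containing x. That second return value is the event that currently triggers neither a thread check nor a notify.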

At least one thread will always be created, because the first task to be added must be immediately runnable. Even if a second thread gets created because a task happens to be immediately runnable, there is a danger of a thread timing out waiting for work, because it will not get notified until another immediately runnable task is added. For a workload with dependencies, the single threading will tend to increase the queue length, making added tasks less likely to be immediately runnable because they have more tasks to conflict with.

I believe the problem is a consequence of the data structure design, which makes asking "Did this task completion make any other task or tasks runnable?" very expensive. The design I'm working on will make answering this question a very fast side effect of work that needs to be done anyway.

Empirically, I have tested a workload with dependencies that should alternate between 5 parallel tasks and a single task, a mean of three tasks at a time, but it runs no faster when permitted up to 100 threads than when limited to one thread. I measured on a computer with two Xeon processors, each dual core and dual threaded.


Patricia
