Also, it looks like the tweaks to worker to reduce the time spent in mutex-protected code may have worked. In this test case, the mutex lock/wakeup calls aren't as prominent as they used to be.
syscall seconds calls errors read 21.22 3611 902 open 4.51 905 close 8.32 1802 brk .07 6 stat 3.96 905 lseek 5.15 903 fcntl 9.70 1815 lwp_park 2.72 1042 lwp_unpark 15.08 970 poll 5.45 900 writev 4.14 902 lwp_mutex_wakeup .46 83 lwp_mutex_lock .70 71 fstat64 11.69 1815 accept 15.08 908 shutdown 4.26 902 getsockname 3.96 908 getsockopt 11.17 1815 setsockopt 6.04 907 ------- ------ ---- sys totals: 133.68 21170 902 usr time: 3.93 elapsed: 49.28