OK, Some info to back this up, please see inline.
On 12/15/2010 01:20 PM, Thomas Hellstrom wrote:
On 12/15/2010 09:23 AM, Marek Olšák wrote:
On Tue, Dec 14, 2010 at 8:10 PM, Thomas Hellstrom
<thellst...@vmware.com <mailto:thellst...@vmware.com>> wrote:
Hmm,
for the uninformed, where do we need to use spinlocks in gallium
and how do
we avoid using them on an UP system?
I plan to use spinlocks to guard very simple code like the macro
remove_from_list, which might be, under some circumstances, called
too often. Entering and leaving a mutex is quite visible in callgrind.
What does UP stand for?
Marek
I've also noted that mutexes show up high on profiling on a number of
occasions, but
Linux mutexes should theoretically be roughly as fast as spinlocks
unless there is high contention. (They shouldn't call into the kernel
unless there is contention). If this is not the case it is worth
figuring out why.
At a first glance, looking at this article,
http://www.alexonlinux.com/pthread-mutex-vs-pthread-spinlock
spinlocks should be superior for things like quick list manipulation,
but If I try that code with the second thread disabled (thus avoiding
lock contention), it turns out that in the uncontended case, spinlocks
are only some 13% faster.
The problem with spinlocks in user-space is that a context switch may
occur in a locked section and that will have other processes trying to
lock spinning until the process holding the lock is scheduled again.
On uni-processor (UP) systems spinlocks are particularly bad, since if
there is contention on a lock, the process will always continue to
spin until a timeout context switch, and thus behave much worse than a
mutex which will context switch immediately.
(This is of course unless pthread spinlocks does some magic I'm not
aware of)
Trying the code with two threads on a uni-processor desktop system,
spinlocks are some 63% slower than mutexes. This is with a kernel tick
of 1000HZ. Probably spinlock stats would be much worse with a slower tick.
In kernel space the situation is different, since spinlocks block
preemption and contention can thus never occur on uniprocessor systems.
There are of course situations where user-space spinlocks are faster
(high contention and low chance of context switch), but that really
means the user needs to know exactly what he's doing and the drawbacks
of using spinlocks.
I don't really think we should add spinlocks to gallium without
a) Figuring out why, in your case, mutexes behave so much worse than
spinlocks. Do you expect high lock contention?
b) adding some kind of warning about the drawbacks of using them.
Finally, trying to repeat the author's findings on my dual-processor
system, Spinlocks are faster (but not nearly as much as the author
states, but that's probably because my CPUs are faster hence lower lock
contention), but the cpu-usage is roughly the same as in the mutex case.
Given this, I would advise strongly against building spinlocks into any
code that might be run on a uni-processor system. Particularly gallium
utility code.
If we want to get rid of unnecessary locking overhead we should probably
fix the code up to avoid taking the locks when not strictly needed.
/Thomas
/Thomas
_______________________________________________
mesa-dev mailing list
mesa-dev@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/mesa-dev