On Sun, 11 Mar 2007, Keith Arner wrote:

> On 3/9/07, Robert Watson <[EMAIL PROTECTED]> wrote:

>> It also introduces parallelism in the in-bound network layer processing
>> path by allowing processing to occur in more than one thread at a time.
>> However, you can see

> From the experimentation I've done, it seems that for TCP loads at least,
> the input path tends to bottleneck on INP_INFO_WLOCK(&tcbinfo),
> essentially reducing the input path to a single thread.

Yes -- right now the in-bound TCP path is essentially serialized by the tcbinfo lock: that lock doesn't just protect the inpcb chains during lookup, it also effectively acts as a reference, preventing the inpcb from being freed during input processing.
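
To make the serialization concrete, here is a much-simplified sketch of the
locking pattern in the input path. This is not the literal tcp_input() code
(the function name and details below are illustrative), just the shape of it:

    /*
     * Much-simplified sketch of TCP input locking: the global tcbinfo
     * write lock is held across both lookup and processing, because it
     * is also what keeps the inpcb from being freed underneath us.
     */
    void
    tcp_input_sketch(struct mbuf *m, struct ip *ip, struct tcphdr *th)
    {
            struct inpcb *inp;

            INP_INFO_WLOCK(&tcbinfo);       /* global: serializes input */
            inp = in_pcblookup_hash(&tcbinfo, ip->ip_src, th->th_sport,
                ip->ip_dst, th->th_dport, 1, m->m_pkthdr.rcvif);
            if (inp != NULL) {
                    INP_LOCK(inp);          /* per-connection lock */
                    /* ... segment processing, tcp_output(), etc ... */
                    INP_UNLOCK(inp);
            }
            INP_INFO_WUNLOCK(&tcbinfo);     /* held this long only to
                                               pin the inpcb in memory */
    }

There are several ways we could start to reduce contention on that lock: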

(1) Introduce true references on inpcbs, which would prevent the inpcb from
    being freed even though the tcbinfo lock has been released.  This would
    require reworking the TCP inpcb close/free model a bit, among other
    things.

(2) Move from using a mutex supporting only exclusive locking on tcbinfo to a
    lock class supporting shared locks, and use shared locks in situations
    where exclusion isn't required (exclusive locking would still be needed
    in cases where the connection may close, requiring the global lists to
    be modified).

(3) Move towards greater granularity of locking for the tcbinfo: instead of
    a single mutex, move to several locks, so that different connections
    processed simultaneously are likely to involve different locks.  Listen
    sockets would need a special case, such as a single lock used to
    serialize the acquisition of multiple chain locks.

I think all of these are reasonable approaches. Mohan and I have had plans to do (1) for at least a year or two now, but have not gotten to it yet. I experimented with (2), with the help of Attilio, a few months ago and ran into several issues (especially relating to recursion). I've not yet explored (3); for web servers with many connections it is likely quite a good approach, although it doesn't preclude "unlucky" hashing assignments of connections to chains, which could lead to high contention. It might be quite easy to implement, however, which would be a big win :-). Rough sketches of the three approaches follow.
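
To make (1) concrete, here is a rough sketch of what true inpcb references
might look like. The names in_pcbref() and in_pcbrele() and the
inp_refcount field are illustrative, not existing API:

    /*
     * Sketch of (1): a reference count that pins the inpcb in memory,
     * allowing the global tcbinfo lock to be dropped before processing.
     */
    static void
    in_pcbref(struct inpcb *inp)
    {
            INP_LOCK_ASSERT(inp);
            inp->inp_refcount++;            /* hypothetical field */
    }

    /*
     * Release a reference; returns 1 (and consumes the inpcb lock) if
     * this was the last reference and the inpcb was freed.
     */
    static int
    in_pcbrele(struct inpcb *inp)
    {
            INP_LOCK_ASSERT(inp);
            if (--inp->inp_refcount > 0)
                    return (0);
            /* Last reference: tear down and free the inpcb here. */
            return (1);
    }

The input path would then become something like:

    INP_INFO_WLOCK(&tcbinfo);
    inp = in_pcblookup_hash(...);
    INP_LOCK(inp);
    in_pcbref(inp);
    INP_INFO_WUNLOCK(&tcbinfo);             /* safe: our reference pins inp */
    /* ... segment processing ... */
    if (!in_pcbrele(inp))
            INP_UNLOCK(inp);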
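
Similarly, (2) amounts to replacing the tcbinfo mutex with a reader/writer
lock. A purely illustrative sketch using rwlock(9):

    struct rwlock tcbinfo_rwlock;   /* would replace the tcbinfo mutex */

    rw_init(&tcbinfo_rwlock, "tcbinfo");

    /* Common case -- lookup only, no global state changes: */
    rw_rlock(&tcbinfo_rwlock);
    /* ... in_pcblookup_hash() and input processing ... */
    rw_runlock(&tcbinfo_rwlock);

    /* Connection close -- the global lists change, so exclude readers: */
    rw_wlock(&tcbinfo_rwlock);
    /* ... remove the inpcb from the hash chains and global list ... */
    rw_wunlock(&tcbinfo_rwlock);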
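
And (3) might look something like hashing connections across a small array
of chain locks; the names and sizing here are illustrative:

    /*
     * Sketch of (3): an array of mutexes protecting groups of inpcb
     * hash chains, so that connections hashing to different groups do
     * not contend with one another.
     */
    #define TCB_CHAIN_LOCKS 32      /* power of two */
    static struct mtx tcb_chain_mtx[TCB_CHAIN_LOCKS];

    static __inline struct mtx *
    tcb_chain_lock(u_int hash)
    {
            return (&tcb_chain_mtx[hash & (TCB_CHAIN_LOCKS - 1)]);
    }

    /* Input path, given the chain hash for this connection: */
    mtx_lock(tcb_chain_lock(hash));
    /* ... lookup and processing on that chain only ... */
    mtx_unlock(tcb_chain_lock(hash));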

>> reduced parallelism in some environments, as the ithread no longer runs
>> concurrently with the netisr.  I would not use net.isr.direct in versions
>> of FreeBSD before 6.1, but it should be pretty safe for 6.1 and later.

> Admittedly, the experimentation I've done has been with 6.0, rather than 6.1.

Looking back through the CVS log, I see that the issue I was concerned about was fixed in 6.x before 6.0 shipped, rather than after, as I had feared. Off hand, I'm not aware of specific bugs that would cause problems with direct dispatch in earlier 6.x releases.
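
For anyone tuning this: direct dispatch is toggled with the net.isr.direct
sysctl named above, using the usual sysctl(8) syntax, e.g.:

    sysctl net.isr.direct=1         # enable at runtime

    net.isr.direct=1                # or persistently, via /etc/sysctl.conf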

> Have there been any changes in 6.1 that address the locking in the TCP
> input path?  I'm very interested in being able to run highly parallel TCP
> loads.

I'd have to look at the logs in detail to answer that -- I'm not aware of any significant changes. There are probably minor ones that reduce contention in various ways, but I think the more important changes are still only in 7.x. In 7.x, the pcbinfo lock is held significantly less in socket-side paths such as send and receive, which should improve the contention situation.
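
To illustrate the shape of that 7.x change (this is not the literal
send-path code, just the pattern):

    /*
     * 6.x-style socket send path: the global lock is taken even in the
     * common case, so sends on different connections contend.
     */
    INP_INFO_WLOCK(&tcbinfo);
    INP_LOCK(inp);
    INP_INFO_WUNLOCK(&tcbinfo);
    /* ... append data to the socket buffer, call tcp_output() ... */
    INP_UNLOCK(inp);

    /*
     * 7.x-style socket send path: the per-connection lock suffices in
     * the common case, so the global lock stops being a bottleneck.
     */
    INP_LOCK(inp);
    /* ... append data to the socket buffer, call tcp_output() ... */
    INP_UNLOCK(inp);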

Robert N M Watson
Computer Laboratory
University of Cambridge