Re: [HACKERS] WIP: dynahash replacement for buffer table

Ryan Johnson Thu, 16 Oct 2014 08:34:14 -0700

On 16/10/2014 7:19 AM, Robert Haas wrote:

On Thu, Oct 16, 2014 at 8:03 AM, Ryan Johnson
<[email protected]> wrote:

Why not use an RCU mechanism [1] and ditch the hazard pointers? Seems like
an ideal fit...


In brief, RCU has the following requirements:

Read-heavy access pattern
Writers must be able to make dead objects unreachable to new readers (easily
done for most data structures)
Writers must be able to mark dead objects in such a way that existing
readers know to ignore their contents but can still traverse the data
structure properly (usually straightforward)
Readers must occasionally inform the system that they are not currently
using any RCU-protected pointers (to allow resource reclamation)

Have a look at http://lwn.net/Articles/573424/ and specifically the
"URCU overview" section.  Basically, that last requirement - that
readers inform the system tat they are not currently using any
RCU-protected pointers - turns out to require either memory barriers
or signals.
All of the many techniques that have been developed in this area are
merely minor variations on a very old theme: set some kind of flag
variable in shared memory to let people know that you are reading a
shared data structure, and clear it when you are done.  Then, other
people can figure out when it's safe to recycle memory that was
previously part of that data structure.

Sure, but RCU has the key benefit of decoupling its machinery (esp. thatflag update) from the actual critical section(s) it protects. In a DBMSsetting, for example, once per transaction or SQL statement would dojust fine. The notification can be much better than a simple flag---youwant to know whether the thread has ever quiesced since the last reclaimcycle began, not whether it is currently quiesced (which it usuallyisn't). In the implementation I use, a busy thread (e.g. not about to goidle) can "chain" its RCU "transactions." In the common case, a chainedquiesce call comes when the RCU epoch is not trying to change, and the"flag update" degenerates to a simple load. Further, the only time it'scritical to have that memory barrier is if the quiescing thread is aboutto go idle. Otherwise, missing a flag just imposes a small delay onresource reclamation (and that's assuming the flag in question evenbelonged to a straggler process). How you implement epoch management,especially the handling of stragglers, is the deciding factor in whetherRCU works well. The early URCU techniques were pretty terrible, andmaybe general-purpose URCU is doomed to stay that way, but in a DBMScore it can be done very cleanly and efficiently because we can easilyadd the quiescent points at appropriate locations in the code.

  In Linux's RCU, the flag
variable is "whether the process is currently scheduled on a CPU",
which is obviously not workable from user-space.  Lacking that, you
need an explicit flag variable, which means you need memory barriers,
since the protected operation is a load and the flag variable is
updated via a store.  You can try to avoid some of the overhead by
updating the flag variable less often (say, when a signal arrives) or
you can make it more fine-grained (in my case, we only prevent reclaim
of a fraction of the data structure at a time, rather than all of it)
or various other variants, but none of this is unfortunately so simple
as "apply technique X and your problem just goes away".

Magic wand, no (does nothing for update contention, for example, andrequires some care to apply). But from a practical perspective RCU,properly implemented, does make an awful lot of problems an awful lotsimpler to tackle. Especially for the readers.


Ryan



--
Sent via pgsql-hackers mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] WIP: dynahash replacement for buffer table

Reply via email to