Re: [HACKERS] Postgres-R: internal messaging

Markus Wanner Wed, 23 Jul 2008 13:28:23 -0700

Hi,

what follows are some comments after trying to understand how theautovacuum launcher works and thoughts on how to apply this to thereplication manager in Postgres-R.


The initial comments in autovacuum.c say:

If the fork() call fails in the postmaster, it sets a flag in the shared
memory area, and sends a signal to the launcher.

I note that the shmem area that the postmaster is writing to is prettystatic and not dependent on any other state stored in shmem. Thatcertainly makes a difference compared to my imessages approach, where acorruption in the shmem for imessages could also confuse the postmaster.

Reading on, the 'can_launch' flag in the launcher's main loop makes surethat only one worker is requested concurrently, so that the launcherdoesn't miss a failure or success notice from either the postmaster orthe newly started worker. The replication manager currently shamelesslyrequests as many helper backend as it wants. I think I can change thatwithout much trouble. Would certainly make sense.

Notifications of the replication manager after termination or crashes ofa helper backend remain. Upon normal errors (i.e. elog(ERROR... ), thebackend processes themselves should take care of notifying thereplication manager. But crashes are more difficult. IMO the replicationmanager needs to stay alive during this reinitialization, to keep theGCS connection. However, it can easily detach from shared memorytemporarily (the imessages stuff is the only shmem place it touches,IIRC). However, a more difficult aspect is: it must be able to tell if abackend has applied its transaction *before* it died or not. Thus, afterall backends have been killed, the postmaster needs to wait withreinitializing shared memory, until the replication manager has consumedall its messages. (Otherwise we would risk "losing" local transactions,probably also remote ones).

So, yes, after thinking about it, detaching the postmaster from sharedmemory seems doable for Postgres-R (in the sense of "the postmaster doesnot rely on possibly corrupted data in shared memory"). Reinitializationneeds some more thoughts, but in general that seems like the way to go.


Regards

Markus Wanner


--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] Postgres-R: internal messaging

Reply via email to