[HACKERS] occasional startup failures

Andrew Dunstan Sun, 25 Mar 2012 09:13:38 -0700

Every so often buildfarm animals (nightjar and raven recently, forexample) report failures on starting up the postmaster. It appears thatthese failures are due to the postmaster not creating the pid filewithin 5 seconds, and so the logic in commit0bae3bc9be4a025df089f0a0c2f547fa538a97bc kicks in. Unfortunately, whenthis happens the postmaster has in fact sometimes started up, and theend result is that subsequent buildfarm runs will fail when they detectthat there is already a postmaster listening on the port, and withoutmanual intervention to kill the "rogue" postmaster this continues endlessly.

I can probably add some logic to the buildfarm script to try to detectthis condition and kill an errant postmaster so subsequent runs don'tget affected, but that seems to be avoiding a problem rather than fixingit. I'm not sure what we can do to improve it otherwise, though.


Thoughts?

cheers

andrew

--
Sent via pgsql-hackers mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

[HACKERS] occasional startup failures

Reply via email to