[HACKERS] (re)start in our init scripts seems broken

Tomas Vondra Tue, 19 Jul 2016 19:42:53 -0700

Hi,

A few days ago I ran into a problem with the init script packaged in our community RPM packages. What happened was that they initiated a restart, but this happened:


# /etc/init.d/postgresql-9.3 restart
Stopping postgresql-9.3 service:                           [FAILED]
Starting postgresql-9.3 service:                           [  OK  ]

The database was however still in shutdown mode, performing a checkpoint. Sadly the init script uses the default timeout, so the stop gives up after just 60 seconds. But that seems fine, as the init script reports the failure correctly.


However the start action then seemingly succeeds, because it does this:

    echo -n "$PSQL_START"

    $SU -l postgres -c "$PGENGINE/postmaster -D '$PGDATA' ${PGOPTS} &" >> "$PGLOG" 2>&1 < /dev/null

    sleep 2
    pid=`head -n 1 "$PGDATA/postmaster.pid" 2>/dev/null`
    if [ "x$pid" != x ]
    then
            success "$PSQL_START"
            touch "$lockfile"
            echo $pid > "$pidfile"
            echo
    else
            failure "$PSQL_START"
            echo
            script_result=1
    fi

It simply attempts to start the postmaster directly (instead of using pg_ctl), does not check the return code, and instead proceeds to check only that the postmaster.pid file exists and contains a PID.

This however fails to do the trick, because the database is still running (in shutdown), so the new postmaster does not overwrite the file. And of course the PID in it still matches a running process, namely the old postmaster.
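To illustrate the gap, here is a minimal sketch of a stricter check: record the PID before the start attempt and report success only if postmaster.pid afterwards holds a different one. The helper names read_pid and started_fresh are hypothetical and not part of the packaged init script.

```shell
# Hypothetical helpers, not taken from the packaged init script.

# Print the first line of postmaster.pid (the postmaster PID), if any.
read_pid() {
    head -n 1 "$1/postmaster.pid" 2>/dev/null
}

# Succeed only if postmaster.pid now holds a PID that differs from the
# one seen before the start attempt.
#   $1 = PID read before starting (may be empty), $2 = data directory
started_fresh() {
    old_pid=$1
    new_pid=$(read_pid "$2")
    [ -n "$new_pid" ] && [ "$new_pid" != "$old_pid" ]
}
```

In the start action this would mean calling read_pid before launching the postmaster and started_fresh after the sleep, so a leftover postmaster.pid from the server that is still shutting down would be reported as a failure.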

Is there a reason why it's coded like this? I think we should use pg_ctl instead or (at the very least) check the postmaster return code. Also, perhaps we should add an explicit timeout, higher than 60 seconds.
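A rough sketch of what I have in mind, assuming pg_ctl is used with -w (wait) and -t (timeout in seconds). The function names, the STOP_TIMEOUT and START_TIMEOUT variables and the 300-second values are placeholders of mine, not something from the current script, and the sketch leaves out the switch to the postgres user that the real script does via $SU.

```shell
# Sketch only; names and timeout values below are assumptions.
PG_CTL=${PG_CTL:-pg_ctl}
STOP_TIMEOUT=${STOP_TIMEOUT:-300}     # assumed value, well above 60 seconds
START_TIMEOUT=${START_TIMEOUT:-300}   # assumed value

stop_server() {
    # -m fast: disconnect clients and shut down; -w: wait; -t: give up after N seconds
    "$PG_CTL" stop -D "${PGDATA:?PGDATA must be set}" -m fast -w -t "$STOP_TIMEOUT"
}

start_server() {
    # -l: send server output to the log file; -w: wait until startup completes
    "$PG_CTL" start -D "${PGDATA:?PGDATA must be set}" \
        -l "${PGLOG:?PGLOG must be set}" -w -t "$START_TIMEOUT"
}

restart_server() {
    # Do not attempt a start while the old server is still shutting down.
    if ! stop_server; then
        echo "stop failed or timed out; not starting" >&2
        return 1
    fi
    start_server
}
```

The point is that restart propagates the exit status of the stop step instead of blindly proceeding to start. How pg_ctl start -w behaves when an old postmaster is still present should be verified against the pg_ctl version shipped with the package.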


regards

--
Tomas Vondra                  http://www.2ndQuadrant.com
PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services


--
Sent via pgsql-hackers mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers
