Re: [HACKERS] 9.2.3 crashes during archive recovery

Heikki Linnakangas Wed, 13 Feb 2013 01:04:52 -0800

On 13.02.2013 09:46, Kyotaro HORIGUCHI wrote:

In this case, the FINAL consistency point is at the
XLOG_SMGR_TRUNCATE record, but current implemet does not record
the consistency point (checkpoint, or commit or smgr_truncate)
itself, so we cannot predict the final consistency point on
starting of recovery.


Hmm, what you did was basically:

1. Run server normally.
2. Kill it with "pg_ctl stop -m immediate".
3. Create a recovery.conf file, turning the server into a hot standby.

Without step 3, the server would perform crash recovery, and it wouldwork. But because of the recovery.conf file, the server goes intoarchive recovery, and because minRecoveryPoint is not set, it assumesthat the system is consistent from the start.

Aside from the immediate issue with truncation, the system really isn'tconsistent until the WAL has been replayed far enough, so it shouldn'topen for hot standby queries. There might be other, later, changesalready flushed to data files. The system has no way of knowing how farit needs to replay the WAL to become consistent.

At least in back-branches, I'd call this a pilot error. You can't turn amaster into a standby just by creating a recovery.conf file. At leastnot if the master was not shut down cleanly first.

If there's a use case for doing that, maybe we can do something betterin HEAD. If the control file says that the system was running(DB_IN_PRODUCTION), but there is a recovery.conf file, we could do crashrecovery first, until we reach the end of WAL, and go into archiverecovery mode after that. We'd recover all the WAL files in pg_xlog asfar as we can, same as in crash recovery, and only start restoring filesfrom the archive once we reach the end of WAL in pg_xlog. At that point,we'd also consider the system as consistent, and start up for hot standby.

I'm not sure that's worth the trouble, though. Perhaps it would bebetter to just throw an error if the control file state isDB_IN_PRODUCTION and a recovery.conf file exists. The admin can alwaysstart the server normally first, shut it down cleanly, and then createthe recovery.conf file.

On the other hand, updating control file on every commits or
smgr_truncate's should slow the transactions..

To be precise, we'd need to update the control file on everyXLogFlush(), like we do during archive recovery. That would indeed beunacceptable from a performance point of view. Updating the control filethat often would also be bad for robustness.


- Heikki


--
Sent via pgsql-hackers mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] 9.2.3 crashes during archive recovery

Reply via email to