Re: [HACKERS] Hot standby, recovery infra

Heikki Linnakangas Thu, 29 Jan 2009 05:32:40 -0800

Simon Riggs wrote:

On Thu, 2009-01-29 at 12:22 +0200, Heikki Linnakangas wrote:
Itcomes from the fact that we set minSafeStartPoint beyond the actual endof WAL, if the last WAL segment is only partially filled (= fails CRCcheck at some point). If we crash after setting minSafeStartPoint likethat, and then restart recovery, we'll get the error.
Look again please. My proposal would avoid the error when it is not
relevant, yet keep it when it is (while recovering base backups).

I fail to see what base backups have to do with this. The problem arisesin this scenario:

0. A base backup is unzipped. recovery.conf is copied in place, and theremaining unarchived WAL segments are copied from the primary server topg_xlog. The last WAL segment is only partially filled. Let's say thatredo point is in WAL segment 1. The last, partial, WAL segment is 3, andWAL ends at 0/3500000

1. postmaster is started, recovery starts.
2. WAL segment 1 is restored from archive.
3. We reach consistent recovery point

4. We restore WAL segment 2 from archive. minSafeStartPoint is advancedto 0/30000005. WAL segment 2 is completely replayed, we move on to WAL segment 3. Itis not in archive, but it's found in pg_xlog. minSafeStartPoint isadvanced to 0/4000000. Note that that's beyond end of WAL.6. At replay of WAL record 0/3200000, the recovery is interrupted. Forexample, by a fast shutdown request, or crash.

Now when we restart the recovery, we will never reach minSafeStartPoint,which is now 0/4000000, and we'll fail with the error that Fujii-sanpointed out. We're already way past the min recovery point of basebackup by then.


--
  Heikki Linnakangas
  EnterpriseDB   http://www.enterprisedb.com

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] Hot standby, recovery infra

Reply via email to