Re: [HACKERS] Tracking latest timeline in standby mode

Heikki Linnakangas Tue, 04 Jan 2011 12:08:57 -0800

On 02.11.2010 07:15, Fujii Masao wrote:

On Mon, Nov 1, 2010 at 8:32 PM, Heikki Linnakangas
<heikki.linnakan...@enterprisedb.com>  wrote:

Yeah, that's one approach. Another is to validate the TLI in the xlog page
header, it should always match the current timeline we're on. That would
feel more robust to me.


Yeah, that seems better.

I finally got around to look at this. I wrote a patch to validate thatthe TLI on xlog page header matches ThisTimeLineID during recovery, andnoticed quickly in testing that it doesn't catch all the cases I'd liketo catch :-(.


The problem scenario is this:


TLI 1 -----------+C-------+------->Standby
                 .
                 .
TLI 2            +C-------+------->

The two horizontal lines represent two timelines. TLI 2 forks off fromTLI 1, because of a failover to a not-completely up-to-date standbyserver, for example. The plus-signs represent WAL segment boundaries andC's represent checkpoint records.

Another standby server has replayed all the WAL on TLI 2. Its latestrestartpoint is C. The checkpoint records on the different timelines areat the same location, at the beginning of the WAL files - not all thatimpossible if you have archive_timeout set, for example.

Now, if you stop and restart the standby, it will try to recover to thelatest timeline, which is TLI 2. But before the restart, it had alreadyreplayed the WAL from TLI 1, so it's wrong to replay the WAL from theparallel universe of TLI 2. At the moment, it will go ahead and do it,and you end up with an inconsistent database.

I planned to fix that by checking the TLI on the xlog page header, butthat alone isn't enough in the above scenario. The TLI on the pageheaders on timeline 2 are what's expected; the first page on the segmenthas TLI==1, because it was just forked off from timeline 1, and thesubsequent pages have TLI==2, as they should after the checkpoint record.

So we have to remember that before the restart, which timeline where weon. We already remember how far we had replayed, that's theminRecoveryPoint we store in the control file, but we have to memorizethe timeline along that.

On reflection, your idea of checking the history file before replayinganything seems much easier. We'll still need to add the timelinealongside minRecoveryPoint to do the checking, but it's a lot easier todo against the history file. And we can validate the TLIs on pageheaders against the information from the history file as we read in the WAL.


--
  Heikki Linnakangas
  EnterpriseDB   http://www.enterprisedb.com

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] Tracking latest timeline in standby mode

Reply via email to