Re: [HACKERS] Back-branch update releases coming in a couple weeks

MauMau Thu, 24 Jan 2013 06:52:58 -0800

From: "Fujii Masao" <[email protected]>

On Thu, Jan 24, 2013 at 7:42 AM, MauMau <[email protected]> wrote:

I searched through PostgreSQL mailing lists with "WAL contains referencesto
invalid pages", and i found 19 messages.  Some people encountered similar
problem.  There were some discussions regarding those problems (Tom and
Simon Riggs commented), but those discussions did not reach a solution.
I also found a discussion which might relate to this problem. Does thisfix
the problem?

[BUG] lag of minRecoveryPont in archive recovery
http://www.postgresql.org/message-id/[email protected]


Yes. Could you check whether you can reproduce the problem on the
latest REL9_2_STABLE?

I tried to produce the problem by doing "pg_ctl stop -mi" against theprimary more than ten times on REL9_2_STABLE, but the problem did notappear. However, I encountered the crash only once out of dozens offailovers, possibly more than a hundred times, on PostgreSQL 9.1.6. So, I'mnot sure the problem is fixed in REL9_2_STABLE.

I'm wondering if the fix discussed in the above thread solves my problem. Ifound the following differences between Horiguchi-san's case and my case:


(1)
Horiguchi-san says the bug outputs the message:

WARNING:  page 0 of relation base/16384/16385 does not exist

On the other hand, I got the message:

WARNING:  page 506747 of relation base/482272/482304 was uninitialized


(2)

Horiguchi-san produced the problem when he shut the standby immediately andrestarted it. However, I saw the problem during failover.

(3)

Horiguchi-san did not use any index, but in my case the WARNING messagerefers to an index.

But there's a similar point. Horiguchi-san says the problem occurs afterDELETE+VACUUM. In my case, I shut the primary down while the applicationwas doing INSERT/UPDATE. As the below messages show, some vacuuming wasrunning just before the immediate shutdown:


...
LOG:  automatic vacuum of table "GOLD.scm1.tbl1": index scans: 0
pages: 0 removed, 36743 remain
tuples: 0 removed, 73764 remain
system usage: CPU 0.09s/0.11u sec elapsed 0.66 sec

LOG: automatic analyze of table "GOLD.scm1.tbl1" system usage: CPU0.00s/0.14u sec elapsed 0.32 sec

LOG:  automatic vacuum of table "GOLD.scm1.tbl2": index scans: 0
pages: 0 removed, 12101 remain

tuples: 40657 removed, 44142 remain system usage: CPU 0.06s/0.06u secelapsed 0.30 secLOG: automatic analyze of table "GOLD.scm1.tbl2" system usage: CPU0.00s/0.06u sec elapsed 0.14 sec

LOG:  received immediate shutdown request
...

Could you tell me the details of the problem discussed and fixed in theupcoming minor release? I would to like to know the phenomenon and itsconditions, and whether it applies to my case.


Regards
MauMau




--
Sent via pgsql-hackers mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] Back-branch update releases coming in a couple weeks

Reply via email to