Re: Wait event that should be reported while waiting for WAL archiving to finish
On 2020/03/23 15:56, Fujii Masao wrote: On 2020/03/19 19:39, Atsushi Torikoshi wrote: On Wed, Feb 26, 2020 at 9:19 PM Fujii Masao mailto:masao.fu...@oss.nttdata.com>> wrote: I have no idea about this. But I wonder how much that change is helpful to reduce the power consumption because waiting for WAL archive during the backup basically not so frequently happens. +1. And as far as I reviewed the patch, I didn't find any problems. Thanks for the review! Barring any objection, I will commit this patch. Pushed! Thanks! Regards, -- Fujii Masao NTT DATA CORPORATION Advanced Platform Technology Group Research and Development Headquarters
Re: Wait event that should be reported while waiting for WAL archiving to finish
On 2020/03/19 19:39, Atsushi Torikoshi wrote: On Wed, Feb 26, 2020 at 9:19 PM Fujii Masao mailto:masao.fu...@oss.nttdata.com>> wrote: I have no idea about this. But I wonder how much that change is helpful to reduce the power consumption because waiting for WAL archive during the backup basically not so frequently happens. +1. And as far as I reviewed the patch, I didn't find any problems. Thanks for the review! Barring any objection, I will commit this patch. Regards, -- Fujii Masao NTT DATA CORPORATION Advanced Platform Technology Group Research and Development Headquarters
Re: Wait event that should be reported while waiting for WAL archiving to finish
On Wed, Feb 26, 2020 at 9:19 PM Fujii Masao wrote: > I have no idea about this. But I wonder how much that change > is helpful to reduce the power consumption because waiting > for WAL archive during the backup basically not so frequently > happens. > +1. And as far as I reviewed the patch, I didn't find any problems. Regards, -- Atsushi Torikoshi
Re: Wait event that should be reported while waiting for WAL archiving to finish
On 2020/02/18 12:39, Michael Paquier wrote: On Mon, Feb 17, 2020 at 10:21:23PM +0900, Fujii Masao wrote: On 2020/02/17 18:48, Michael Paquier wrote: Actually, I have some questions: 1) Should a new wait event be added in recoveryPausesHere()? That would be IMO useful. Yes, it's useful, I think. But it's better to implement that as a separate patch. No problem for me. On second thought, it's OK to add that event into the patch. Attached is the updated version of the patch. This patch adds two wait events for WAL archiving and recovery pause. 2) Perhaps those two points should be replaced with WaitLatch(), where we would use the new wait events introduced? For what? Maybe it should, but I'm not sure it's worth the trouble. I don't have more to offer than signal handling consistency for both without relying on pg_usleep()'s behavior depending on the platform, and power consumption. For the recovery pause, the second argument may not be worth carrying, but we never had this argument for the archiving wait, did we? I have no idea about this. But I wonder how much that change is helpful to reduce the power consumption because waiting for WAL archive during the backup basically not so frequently happens. Regards, -- Fujii Masao NTT DATA CORPORATION Advanced Platform Technology Group Research and Development Headquarters diff --git a/doc/src/sgml/monitoring.sgml b/doc/src/sgml/monitoring.sgml index 87586a7b06..6c0c93ffdc 100644 --- a/doc/src/sgml/monitoring.sgml +++ b/doc/src/sgml/monitoring.sgml @@ -1335,7 +1335,11 @@ postgres 27093 0.0 0.0 30096 2752 ?Ss 11:34 0:00 postgres: ser Waiting in an extension. - IPC + IPC + BackupWaitWalArchive + Waiting for WAL files required for the backup to be successfully archived. + + BgWorkerShutdown Waiting for background worker to shut down. @@ -1467,6 +1471,10 @@ postgres 27093 0.0 0.0 30096 2752 ?Ss 11:34 0:00 postgres: ser Promote Waiting for standby promotion. + + RecoveryPause + Waiting for recovery to be resumed. + ReplicationOriginDrop Waiting for a replication origin to become inactive to be dropped. diff --git a/src/backend/access/transam/xlog.c b/src/backend/access/transam/xlog.c index d19408b3be..5569606183 100644 --- a/src/backend/access/transam/xlog.c +++ b/src/backend/access/transam/xlog.c @@ -5945,7 +5945,9 @@ recoveryPausesHere(void) while (RecoveryIsPaused()) { + pgstat_report_wait_start(WAIT_EVENT_RECOVERY_PAUSE); pg_usleep(100L);/* 1000 ms */ + pgstat_report_wait_end(); HandleStartupProcInterrupts(); } } @@ -11129,7 +11131,9 @@ do_pg_stop_backup(char *labelfile, bool waitforarchive, TimeLineID *stoptli_p) reported_waiting = true; } + pgstat_report_wait_start(WAIT_EVENT_BACKUP_WAIT_WAL_ARCHIVE); pg_usleep(100L); + pgstat_report_wait_end(); if (++waits >= seconds_before_warning) { diff --git a/src/backend/postmaster/pgstat.c b/src/backend/postmaster/pgstat.c index 462b4d7e06..2400bf31ee 100644 --- a/src/backend/postmaster/pgstat.c +++ b/src/backend/postmaster/pgstat.c @@ -3740,6 +3740,9 @@ pgstat_get_wait_ipc(WaitEventIPC w) switch (w) { + case WAIT_EVENT_BACKUP_WAIT_WAL_ARCHIVE: + event_name = "BackupWaitWalArchive"; + break; case WAIT_EVENT_BGWORKER_SHUTDOWN: event_name = "BgWorkerShutdown"; break; @@ -3839,6 +3842,9 @@ pgstat_get_wait_ipc(WaitEventIPC w) case WAIT_EVENT_PROMOTE: event_name = "Promote"; break; + case WAIT_EVENT_RECOVERY_PAUSE: + event_name = "RecoveryPause"; + break; case WAIT_EVENT_REPLICATION_ORIGIN_DROP: event_name = "ReplicationOriginDrop"; break; diff --git a/src/include/pgstat.h b/src/include/pgstat.h index 3a65a51696..6d8332163c 100644 --- a/src/include/pgstat.h +++ b/src/include/pgstat.h @@ -817,7 +817,8 @@ typedef enum */ typedef enum { - WAIT_EVENT_BGWORKER_SHUTDOWN = PG_WAIT_IPC, + WAIT_EVENT_BACKUP_WAIT_WAL_ARCHIVE = PG_WAIT_IPC, + WAIT_EVENT_BGWORKER_SHUTDOWN, WAIT_EVENT_BGWORKER_STARTUP, WAIT_EVENT_BTREE_PAGE, WAIT_EVENT_CLOG_GROUP_UPDATE, @@ -850,6 +851,7 @@ typedef enum WAIT_EVENT_PARALLEL_FINISH, WAIT_EVENT_PROCARRAY_GROUP_UPDATE, WAIT_EVENT_PROMOTE, + WAIT_EVENT_RECOVERY_PAUSE,
Re: Wait event that should be reported while waiting for WAL archiving to finish
On Mon, Feb 17, 2020 at 10:21:23PM +0900, Fujii Masao wrote: > On 2020/02/17 18:48, Michael Paquier wrote: >> Actually, I have some questions: >> 1) Should a new wait event be added in recoveryPausesHere()? That >> would be IMO useful. > > Yes, it's useful, I think. But it's better to implement that > as a separate patch. No problem for me. >> 2) Perhaps those two points should be replaced with WaitLatch(), where >> we would use the new wait events introduced? > > For what? Maybe it should, but I'm not sure it's worth the trouble. I don't have more to offer than signal handling consistency for both without relying on pg_usleep()'s behavior depending on the platform, and power consumption. For the recovery pause, the second argument may not be worth carrying, but we never had this argument for the archiving wait, did we? For both, on top of it you don't need to worry about concurrent issues with the wait events attached around. -- Michael signature.asc Description: PGP signature
Re: Wait event that should be reported while waiting for WAL archiving to finish
On 2020/02/17 18:48, Michael Paquier wrote: On Mon, Feb 17, 2020 at 04:30:00PM +0900, Fujii Masao wrote: On 2020/02/14 23:43, Robert Haas wrote: On Thu, Feb 13, 2020 at 10:47 PM Fujii Masao wrote: Fixed. Thanks for the review! I think it would be safer to just report the wait event during pg_usleep(100L) rather than putting those calls around the whole loop. It does not seem impossible that ereport() or CHECK_FOR_INTERRUPTS() could do something that reports a wait event internally. CHECK_FOR_INTERRUPTS() would reset the event wait state. Hm.. You may be right about the WARNING and it would be better to not rely on that. Do you remember the states which may be triggered? OK, so I attached the updated version of the patch. Thanks for the review! Actually, I have some questions: 1) Should a new wait event be added in recoveryPausesHere()? That would be IMO useful. Yes, it's useful, I think. But it's better to implement that as a separate patch. 2) Perhaps those two points should be replaced with WaitLatch(), where we would use the new wait events introduced? For what? Maybe it should, but I'm not sure it's worth the trouble. Regards, -- Fujii Masao NTT DATA CORPORATION Advanced Platform Technology Group Research and Development Headquarters
Re: Wait event that should be reported while waiting for WAL archiving to finish
On Mon, Feb 17, 2020 at 04:30:00PM +0900, Fujii Masao wrote: > On 2020/02/14 23:43, Robert Haas wrote: >> On Thu, Feb 13, 2020 at 10:47 PM Fujii Masao >> wrote: >>> Fixed. Thanks for the review! >> >> I think it would be safer to just report the wait event during >> pg_usleep(100L) rather than putting those calls around the whole >> loop. It does not seem impossible that ereport() or >> CHECK_FOR_INTERRUPTS() could do something that reports a wait event >> internally. CHECK_FOR_INTERRUPTS() would reset the event wait state. Hm.. You may be right about the WARNING and it would be better to not rely on that. Do you remember the states which may be triggered? > OK, so I attached the updated version of the patch. > Thanks for the review! Actually, I have some questions: 1) Should a new wait event be added in recoveryPausesHere()? That would be IMO useful. 2) Perhaps those two points should be replaced with WaitLatch(), where we would use the new wait events introduced? -- Michael signature.asc Description: PGP signature
Re: Wait event that should be reported while waiting for WAL archiving to finish
On 2020/02/14 23:43, Robert Haas wrote: On Thu, Feb 13, 2020 at 10:47 PM Fujii Masao wrote: Fixed. Thanks for the review! I think it would be safer to just report the wait event during pg_usleep(100L) rather than putting those calls around the whole loop. It does not seem impossible that ereport() or CHECK_FOR_INTERRUPTS() could do something that reports a wait event internally. OK, so I attached the updated version of the patch. Thanks for the review! Regards, -- Fujii Masao NTT DATA CORPORATION Advanced Platform Technology Group Research and Development Headquarters diff --git a/doc/src/sgml/monitoring.sgml b/doc/src/sgml/monitoring.sgml index a9f6ee6e32..2f88b41529 100644 --- a/doc/src/sgml/monitoring.sgml +++ b/doc/src/sgml/monitoring.sgml @@ -1335,7 +1335,11 @@ postgres 27093 0.0 0.0 30096 2752 ?Ss 11:34 0:00 postgres: ser Waiting in an extension. - IPC + IPC + BackupWaitWalArchive + Waiting for WAL files required for the backup to be successfully archived. + + BgWorkerShutdown Waiting for background worker to shut down. diff --git a/src/backend/access/transam/xlog.c b/src/backend/access/transam/xlog.c index 3813eadfb4..dce0f3b041 100644 --- a/src/backend/access/transam/xlog.c +++ b/src/backend/access/transam/xlog.c @@ -11107,7 +11107,9 @@ do_pg_stop_backup(char *labelfile, bool waitforarchive, TimeLineID *stoptli_p) reported_waiting = true; } + pgstat_report_wait_start(WAIT_EVENT_BACKUP_WAIT_WAL_ARCHIVE); pg_usleep(100L); + pgstat_report_wait_end(); if (++waits >= seconds_before_warning) { diff --git a/src/backend/postmaster/pgstat.c b/src/backend/postmaster/pgstat.c index 59dc4f31ab..6e7a57b1b7 100644 --- a/src/backend/postmaster/pgstat.c +++ b/src/backend/postmaster/pgstat.c @@ -3737,6 +3737,9 @@ pgstat_get_wait_ipc(WaitEventIPC w) switch (w) { + case WAIT_EVENT_BACKUP_WAIT_WAL_ARCHIVE: + event_name = "BackupWaitWalArchive"; + break; case WAIT_EVENT_BGWORKER_SHUTDOWN: event_name = "BgWorkerShutdown"; break; diff --git a/src/include/pgstat.h b/src/include/pgstat.h index 3a65a51696..20313f1e32 100644 --- a/src/include/pgstat.h +++ b/src/include/pgstat.h @@ -817,7 +817,8 @@ typedef enum */ typedef enum { - WAIT_EVENT_BGWORKER_SHUTDOWN = PG_WAIT_IPC, + WAIT_EVENT_BACKUP_WAIT_WAL_ARCHIVE = PG_WAIT_IPC, + WAIT_EVENT_BGWORKER_SHUTDOWN, WAIT_EVENT_BGWORKER_STARTUP, WAIT_EVENT_BTREE_PAGE, WAIT_EVENT_CLOG_GROUP_UPDATE,
Re: Wait event that should be reported while waiting for WAL archiving to finish
On 2020/02/14 15:45, Michael Paquier wrote: On Fri, Feb 14, 2020 at 12:47:19PM +0900, Fujii Masao wrote: logical_rewrite_truncate_v1.patch adds the description of LogicalRewriteTruncate into the doc. This needs to be back-patched to v10 where commit 249cf070e3 introduced LogicalRewriteTruncate event. Indeed. You just be careful about the number of fields for morerows, as that's not the same across branches. gss_open_server_v1.patch adds the description of GSSOpenServer into the doc and update the code in pgstat_get_wait_client(). This needs to be applied in v12 where commit b0b39f72b9 introduced GSSOpenServer event. gss_open_server_for_master_v1.patch does not only what the above patch does but also update wait event enum into alphabetical order. This needs to be applied in the master. Thanks for splitting things. All that looks correct to me. Thanks for the review! Pushed the patches for LogicalRewriteTruncate and GSSOpenServer. Regards, -- Fujii Masao NTT DATA CORPORATION Advanced Platform Technology Group Research and Development Headquarters
Re: Wait event that should be reported while waiting for WAL archiving to finish
On Thu, Feb 13, 2020 at 10:47 PM Fujii Masao wrote: > Fixed. Thanks for the review! I think it would be safer to just report the wait event during pg_usleep(100L) rather than putting those calls around the whole loop. It does not seem impossible that ereport() or CHECK_FOR_INTERRUPTS() could do something that reports a wait event internally. -- Robert Haas EnterpriseDB: http://www.enterprisedb.com The Enterprise PostgreSQL Company
Re: Wait event that should be reported while waiting for WAL archiving to finish
On Fri, Feb 14, 2020 at 12:47:19PM +0900, Fujii Masao wrote: > logical_rewrite_truncate_v1.patch adds the description of > LogicalRewriteTruncate into the doc. This needs to be > back-patched to v10 where commit 249cf070e3 introduced > LogicalRewriteTruncate event. Indeed. You just be careful about the number of fields for morerows, as that's not the same across branches. > gss_open_server_v1.patch adds the description of GSSOpenServer > into the doc and update the code in pgstat_get_wait_client(). > This needs to be applied in v12 where commit b0b39f72b9 introduced > GSSOpenServer event. > > gss_open_server_for_master_v1.patch does not only what the above > patch does but also update wait event enum into alphabetical order. > This needs to be applied in the master. Thanks for splitting things. All that looks correct to me. -- Michael signature.asc Description: PGP signature
Re: Wait event that should be reported while waiting for WAL archiving to finish
On 2020/02/13 16:30, Michael Paquier wrote: On Thu, Feb 13, 2020 at 03:35:50PM +0900, Fujii Masao wrote: I found that the wait events "LogicalRewriteTruncate" and "GSSOpenServer" are not documented. I'm thinking to add them into doc separately if ok. Nice catch. The ordering of the entries is not respected either for GSSOpenServer in pgstat.h. The portion for the code and the docs can be fixed in back-branches, but not the enum list in WaitEventClient or we would have an ABI breakage. But this can be fixed on HEAD. Can you take care of it? Yes. Patch attached. logical_rewrite_truncate_v1.patch adds the description of LogicalRewriteTruncate into the doc. This needs to be back-patched to v10 where commit 249cf070e3 introduced LogicalRewriteTruncate event. gss_open_server_v1.patch adds the description of GSSOpenServer into the doc and update the code in pgstat_get_wait_client(). This needs to be applied in v12 where commit b0b39f72b9 introduced GSSOpenServer event. gss_open_server_for_master_v1.patch does not only what the above patch does but also update wait event enum into alphabetical order. This needs to be applied in the master. SyncRep Waiting for confirmation from remote server during synchronous replication. + + BackupWaitWalArchive + Waiting for WAL files required for the backup to be successfully archived. + The category IPC is adapted. You forgot to update the markup morerows from "36" to "37", causing the table of the wait events to have a weird format (the bottom should be incorrect). Fixed. Thanks for the review! + pgstat_report_wait_start(WAIT_EVENT_BACKUP_WAIT_WAL_ARCHIVE); while (XLogArchiveIsBusy(lastxlogfilename) || XLogArchiveIsBusy(histfilename)) { @@ -11120,6 +11121,7 @@ do_pg_stop_backup(char *labelfile, bool waitforarchive, TimeLineID *stoptli_p) "but the database backup will not be usable without all the WAL segments."))); } } + pgstat_report_wait_end(); Okay, that position is right. @@ -3848,6 +3848,9 @@ pgstat_get_wait_ipc(WaitEventIPC w) case WAIT_EVENT_SYNC_REP: event_name = "SyncRep"; break; + case WAIT_EVENT_BACKUP_WAIT_WAL_ARCHIVE: + event_name = "BackupWaitWalArchive"; + break; /* no default case, so that compiler will warn */ [...] @@ -853,7 +853,8 @@ typedef enum WAIT_EVENT_REPLICATION_ORIGIN_DROP, WAIT_EVENT_REPLICATION_SLOT_DROP, WAIT_EVENT_SAFE_SNAPSHOT, - WAIT_EVENT_SYNC_REP + WAIT_EVENT_SYNC_REP, + WAIT_EVENT_BACKUP_WAIT_WAL_ARCHIVE } WaitEventIPC; It would be good to keep entries in alphabetical order in the header, the code and in the docs (see the effort from 5ef037c), and your patch is missing that concept for all three places where it matters for this new event. Fixed. Patch attached. Regards, -- Fujii Masao NTT DATA CORPORATION Advanced Platform Technology Group Research and Development Headquarters diff --git a/doc/src/sgml/monitoring.sgml b/doc/src/sgml/monitoring.sgml index 9129f79bbf..027df985ac 100644 --- a/doc/src/sgml/monitoring.sgml +++ b/doc/src/sgml/monitoring.sgml @@ -1293,7 +1293,7 @@ postgres 27093 0.0 0.0 30096 2752 ?Ss 11:34 0:00 postgres: ser Waiting in main loop of WAL writer process. - Client + Client ClientRead Waiting to read data from the client. @@ -1301,6 +1301,10 @@ postgres 27093 0.0 0.0 30096 2752 ?Ss 11:34 0:00 postgres: ser ClientWrite Waiting to write data to the client. + + GSSOpenServer + Waiting to read data from the client while establishing the GSSAPI session. + LibPQWalReceiverConnect Waiting in WAL receiver to establish connection to remote server. diff --git a/src/backend/postmaster/pgstat.c b/src/backend/postmaster/pgstat.c index 7169509a79..59dc4f31ab 100644 --- a/src/backend/postmaster/pgstat.c +++ b/src/backend/postmaster/pgstat.c @@ -3697,6 +3697,9 @@ pgstat_get_wait_client(WaitEventClient w) case WAIT_EVENT_CLIENT_WRITE: event_name = "ClientWrite"; break; + case WAIT_EVENT_GSS_OPEN_SERVER: + event_name = "GSSOpenServer"; + break; case WAIT_EVENT_LIBPQWALRECEIVER_CONNECT: event_name = "LibPQWalReceiverConnect"; break; @@ -3715,9 +3718,6 @@ pgstat_get_wait_client(WaitEventClient w) case WAIT_EVENT_WAL_SENDER_WRITE_DATA:
Re: Wait event that should be reported while waiting for WAL archiving to finish
On Thu, Feb 13, 2020 at 03:35:50PM +0900, Fujii Masao wrote: > I found that the wait events "LogicalRewriteTruncate" and > "GSSOpenServer" are not documented. I'm thinking to add > them into doc separately if ok. Nice catch. The ordering of the entries is not respected either for GSSOpenServer in pgstat.h. The portion for the code and the docs can be fixed in back-branches, but not the enum list in WaitEventClient or we would have an ABI breakage. But this can be fixed on HEAD. Can you take care of it? If you need help, please feel free to poke me. I think that this should be fixed first, before adding the new event. > SyncRep > Waiting for confirmation from remote server during > synchronous replication. > > + > + BackupWaitWalArchive > + Waiting for WAL files required for the backup to be > successfully archived. > + The category IPC is adapted. You forgot to update the markup morerows from "36" to "37", causing the table of the wait events to have a weird format (the bottom should be incorrect). > + pgstat_report_wait_start(WAIT_EVENT_BACKUP_WAIT_WAL_ARCHIVE); > while (XLogArchiveIsBusy(lastxlogfilename) || > XLogArchiveIsBusy(histfilename)) > { > @@ -11120,6 +11121,7 @@ do_pg_stop_backup(char *labelfile, bool > waitforarchive, TimeLineID *stoptli_p) >"but the > database backup will not be usable without all the WAL segments."))); > } > } > + pgstat_report_wait_end(); Okay, that position is right. > @@ -3848,6 +3848,9 @@ pgstat_get_wait_ipc(WaitEventIPC w) > case WAIT_EVENT_SYNC_REP: > event_name = "SyncRep"; > break; > + case WAIT_EVENT_BACKUP_WAIT_WAL_ARCHIVE: > + event_name = "BackupWaitWalArchive"; > + break; > /* no default case, so that compiler will warn */ > [...] > @@ -853,7 +853,8 @@ typedef enum > WAIT_EVENT_REPLICATION_ORIGIN_DROP, > WAIT_EVENT_REPLICATION_SLOT_DROP, > WAIT_EVENT_SAFE_SNAPSHOT, > - WAIT_EVENT_SYNC_REP > + WAIT_EVENT_SYNC_REP, > + WAIT_EVENT_BACKUP_WAIT_WAL_ARCHIVE > } WaitEventIPC; It would be good to keep entries in alphabetical order in the header, the code and in the docs (see the effort from 5ef037c), and your patch is missing that concept for all three places where it matters for this new event. -- Michael signature.asc Description: PGP signature
Re: Wait event that should be reported while waiting for WAL archiving to finish
On 2020/02/13 12:28, Michael Paquier wrote: On Thu, Feb 13, 2020 at 02:29:20AM +0900, Fujii Masao wrote: When I saw pg_stat_activity.wait_event while pg_basebackup -X none is waiting for WAL archiving to finish, it was either NULL or CheckpointDone. I think this is confusing. What about introducing new wait_event like WAIT_EVENT_BACKUP_WAIT_WAL_ARCHIVE (BackupWaitWalArchive) and reporting it during that period? Sounds like a good idea to me. You need to be careful that this does not overwrite more low-level wait event registration though, so that could be more tricky than it looks at first sight. Thanks for the advise! Patch attached. I found that the wait events "LogicalRewriteTruncate" and "GSSOpenServer" are not documented. I'm thinking to add them into doc separately if ok. Regards, -- Fujii Masao NTT DATA CORPORATION Advanced Platform Technology Group Research and Development Headquarters diff --git a/doc/src/sgml/monitoring.sgml b/doc/src/sgml/monitoring.sgml index 9129f79bbf..1e2f81d994 100644 --- a/doc/src/sgml/monitoring.sgml +++ b/doc/src/sgml/monitoring.sgml @@ -1479,6 +1479,10 @@ postgres 27093 0.0 0.0 30096 2752 ?Ss 11:34 0:00 postgres: ser SyncRep Waiting for confirmation from remote server during synchronous replication. + + BackupWaitWalArchive + Waiting for WAL files required for the backup to be successfully archived. + Timeout BaseBackupThrottle diff --git a/src/backend/access/transam/xlog.c b/src/backend/access/transam/xlog.c index 3813eadfb4..d79417b443 100644 --- a/src/backend/access/transam/xlog.c +++ b/src/backend/access/transam/xlog.c @@ -11095,6 +11095,7 @@ do_pg_stop_backup(char *labelfile, bool waitforarchive, TimeLineID *stoptli_p) seconds_before_warning = 60; waits = 0; + pgstat_report_wait_start(WAIT_EVENT_BACKUP_WAIT_WAL_ARCHIVE); while (XLogArchiveIsBusy(lastxlogfilename) || XLogArchiveIsBusy(histfilename)) { @@ -11120,6 +11121,7 @@ do_pg_stop_backup(char *labelfile, bool waitforarchive, TimeLineID *stoptli_p) "but the database backup will not be usable without all the WAL segments."))); } } + pgstat_report_wait_end(); ereport(NOTICE, (errmsg("all required WAL segments have been archived"))); diff --git a/src/backend/postmaster/pgstat.c b/src/backend/postmaster/pgstat.c index 7169509a79..6114ab44ee 100644 --- a/src/backend/postmaster/pgstat.c +++ b/src/backend/postmaster/pgstat.c @@ -3848,6 +3848,9 @@ pgstat_get_wait_ipc(WaitEventIPC w) case WAIT_EVENT_SYNC_REP: event_name = "SyncRep"; break; + case WAIT_EVENT_BACKUP_WAIT_WAL_ARCHIVE: + event_name = "BackupWaitWalArchive"; + break; /* no default case, so that compiler will warn */ } diff --git a/src/include/pgstat.h b/src/include/pgstat.h index aecb6013f0..7c40ebc3dc 100644 --- a/src/include/pgstat.h +++ b/src/include/pgstat.h @@ -853,7 +853,8 @@ typedef enum WAIT_EVENT_REPLICATION_ORIGIN_DROP, WAIT_EVENT_REPLICATION_SLOT_DROP, WAIT_EVENT_SAFE_SNAPSHOT, - WAIT_EVENT_SYNC_REP + WAIT_EVENT_SYNC_REP, + WAIT_EVENT_BACKUP_WAIT_WAL_ARCHIVE } WaitEventIPC; /* --
Re: Wait event that should be reported while waiting for WAL archiving to finish
On Thu, Feb 13, 2020 at 02:29:20AM +0900, Fujii Masao wrote: > When I saw pg_stat_activity.wait_event while pg_basebackup -X none > is waiting for WAL archiving to finish, it was either NULL or > CheckpointDone. I think this is confusing. What about introducing > new wait_event like WAIT_EVENT_BACKUP_WAIT_WAL_ARCHIVE > (BackupWaitWalArchive) and reporting it during that period? Sounds like a good idea to me. You need to be careful that this does not overwrite more low-level wait event registration though, so that could be more tricky than it looks at first sight. -- Michael signature.asc Description: PGP signature
Wait event that should be reported while waiting for WAL archiving to finish
Hi, When I saw pg_stat_activity.wait_event while pg_basebackup -X none is waiting for WAL archiving to finish, it was either NULL or CheckpointDone. I think this is confusing. What about introducing new wait_event like WAIT_EVENT_BACKUP_WAIT_WAL_ARCHIVE (BackupWaitWalArchive) and reporting it during that period? Regards, -- Fujii Masao NTT DATA CORPORATION Advanced Platform Technology Group Research and Development Headquarters