Re: Review of "pg_basebackup and pg_receivexlog to use non-blocking socket communication", was: Re: [HACKERS] Re: [BUGS] BUG #7534: walreceiver takes long time to detect n/w breakdown

Heikki Linnakangas Wed, 16 Jan 2013 02:32:16 -0800

On 07.01.2013 16:23, Boszormenyi Zoltan wrote:

Since my other patch against pg_basebackup is now committed,
this patch doesn't apply cleanly, patch rejects 2 hunks.
The fixed up patch is attached.

Now that I look at this a high-level perspective, why are we onlyworried about timeouts in the Copy-mode and when connecting? The initialcheckpoint could take a long time too, and if the server turns into ablack hole while the checkpoint is running, pg_basebackup will stillhang. Then again, a short timeout on that phase would be a bad idea,because the checkpoint can indeed take a long time.

In streaming replication, the keep-alive messages carry additionalinformation, the timestamps and WAL locations, so a keepalive makessense at that level. But otherwise, aren't we just trying to reimplementTCP keepalives? TCP keepalives are not perfect, but if we want to havean application level timeout, it should be implemented in the FE/BEprotocol.

I don't think we need to do anything specific to pg_basebackup. The usercan simply specify TCP keepalive settings in the connection string, likewith any libpq program.


- Heikki


--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: Review of "pg_basebackup and pg_receivexlog to use non-blocking socket communication", was: Re: [HACKERS] Re: [BUGS] BUG #7534: walreceiver takes long time to detect n/w breakdown

Reply via email to