Re: [HACKERS] Streaming replication status

Stefan Kaltenbrunner Thu, 14 Jan 2010 22:54:11 -0800

Greg Smith wrote:

Fujii Masao wrote:
"I'm thinking something like pg_standbys_xlog_location() [on the primary] which 
returns
one row per standby servers, showing pid of walsender, host name/
port number/user OID of the standby, the location where the standby
has written/flushed WAL. DBA can measure the gap from the
combination of pg_current_xlog_location() and pg_standbys_xlog_location()
via one query on the primary."
This function is useful but not essential for troubleshooting, I think.
So I'd like to postpone it.
Sure; in a functional system where primary and secondary are both up,you can assemble the info using the new functions you just added, sothis other one is certainly optional. I just took a brief look at thecode of the features you added, and it looks like it exposes the minimumnecessary to make this whole thing possible to manage. I think it's OKif you postpone this other bit, more important stuff for you to work on.


agreed

So: the one piece of information I though was most important to exposehere at an absolute minimum is there now. Good progress. The otherpopular request that keeps popping up here is providing an easy way tosee how backlogged the archive_command is, to make it easier to monitorfor out of disk errors that might prove catastrophic to replication.

I tend to disagree - in any reasonable production setup basic stulfflike disk space usage is monitored by non-application specific matters.While monitoring backlog might be interesting for other reasons, citingdisk space usage/exhaustions seems just wrong.



[...]

I'd find this extremely handy as a hook for monitoring scripts that wantto watch the server but don't have access to the filesystem directly,even given those limitations. I'd prefer to have the "tried to"version, because it will populate with the name of the troublesome fileit's stuck on even if archiving never gets its first segment delivered.

While fancy at all I think this goes way to far for the first cut atSR(or say this release), monitoring disk usage and tracking log filesfor errors are SOLVED issues in estabilished production setups. If youare in an environment that does neither for each and every serverindependent on what you have running on it, or a setup where thesysadmins are clueless and the poor DBA has to hack around that fact youhave way bigger issues anyway.



Stefan

--
Sent via pgsql-hackers mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] Streaming replication status

Reply via email to