Re: [HACKERS] We need to log aborted autovacuums

Greg Smith Sun, 16 Jan 2011 08:24:01 -0800

Robert Haas wrote:

On Sat, Jan 15, 2011 at 11:14 AM, Tom Lane <t...@sss.pgh.pa.us> wrote:

Greg Smith <g...@2ndquadrant.com> writes:

Does try_relation_open need to have a lock acquisition timeout when AV
is calling it?

Hmm.  I think when looking at the AV code, I've always subconsciously
assumed that try_relation_open would fail immediately if it couldn't get
the lock.  That certainly seems like it would be a more appropriate way
to behave than delaying indefinitely.


I'm confused how that's not happening already. What does "try" mean, otherwise?

Apparently "try" means acquire the requested lock on the oid of therelation, waiting for any amount of time for that part, and then checkif the relation really exists or not once it's got it. In this context,it means it will try to open the relation, but might fail if it doesn'tactually exist anymore. The relation not existing once it tries thecheck done after the lock is acquired is the only way it will return afailure.

try_relation_open calls LockRelationOid, which blocks. There is also aConditionalLockRelationOid, which does the same basic thing except itexits immediately, with a false return code, if it can't acquire thelock. I think we just need to nail down in what existing cases it'sacceptable to have try_relation_oid use ConditionalLockRelationOidinstead, which would make it do what all us reading the code thought itdid all along.


There are four callers of try_relation_open to be concerned about here:

src/backend/commands/vacuum.c    vacuum_rel
   onerel = try_relation_open(relid, lmode);

src/backend/commands/analyze.c    analyze_rel
   onerel = try_relation_open(relid, ShareUpdateExclusiveLock);

src/backend/commands/cluster.c    cluster_rel
   OldHeap = try_relation_open(tableOid, AccessExclusiveLock);

src/backend/commands/lockcmds.c LockTableRecurse
   * Apply LOCK TABLE recursively over an inheritance tree
   rel = try_relation_open(reloid, NoLock);

I think that both the vacuum_rel and analyze_rel cases are capable offiguring out if they are the autovacuum process, and if so calling thefast non-blocking version of this. I wouldn't want to mess with theother two, which rely upon the current behavior as far as I can see.

Probably took me longer to write this e-mail than the patch will take.Since I've got trivial patch fever this weekend and already have thetest case, I'll do this one next.


--
Greg Smith   2ndQuadrant US    g...@2ndquadrant.com   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support  www.2ndQuadrant.us
"PostgreSQL 9.0 High Performance": http://www.2ndQuadrant.com/books

Re: [HACKERS] We need to log aborted autovacuums

Reply via email to