Re: [HACKERS] Testing of parallel restore with current snapshot

Andrew Dunstan Fri, 15 May 2009 12:01:09 -0700


Tom Lane wrote:

Andrew Dunstan <[email protected]> writes:

Tom Lane wrote:

I don't want to mess with it right now either, but perhaps we should
have a TODO item to improve the intelligence of parallel restore so that
it really does try to do things this way.

Other things being equal it schedules things in TOC order, which oftenworks as we want anyway. I think there's a good case for altering thename sort order of pg_dump to group sub-objects of a table (indexes,constraints etc.) together, ie. instead of sorting by <objectname>, we'dsort by <tablename, objectname>. This would possibly improve the effectseen in parallel restore without requiring any extra intelligence there.


I'm not at all excited about substituting one arbitrary ordering rule
for another one ...

What is probably happening that accounts for Josh's positive experience
is that all the indexes of a particular table "come free" from the
dependency restrictions at the same instant, namely when the data load
for that table ends.  So if there's nothing else to do they'll get
scheduled together.  However, if the data load for table B finishes
before all the indexes of table A have been scheduled, then B's indexes
will start competing with A's for scheduling slots.  The performance
considerations suggest that we'd be best advised to finish out all of
A's indexes before scheduling any of B's, but I'm not sure that that's
what it will actually do.

Based on this thought, what seems to make sense as a quick-and-dirty
answer is to make sure that items get scheduled in the same order they
came free from dependency restrictions.  I don't recall whether that
is true at the moment, or how hard it might be to make it true if it
isn't already.

AIUI, pg_dump sorts items by <object-type, schema, objectname> and thendoes a topological sort to permute this order to reflect dependencies.This is the TOC order parallel restore starts with (unless the order ismucked with by the user via the --use-list option). Each time it needsto schedule an item from the list, it chooses the first one yet to runthat meets both these criteria:


   * all its dependencies have already been restored
   * it has no locking conflicts with a currently running item.

Now, it is common practice to use the table name as a prefix of an indexname, and this will actually cause indexes for a table to be groupedtogether in the TOC list. I think that's why Josh is seeing what he'sseeing. If this holds, then all of the index creations for A will bestarted before any of the indexes for B, even if B's table data finishesrestoring half way through restoring A's indexes. So your speculationabout B's indexes contending with A's is incorrect unless their namessort intermingled.

During development, I did play with changing the TOC order some, butabandoned it, as testing didn't show any obvious gain - if anything thereverse. There are some remnants of this in the code.


cheers

andrew

--
Sent via pgsql-hackers mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] Testing of parallel restore with current snapshot

Reply via email to