Re: [HACKERS] checkpointer continuous flushing - V18

Fabien COELHO Mon, 07 Mar 2016 12:11:46 -0800


Hello Andres,

(1) with 16 tablespaces (1 per table) on 1 disk : 680.0 tps
   per second avg, stddev [ min q1 median d3 max ] <=300tps
   679.6 ± 750.4 [0.0, 317.0, 371.0, 438.5, 2724.0] 19.5%

(2) with 1 tablespace on 1 disk : 956.0 tps
   per second avg, stddev [ min q1 median d3 max ] <=300tps
   956.2 ± 796.5 [3.0, 488.0, 583.0, 742.0, 2774.0] 2.1%

Well, that's not a particularly meaningful workload. You increased thenumber of flushed to the same number of disks considerably.

It is just a simple workload designed to emphasize the effect of havingone context shared for all table space instead of on per tablespace,without rewriting the patch and without a large host with multiple disks.

For a meaningful comparison you'd have to compare using one writebackcontext for N tablespaces on N separate disks/raids, and using Nwriteback contexts for the same.

Sure, it would be better to do that, but that would require (1) rewritingthe patch, which is a small work, and also (2) having access to a machinewith a number of disks/raids, that I do NOT have available.

What happens in the 16 tb workload is that much smaller flushes areperformed on the 16 files writen in parallel, so the tps performance issignificantly degraded, despite the writes being sorted in each file. Onone tb, all buffers flushed are in the same file, so flushes are much moreeffective.

When the context is shared and checkpointer buffer writes are balancedagainst table spaces, then when the limit is reached the flushing gets fewbuffers per tablespace, so this limits sequential writes to few buffers,hence the performance degradation.

So I can explain the performance degradation *because* the flush contextis shared between the table spaces, which is a logical argument backedwith experimental data, so it is better than handwaving. Given theavailable hardware, this is the best proof I can have that context shouldbe per table space.

Now I cannot see how having one context per table space would have asignificant negative performance impact.

So the logical conclusion for me is that without further experimental datait is better to have one context per table space.

If you have a hardware with plenty disks available for testing, that wouldprovide better data, obviously.


--
Fabien.
--
Sent via pgsql-hackers mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] checkpointer continuous flushing - V18

Reply via email to