Hello Heikki,

About two patches I submitted:


pgbench - allow backslash-continuations in custom scripts

> Everyone wants the feature, using multi-line SELECTs in pgbench scripts, but we don't seem to be reaching a consensus on how it should work. I think we'll need to integrate the lexer, but it would be nice to still support multi-statements as well, with some syntax.
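For context, the feature is about being able to write something like this in a custom script (purely illustrative, the exact continuation syntax being the point under discussion):

    \setrandom aid 1 100000
    SELECT abalance \
      FROM pgbench_accounts \
      WHERE aid = :aid;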

I was willing to do a small job on that one, but it drifted into solving the meta-problem of pgbench lexing so as to do it "the right way". Kyotaro HORIGUCHI has taken up the challenge and submitted a 3-part patch which modifies the psql lexer and then reuses it "as is" in pgbench. I'm impressed:-)

I think that the approach is a little overkill for this simple, targeted feature, and I'm not sure that it allows for multi-statements, but I have not checked. Anyway, I may try to review it, although not in the short term. I'm not too optimistic: if it goes in, the psql lexer would be used in two contexts, so a change in either one would require checking that it does not break the other, and I'm not sure that such a constraint is desirable. Duplicating the lexer is not a realistic option either; I do not think pgbench is worth it.

Anyway, in the meantime, the patch may be switched to "returned with feedback" or "rejected".



checkpoint continuous flushing

> This does a big memory allocation at checkpoint, which Tom vehemently objects to. I don't much like it either, although I would be OK with a more moderately-sized allocation.

AFAICS Tom has not expressed any view on the patch in its current form (there is no message from Tom on the thread).


ISTM that Tom complained, about a year ago, about OOM risks in a 2007 version of the sorting part by Takahiro ITAGAKI, which dynamically allocated and freed about 24 bytes per buffer on each checkpoint.

The current v5 version uses 4 bytes per buffer: the memory is allocated once on the first run and then reused, so there is no risk of returning it and then failing to get it back, hence no significant OOM risk. Maybe the allocation should be moved earlier, to when the checkpointer process starts, though.
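For instance, with shared_buffers = 8GB, that is about one million 8kB buffers, so roughly 4MB allocated once for the life of the checkpointer.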

A second objection from Tom a year ago was about proof of the performance gains. I've spent quite some time collecting data to measure the benefits under different loads, representing X00 hours of pgbench runs, as can be seen in the thread. ISTM that the performance (tps & latency) figures, for instance:

http://www.postgresql.org/message-id/raw/alpine.DEB.2.10.1506170803210.9794@sto

are overwhelmingly in favor of the patch.


If the memory requirement of 4 bytes per buffer is still considered too much, it would be possible to reduce it with a GUC specifying the allowed amount of memory, plus some shuffling in the code to process things chunk by chunk.
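For instance, with a 1MB limit, the buffers of a checkpoint would be sorted and written in chunks of about 256k buffer numbers (1MB / 4 bytes) at a time.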

My opinion is that 4 bytes per buffer is reasonable enough given the measured benefits. Also, the benefit is greater when the whole checkpoint is sorted rather than just part of it.

I'm really willing to improve the write-stall issues which freeze PostgreSQL when checkpointing on HDDs, so if chunking is required because the allocation is a blocker I'll make the effort, but I do not think that it is useful as such.


> It's not clear on what criteria this should be accepted or rejected.

Given the overall performance gains and the reduction in latency, with many write stalls avoided or at least greatly reduced, I would be sad for pg if it got rejected. That does not mean that it cannot be improved.

What workloads need to be tested?

If you tell me, and provide a matching dedicated host for testing, I can run tests...

--
Fabien.


