Re: [HACKERS] [PATCH] pgbench --throttle (submission 7 - with lag measurement)

Greg Smith Mon, 15 Jul 2013 10:20:00 -0700

To clarify what state this is all in: Fabien's latestpgbench-throttle-v15.patch is the ready for a committer version. Thelast two revisions are just tweaking the comments at this point, and hisversion is more correct than my last one.

My little pgbench-delay-finish-v1.patch is a brand new bug fix of sortsthat, while trivial, isn't ready for commit yet. I'll start a separatee-mail thread and CF entry for that later. Fabien has jumped intoinitial review comments of that already below, but the throttle featureisn't dependent on that. The finish delay just proves that the latencyspikes I was getting here aren't directly tied to the throttle feature.


On 7/14/13 5:42 PM, Fabien COELHO wrote:

* ISTM that the impact of the chosen 1000 should appear somewhere.

I don't have a problem with that, but I didn't see that the little tableyou included was enough to do that. I think if someone knows how thistype of random generation works, they don't need the comment to analyzethe impact. And if they don't know, that comment alone wasn't enough tohelp them figure it out. That's why I added some terms that might helppoint the right way for someone who wanted to search for moreinformation instead.

The text of pgbench is not really the right place to put a lecture abouthow to generate numbers with a target probability distribution function.Normally the code comments tries to recommend references for this sortof thing instead. I didn't find a really good one in a quick search though.

About 123456 12345 vs 123456.012345: My data parser is usually "gnuplot"
or "my eyes", and in both cases the later option is better:-)

pgbench-tools uses gnuplot too. If I were doing this again today fromscratch, I would recommend using the epoch time format compatible withit you suggested. I need to look into this whole topic a little morebefore we get into that though. This patch just wasn't the right placeto get into that change.

About the little patch: Parameter "ok" should be renamed to something
meaningful (say "do_finish"?).

It's saying if the connection finished "ok" or not. I think exactlywhat is done with that information is an implementation detail thatdoesn't need to be exposed to the function interface. We might changehow this is tied to PQfinish later.

Also, it seems that when timer is
exceeded in doCustom it is called with true, but maybe you intended that
it should be called with false??

The way timeouts are handled right now is a known messy thing. Exactlywhat you should do with a client that has hit one isn't obvious. Tryagain? Close the connection and abort? The code has a way it handlesthat now, and I didn't want to change it any.

it is important to remove the connection because it serves as a marker
to know whether a client must run or not.

This little hack moved around how clients finished enough to get rid ofthe weird issue with your patch I was bothered by. You may be rightthat the change isn't really correct because of how the connection iscompared to null as a way to see if it's active. I initially added amore complicated "finished" state to the whole mess that tracked thismore carefully. I may need to return to that idea now.


--
Greg Smith   2ndQuadrant US    [email protected]   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support www.2ndQuadrant.com


--
Sent via pgsql-hackers mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] [PATCH] pgbench --throttle (submission 7 - with lag measurement)

Reply via email to