Re: [HACKERS] PATCH: pgbench - merging transaction logs

Fabien COELHO Mon, 23 Mar 2015 09:25:57 -0700


Hello,

Yes but for a third thread (each on a physical core) it will be 1/40 +
1/40 and so on up to roughly 40/40 for 40 cores.


That is why I proposed a formula which depends on the number of threads.
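
The formula itself is not restated in this message. Purely as an
illustration of the arithmetic quoted above, and assuming that each thread
holds the lock independently for a fixed fraction f of its time (1/40 in
the quoted example), the chance that a thread finds the lock already taken
could be sketched as below. The function names are hypothetical and are
not part of pgbench.

```c
#include <assert.h>

/*
 * Illustrative only: probability that one thread finds the lock held by
 * at least one of the other (nthreads - 1) threads, ASSUMING each of them
 * holds it independently for a fraction f of the time.
 */
double
collision_probability(int nthreads, double f)
{
    double all_free = 1.0;   /* probability that no other thread holds it */
    int    i;

    assert(f >= 0.0 && f <= 1.0);
    for (i = 1; i < nthreads; i++)
        all_free *= 1.0 - f;
    return 1.0 - all_free;
}

/*
 * The simple sum from the quoted text (f + f + ...), capped at 1.
 * It is an upper bound on collision_probability().
 */
double
collision_upper_bound(int nthreads, double f)
{
    double bound;

    assert(f >= 0.0 && f <= 1.0);
    bound = (nthreads > 1) ? (nthreads - 1) * f : 0.0;
    return (bound > 1.0) ? 1.0 : bound;
}
```

With f = 1/40, the sum grows linearly with the number of threads, while
the independent-threads estimate stays below it. Real collision rates may
differ if the threads are not independent.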

[...] But they aren't constant, only close. It may or may not show up in
this case, but I've noticed that often the collision rate is a lot higher
than the probability would suggest, I'm not sure why,

If so, I would suggest that the probability is wrong, and try to
understand why :-)

Moreover they will write to the same cache lines for every fprintf
and this is very very bad even without atomic operations.


We're talking of transactions that involve network messages and possibly
disk IOs on the server, so some cache issues within pgbench would not
be a priori the main performance driver.

Sure, but:
- good measurement is hard, and adding locking in fprintf makes
its timing more noisy.

This really depends on the probability of lock collisions. If it is
small enough, the impact would be negligible.

- it's against 'good practices' for scalable code.
Trivial code can show that the elapsed time for as few as four cores
writing to the same cache line in a loop, without locking or
synchronization, is greater than the elapsed time for running these four
loops sequentially on one core. If they write to different cache lines it
scales linearly.

I'm not arguing about general scalability principles, which may or may
not be relevant to the case at hand.

I'm discussing whether the proposed feature can be implemented much more
simply with a mutex instead of the current proposal, which is on the heavy
side and thus induces more maintenance effort later.

Now I agree that if there is a mutex it must be held as briefly as
possible and must not hinder performance significantly for pertinent use
cases. Note that the overhead evaluation by Tomas is pessimistic, as it
only involves read-only transactions for which all transaction details
are logged. Note also that if you have 1000 cores to run pgbench and
locking may be an issue, you could still use the per-thread logs.

The current discussion suggests that each thread should prepare the string
off-lock (say with some sprintf) and then only lock when sending the
string. This looks reasonable, but still needs to be validated (i.e. that
the lock time would indeed be very small wrt the transaction time).
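
A minimal sketch of that idea, assuming POSIX threads and a single shared
log file, could look as follows. The function name, the mutex and the
record layout are hypothetical: this is neither the actual pgbench code
nor its log format.

```c
#include <assert.h>
#include <pthread.h>
#include <stdio.h>

/* Hypothetical lock protecting the shared log file (not a pgbench symbol). */
static pthread_mutex_t log_mutex = PTHREAD_MUTEX_INITIALIZER;

/*
 * Format one record into a local buffer without holding the lock, then
 * take the lock only for the write itself.
 * Returns the number of bytes written, or -1 on error.
 */
int
log_transaction(FILE *logfile, int client_id, long txn_count,
                double latency_us)
{
    char   buf[256];
    int    len;
    size_t written;

    assert(logfile != NULL);

    /* Off-lock: the (comparatively slow) formatting touches no shared state. */
    len = snprintf(buf, sizeof(buf), "%d %ld %.0f\n",
                   client_id, txn_count, latency_us);
    if (len < 0 || (size_t) len >= sizeof(buf))
        return -1;

    /* On-lock: only the copy into the stdio buffer is serialized. */
    pthread_mutex_lock(&log_mutex);
    written = fwrite(buf, 1, (size_t) len, logfile);
    pthread_mutex_unlock(&log_mutex);

    return (written == (size_t) len) ? len : -1;
}
```

Each thread would call log_transaction() once per logged transaction.
Whether the time spent under the lock is really negligible wrt the
transaction time would still have to be measured.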


--
Fabien.


--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers
