On Sat, 6 Apr 2024 at 22:21, Andres Freund <and...@anarazel.de> wrote:
> The small regression for small results is still kinda visible, I haven't yet
> tested the patch downthread.

Thanks a lot for the faster test script, I'm impatient too. I still saw the small regression with David's patch. Here's a v6 where I think it is now gone.

I added inline to internal_putbytes too. I think that helped especially because two of the calls to internal_putbytes pass a constant len (1 and 4) that is smaller than PqSendBufferSize. For those calls the compiler can now statically eliminate the new code path, because "len >= PqSendBufferSize" is known to be false at compile time.

I also incorporated all of Ranier's comments.
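
To illustrate the inlining argument, here is a minimal standalone sketch. It is not the code from the patch: SEND_BUFFER_SIZE, put_bytes, flush_buffer and put_message_header are hypothetical stand-ins, the buffer size is assumed to be a compile-time constant, and the "socket" is just a byte counter. The point is only that, once put_bytes is inlined into a caller that passes len = 1 or len = 4, the "len >= SEND_BUFFER_SIZE" branch can be folded away by the compiler.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-in for the send buffer size; assumed to be a constant. */
#define SEND_BUFFER_SIZE 8192

static char   send_buffer[SEND_BUFFER_SIZE];
static size_t send_used = 0;      /* bytes currently held in send_buffer */
static size_t bytes_flushed = 0;  /* total bytes "sent"; stands in for the socket */
static size_t direct_sends = 0;   /* how often the large-write path was taken */

/* Pretend to write the buffered bytes to the socket. */
static void
flush_buffer(void)
{
    bytes_flushed += send_used;
    send_used = 0;
}

/*
 * Because this function is inline, a call with a small constant len lets the
 * compiler prove that the first branch is never taken and drop it entirely.
 */
static inline void
put_bytes(const char *data, size_t len)
{
    assert(data != NULL || len == 0);

    if (len >= SEND_BUFFER_SIZE)
    {
        /* Large write: flush what is pending, then send without copying. */
        flush_buffer();
        bytes_flushed += len;
        direct_sends++;
        return;
    }

    /* Small write: copy into the buffer, flushing whenever it fills up. */
    while (len > 0)
    {
        size_t space;
        size_t n;

        if (send_used == SEND_BUFFER_SIZE)
            flush_buffer();

        space = SEND_BUFFER_SIZE - send_used;
        n = (len < space) ? len : space;
        memcpy(send_buffer + send_used, data, n);
        send_used += n;
        data += n;
        len -= n;
    }
}

/* Both calls pass a constant len (1 and 4) that is below SEND_BUFFER_SIZE. */
static void
put_message_header(char msgtype, const char len_bytes[4])
{
    put_bytes(&msgtype, 1);
    put_bytes(len_bytes, 4);
}
```

Whether a given compiler actually removes the branch depends on the optimization level and on the size really being a constant it can see, so the generated assembly is the thing to check.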
v6-0001-Faster-internal_putbytes.patch
Description: Binary data