[HACKERS] prefetching and asynchronous io

johnlumby Mon, 10 Sep 2012 22:52:00 -0700

On 08/18/2012 10:11 AM, John Lumby wrote:
>

I've recently tried extending the postgresql prefetch mechanism on linux
to use the posix (i.e. librt)
aio_read and friends where possible.     In other words,   in
PrefetchBuffer(),   try getting a buffer
and issuing aio_read before falling back to fposix_advise().     It
gives me about 8% improvement
in throughput relative to the fposix-advise variety, for a workload of
16 highly-disk-read-intensive applications running to 16 backends.
For my test each application runs a query chosen to have plenty of
bitmap heap scans.


I can provide more details on my changes if interested.

On whether this technique might improve sort performance  :

First,  the disk access pattern for sorting is mostly sequential
(although I think
the sort module does some tricky work with reuse of pages in its
"logtape" files
which maybe is random-like),  and there are several claims on the net
that linux buffered file  handling
already does a pretty good job of read-ahead for a sequential access
pattern
without any need for the application to help it.
I can half-confirm that in that I tried adding calls to PrefetchBuffer
in regular heap scan
and did not see much improvement.    But I am still pursuing that area.

But second,  it would be easy enough to add some fposix_advise calls to
sort and see whether
that helps.    (Can't make use of PrefetchBuffer since sort does not use
the regular relation buffer pool)


I have also added prefetching calls to regular index scans
(non-bitmap,  non-index-only,  for btree only)
and see a 25% reduction in total elapsed time for a heavy
index-scan workload.     That 25% is with just the
basic posix_fadvise,  and then extending that with asynch io
gives a small extra improvement again.

John


--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

[HACKERS] prefetching and asynchronous io

Reply via email to