Re: [HACKERS] random_page_cost vs seq_page_cost

Greg Smith Tue, 07 Feb 2012 14:06:58 -0800

On 02/07/2012 03:23 PM, Bruce Momjian wrote:

Where did you see that there will be an improvement in the 9.2
documentation?  I don't see an improvement.

I commented that I'm hoping for an improvement in the documentation of how much timing overhead impacts attempts to measure this area better. That's from the "add timing of buffer I/O requests" feature submission. I'm not sure if Bene read too much into that or not; I didn't mean to imply that the docs around random_page_cost have gotten better.

This particular complaint is extremely common though, seems to pop up on one of the lists a few times each year. Your suggested doc fix is fine as a quick one, but I think it might be worth expanding further on this topic. Something discussing SSDs seems due here too. Here's a first draft of a longer discussion, to be inserted just after where it states the default value is 4.0:

True random access to mechanical disk storage will normally be more expensive than this default suggests. The value used is lower to reflect caching effects. Some common random accesses to disk, such as indexed reads, are considered likely to be in cache. The default value can be thought of as modeling random access as 40 times as expensive as sequential, while expecting that 90% of random reads will actually be cached.

If you believe a high cache rate is an incorrect assumption for your workload, you might increase random_page_cost to more closely reflect the true cost of random reads against your storage. Correspondingly, if your data is likely to be completely cached, such as when the database is smaller than the total memory in the server, decreasing random_page_cost can be appropriate. For storage where the true cost of random reads is low, such as solid-state drives and similar memory-based devices, lower values of random_page_cost might also better reflect the real-world cost of that operation.

===

I think of the value as being more like 80 times as expensive and a 95% hit rate, but the above seems more likely to turn into understandable math to a first-time reader of this section. I stopped just short of recommending a value for the completely cached case. I normally use 1.01 there; I know others prefer going fully to 1.0 instead. That argument seems like it could rage on for some time.
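
For anyone who wants to check the arithmetic, here is a minimal Python sketch of how both pairs of numbers land on the 4.0 default. It assumes a cached random read costs essentially nothing, which is a simplification needed to make the figures work out and not something the planner itself computes. The function name and its parameters are hypothetical, made up for this illustration.

```python
def effective_random_page_cost(uncached_cost, cache_hit_rate, cached_cost=0.0):
    """Blend cached and uncached random-read costs.

    Costs are expressed relative to one sequential page fetch.
    cached_cost=0.0 is an assumption of this sketch: a cache hit is
    treated as free.
    """
    miss_rate = 1.0 - cache_hit_rate
    return cache_hit_rate * cached_cost + miss_rate * uncached_cost


# 40x as expensive with a 90% hit rate, as in the draft text
draft_model = effective_random_page_cost(40.0, 0.90)

# 80x as expensive with a 95% hit rate, the alternative view
alternative_model = effective_random_page_cost(80.0, 0.95)

# Both come out at approximately 4.0 (floating point rounding aside)
print(round(draft_model, 2), round(alternative_model, 2))
```

If a cache hit is instead charged the same as a sequential page (cached_cost=1.0), the 40x / 90% case comes out near 4.9 rather than 4.0, so the zero-cost assumption matters to the result.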


--
Greg Smith   2ndQuadrant US    g...@2ndquadrant.com   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support www.2ndQuadrant.com


--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers
