Re: [PERFORM] limit clause breaks query planner?

Matthew Wakeling Thu, 04 Sep 2008 09:20:50 -0700

On Thu, 4 Sep 2008, Guillaume Cottenceau wrote:

> It seems to me that if the correlation is 0.99, and you're
> looking for less than 1% of rows, the expected rows may be at the
> beginning or at the end of the heap?

Not necessarily. Imagine for example that you have a table with 1M rows,
and one of the fields has unique values from 1 to 1M, and the rows are
ordered in the table by that field. So the correlation would be 1. If you
were to SELECT from the table WHERE the field = 500000 LIMIT 1, then the
database should be able to work out that the rows will be right in the
middle of the table, not at the beginning or end. It should set the
startup cost of a sequential scan to the amount of time required to
sequential scan half of the table.
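
To put numbers on that example, here is a rough Python sketch. It is only a
toy model of the reasoning, not how the PostgreSQL planner actually works;
the function names are made up for illustration, and it assumes unique,
evenly spaced values stored in perfectly sorted order (correlation 1).

```python
def rows_scanned_forward(table, target):
    """Count rows a front-to-back scan reads before LIMIT 1 is satisfied."""
    for rows_read, value in enumerate(table, start=1):
        if value == target:
            return rows_read
    return len(table)  # no match: the whole table was read


def estimated_startup_fraction(target, min_value, max_value):
    """Guess where the match sits in the heap from where it sits in the
    value range. Only valid under the assumptions stated above."""
    return (target - min_value) / (max_value - min_value)


# Hypothetical table: 1M rows, unique values 1..1M, stored in that order.
table = range(1, 1_000_001)

scanned = rows_scanned_forward(table, 500_000)
fraction = estimated_startup_fraction(500_000, 1, 1_000_000)

print(scanned, "rows read before the first match")
print(f"estimated startup fraction of the scan: {fraction:.3f}")
```

Under those assumptions the scan reads about half the table before it can
return anything, which is the startup cost I am describing.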

Of course, this does bring up a point - if the matching rows are
concentrated at the end of the table, the database could perform a
sequential scan backwards, or even a scan from the middle of the table
onwards.
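
The choice of direction could be sketched like this, again as a toy model
under the same assumptions. The function name and the idea of costing by
row count alone are mine, purely for illustration; nothing here corresponds
to an existing planner feature.

```python
def pick_scan_direction(position_fraction, total_rows):
    """Pick the scan direction that reaches the first match soonest.

    position_fraction is the estimated location of the matching rows in
    the heap: 0.0 means the very start, 1.0 means the very end.
    Returns (direction, rows expected to be read before the first match).
    """
    forward_cost = position_fraction * total_rows
    backward_cost = (1.0 - position_fraction) * total_rows
    if forward_cost <= backward_cost:
        return "forward", forward_cost
    return "backward", backward_cost


# Matching rows estimated to sit 99% of the way through a 1M row table.
direction, cost = pick_scan_direction(0.99, 1_000_000)
print(direction, round(cost))
```

A scan starting from the middle would be a third option: begin at the
estimated position and accept that the estimate may be slightly off.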

This improvement of course only actually helps if the query has a LIMIT
clause, and presumably would muck up simultaneous sequential scans.


Matthew

--
Picard: I was just paid a visit from Q.
Riker:  Q! Any idea what he's up to?
Picard: No. He said he wanted to be "nice" to me.
Riker:  I'll alert the crew.
