Re: [PERFORM] Severe performance problems for simple query

Matthew Mon, 07 Apr 2008 10:03:43 -0700

On Mon, 7 Apr 2008, Heikki Linnakangas wrote:

In that case, a regular index on (ipFrom, ipTo) should work just fine, andthat's what he's got. Actually, an index on just ipFrom would probably workjust as well. The problem is that the planner doesn't know about that specialrelationship between ipFrom and ipTo. Perhaps it could be hinted byexplicitly specifying "AND ipTo > ipFrom" in the query?

Actually, the problem is that the database doesn't know that the entriesdon't overlap. For all it knows, you could have data like this:


0               10
10              20
20              30
... ten million rows later
100000030       100000040
100000040       100000050
0               100000050

So say you wanted to search for the value of 50,000,000. The index onipFrom would select five million rows, all of which then have to befiltered by the constraint on ipTo. Likewise, an index on ipTo wouldreturn five million rows, all of which then have to be filtered by theconstraint on ipFrom. If you just read the index and took the closestentry to the value, then you would miss out on the last entry whichoverlaps with the whole range. An R-tree on both fields will correctlyfind the small set of entries that are relevant.

It would be very cool to be able to create an R-tree index that would justmake the original query run fast without needing alteration. I had a lookat this a while back, but it is not currently possible in GiST, becauseonly one field is handed to the index at a time. So all the current R-treeimplementations require that you generate an object containing the twovalues, like the box, and then index that.


Something for 8.4?

Matthew

--
$ rm core
Segmentation Fault (core dumped)

--
Sent via pgsql-performance mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-performance

Re: [PERFORM] Severe performance problems for simple query

Reply via email to