Re: [GENERAL] Non-unique index performance

Sezai YILMAZ Fri, 24 Jun 2005 03:26:04 -0700

Richard Huxton wrote:

Sezai YILMAZ wrote:
Hello!
I have a table eith name person as described below. It has an uniqueindex for id column (it is also primary key) and has an index forparent column.
If I run a query with where clause on id column it uses the index(look at the first explain analyze result; it says "Index Scanusing...") and the query for 582856 rows table results in 225,893 ms.
But, if I run another query with where clause on parent column itdoes not use the index (look at the second explain analyze result; itsays "Seq Scan using...") and the query for 582856 rows table resultsin 11192.913 ms.
Why the difference of both queries is so dramatical for unique andnon-unique indexed columns? Why PostgreSQL does not use thenon-unique indexes (it says that it does sequential scan)?
Because it thinks it will be faster/cheaper.
I have to use an index on non-unique column. What is the solution forthat? Is there a way to speed up non-unique indexes?
You don't want to force it to use an index, you want it to make betterestimates. Let's have a look...
*****************************************************************
test=> \d person
          Table "public.person"
Column  |         Type          | Modifiers
---------+-----------------------+-----------
name    | character varying(30) |
surname | character varying(30) |
id      | integer               | not null
parent  | integer               |
Indexes:
   "person_pkey" primary key, btree (id)
   "parent_ndx" btree (parent)
OK - all very simple. And you've said there are about 580,000 rows.
test=> explain analyze select id,name from person where id in('17201', '338191', '244319', '515209', '20415');
Why are you quoting integers?

I qouted them to use indexes. The other method is type casting thevalues to indexed column type. I prefer the quoting method.

test=> explain analyze select * from person where parent in ('17201','338191', '244319', '515209', '20415');
                                                  QUERY PLAN
-----------------------------------------------------------------------------------------------------------------Seq Scan on person (cost=0.00..35871.26 rows=14427 width=32) (actualtime=0.063..11192.809 rows=5 loops=1)Filter: ((parent = 17201) OR (parent = 338191) OR (parent = 244319)OR (parent = 515209) OR (parent = 20415))
Total runtime: 11192.913 ms
Hmm - for some reason it's expecting 14427 rows to be returned. Ifthere were that many matches, then it might well be a better choicethan going back and fore between the index and the table all the time.
So - we need to find out why it thinks there will be so many rowsreturned.
1. VACUUM FULL ANALYSE person;
2. re-run the explain


This solved the problem. Now it takes about 213 ms.

Thanks in advance.

--
sy


---------------------------(end of broadcast)---------------------------
TIP 2: you can get off all lists at once with the unregister command
   (send "unregister YourEmailAddressHere" to [EMAIL PROTECTED])

Re: [GENERAL] Non-unique index performance

Reply via email to