Re: [GENERAL] Forcing the right queryplan

Yeb Havinga Fri, 03 Sep 2010 00:18:36 -0700

Henk van Lingen wrote:

Now there are two types of query plans:
syslog=# explain SELECT id, devicereportedtime, facility, priority, fromhost, syslogtag, infounitid, message FROM systemevents WHERE ( ( to_tsvector('english', message) @@ to_tsquery ( '131.211.112.9')) ) ORDER BY id DESC LIMIT 100; QUERY PLANLimit (cost=0.00..10177.22 rows=100 width=159)
   ->  Index Scan Backward using systemevents_pkey on systemevents  (cost=0.00..
1052934.86 rows=10346 width=159)
         Filter: (to_tsvector('english'::regconfig, message) @@ to_tsquery('131.
211.112.9'::text))
(3 rows)

This one is useless (takes very long). However this one:

Hello Henk,

I saw your other mail today, I'm replying on this one for better formatting.

With a limit of 100 the planner guesses it will find 100 matching rowswithin some cost. At 500 rows the cost is higher than that of the secondplan:

syslog=# explain SELECT id, devicereportedtime, facility, priority, fromhost, 
syslogtag, infounitid, message FROM systemevents WHERE (  ( 
to_tsvector('english', message) @@ to_tsquery ( '131.211.112.9')) )  ORDER BY 
id DESC LIMIT 500;

QUERY PLAN--------------------------------------------------------------------------------

-----------------------------------
 Limit  (cost=40928.89..40930.14 rows=500 width=159)
   ->  Sort  (cost=40928.89..40954.76 rows=10346 width=159)
         Sort Key: id
         ->  Bitmap Heap Scan on systemevents  (cost=2898.06..40413.36 rows=1034
6 width=159)
               Recheck Cond: (to_tsvector('english'::regconfig, message) @@ to_t
squery('131.211.112.9'::text))
               ->  Bitmap Index Scan on msgs_idx  (cost=0.00..2895.47 rows=10346
 width=0)

Index Cond: (to_tsvector('english'::regconfig, message) @@to_tsquery('131.211.112.9'::text))

(7 rows)

works acceptable.

How to use the right plan regardless of the 'LIMIT-size'?

The planner obviously thinks it will have read 100 rows fromsystemevents backwards earlier than it actually does, with the whereclause that contains the scanning for string 131.211.112.9. Increasingthe stats target in this case will probably not help, since thestatistics will not contain selectivity for all possible ts queries.

If the index is useless anyway, you might consider dropping it.Otherwise, increasing random_page_cost might help in choosing theotherplan, but on the other hand that plan has index scanning too, soI'm not to sure there.

If that doesn't help, it would be interesting to see some output ofvmstat 1 (or better: iostat -xk 1) to see what is the bottleneck duringexecution of the first plan. If it is IO bound, you might want toincrease RAM or add spindles for increased random io performance. If itis CPU bound, it is probably because of executing the to_tsvectorfunction. In that case it might be interesting to see if changingts_vectors cost (see ALTER FUNCTION ... COST .../http://developer.postgresql.org/pgdocs/postgres/sql-alterfunction.html)again helps the planner to favor the second plan over the first.


regards,
Yeb Havinga


--
Sent via pgsql-general mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general

Re: [GENERAL] Forcing the right queryplan

Reply via email to