Marti Raudsepp <[email protected]> writes: > On Mon, May 28, 2012 at 10:32 PM, Marti Raudsepp <[email protected]> wrote: >> There was a similar case in 9.0.4 with WHERE i=1, but that has been >> fixed in 9.0.7
> Oh, it's been fixed in 9.0.7, but apparently not in 8.4.11; the empty > parent tables are confusing the estimate: Hmm ... what your test case seems to be exhibiting is related to this: Author: Tom Lane <[email protected]> Branch: master [f3ff0433a] 2011-07-14 17:30:57 -0400 Branch: REL9_1_STABLE Release: REL9_1_0 [cf8245285] 2011-07-14 17:31:12 -0400 Branch: REL9_0_STABLE Release: REL9_0_5 [0dd46a776] 2011-07-14 17:31:25 -0400 In planner, don't assume that empty parent tables aren't really empty. There's a heuristic in estimate_rel_size() to clamp the minimum size estimate for a table to 10 pages, unless we can see that vacuum or analyze has been run (and set relpages to something nonzero, so this will always happen for a table that's actually empty). However, it would be better not to do this for inheritance parent tables, which very commonly are really empty and can be expected to stay that way. Per discussion of a recent pgsql-performance report from Anish Kejariwal. Also prevent it from happening for indexes (although this is more in the nature of documentation, since CREATE INDEX normally initializes relpages to something nonzero anyway). Back-patch to 9.0, because the ability to collect statistics across a whole inheritance tree has improved the planner's estimates to the point where this relatively small error makes a significant difference. In the referenced report, merge or hash joins were incorrectly estimated as cheaper than a nestloop with inner indexscan on the inherited table. That was less likely before 9.0 because the lack of inherited stats would have resulted in a default (and rather pessimistic) estimate of the cost of a merge or hash join. However, the error in your original example is far too large to be explained by that, so I think it was tripping over something different. When I run your test case in 8.4, I get Nested Loop (cost=0.00..1276.12 rows=28168 width=4) Join Filter: (public.a_parent.i = public.b_parent.i) -> Append (cost=0.00..62.57 rows=14 width=4) -> Seq Scan on a_parent (cost=0.00..46.00 rows=12 width=4) Filter: ((i >= 1) AND (i <= 2)) -> Index Scan using a1_i_idx on a_child1 a_parent (cost=0.00..8.29 rows=1 width=4) ... -> Append (cost=0.00..56.63 rows=2404 width=4) -> Seq Scan on b_parent (cost=0.00..34.00 rows=2400 width=4) -> Index Scan using b1_i_idx on b_child1 b_parent (cost=0.00..11.34 rows=2 width=4) ... and that join size estimate is not too out of line if you accept the admittedly-bogus numbers for the appendrel sizes. There seems to be something else going on in your original example. regards, tom lane -- Sent via pgsql-hackers mailing list ([email protected]) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
