From: Jeff Janes [mailto:jeff.ja...@gmail.com]
Sent: Friday, September 12, 2014 4:09 AM
To: Matheus de Oliveira
Cc: Huang, Suya; pgsql-performance@postgresql.org
Subject: Re: [PERFORM] how to change the provoke table in hash join


On Thu, Sep 11, 2014 at 7:09 AM, Matheus de Oliveira 
<matioli.math...@gmail.com> wrote:

On Wed, Sep 10, 2014 at 10:05 PM, Huang, Suya 
<suya.hu...@au.experian.com> wrote:
--plan 1: 10 seconds were spent on the sequential scan of the term_weekly table.

dev=# explain analyze select distinct  cs_id from lookup_weekly  n inner join 
term_weekly s on s.b_id=n.b_id and s.date=n.date where term in ('cat'::text);


                                                                           
QUERY PLAN
-----------------------------------------------------------------------------------------------------------------------------------------------------------------
HashAggregate  (cost=2100211.06..2100211.11 rows=5 width=4) (actual 
time=27095.470..27095.487 rows=138 loops=1)
...

--plan 2: only 1 second is spent on the index scan of the term_weekly table; 
however, since it selects the big table to do the hashing, the hash takes 22 
seconds to complete. The advantage gained from the index has been totally lost 
because of this join order.

                                                                                
        QUERY PLAN
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
HashAggregate  (cost=1429795.17..1429795.22 rows=5 width=4) (actual 
time=22991.289..22991.307 rows=138 loops=1)
...

Am I reading something wrong here? I haven't looked at the whole plan, but the 
second is faster overall, so why do you think you need a hint or need to change 
what the planner chooses? To me it looks like using the index is best for this 
situation. Could you try running both multiple times and taking the min/max/avg 
time of each?

The difference in time could be a caching effect, not a reproducible difference.

The 2nd plan uses 3GB of memory, and there might be better uses for that memory.

Currently memory use is un-costed, other than the "cliff costing" applied once 
the planner thinks a hash will exceed work_mem, which I think is a problem.  
Just because I will let you use 4GB of memory if you will really benefit from 
it doesn't mean you should use 4GB gratuitously.


Suya, what happens if you lower the work_mem setting?  Does it revert to the 
plan you want?
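
A minimal sketch of that experiment, using the query from the original post; 
the 1GB value is just an example starting point, and SET only affects the 
current session, so the server default is untouched:

```sql
-- Lower work_mem for this session only, then re-run the query
-- to see whether the planner switches which side it hashes.
SET work_mem = '1GB';   -- try progressively smaller values

EXPLAIN ANALYZE
SELECT DISTINCT cs_id
FROM lookup_weekly n
INNER JOIN term_weekly s
        ON s.b_id = n.b_id AND s.date = n.date
WHERE term IN ('cat'::text);

RESET work_mem;         -- restore the session to the server default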

Cheers,

Jeff

________________________________

Hey Jeff,

It’s quite interesting: after I reduced work_mem to 1GB, it chose the right 
plan. Also, if I create a temporary table holding the filtered rows and then 
join against it, it also chooses the right plan. Is this a defect of the PG 
optimizer? When doing a hash join, it’s unable to pick the small table as the 
build (hashed) side once the query is complicated (not really that complicated 
in this case).
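
A sketch of that temporary-table workaround, assuming the columns from the 
original query; the temp table name term_cat is hypothetical:

```sql
-- Materialize the small, filtered side first so the planner
-- sees its true (small) size and hashes it.
CREATE TEMPORARY TABLE term_cat AS
SELECT b_id, date
FROM term_weekly
WHERE term = 'cat'::text;

ANALYZE term_cat;  -- give the planner accurate row estimates

EXPLAIN ANALYZE
SELECT DISTINCT cs_id
FROM lookup_weekly n
INNER JOIN term_cat s
        ON s.b_id = n.b_id AND s.date = n.date;
```

With the filtered rows materialized and analyzed, the planner builds the hash 
on the small side, which matches the plan below: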

                                                                                
           QUERY PLAN
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
HashAggregate  (cost=1524294.96..1524295.01 rows=5 width=4) (actual 
time=13409.960..13409.979 rows=138 loops=1)
   ->  Hash Join  (cost=143.25..1524294.94 rows=5 width=4) (actual 
time=10648.440..13409.718 rows=160 loops=1)
         Hash Cond: (((n.b_id)::text = (s.b_id)::text) AND (n.date = s.date))
         ->  Append  (cost=0.00..862153.59 rows=37828460 width=52) (actual 
time=0.006..8152.938 rows=37828459 loops=1)
               ->  Seq Scan on lookup_weekly n  (cost=0.00..0.00 rows=1 
width=524) (actual time=0.000..0.000 rows=0 loops=1)
               ->  Seq Scan on lookup_weekly_20131130 n_1  
(cost=0.00..117764.18 rows=5158718 width=52) (actual time=0.006..743.985 
rows=5158718 loops=1)
               ->  Seq Scan on lookup_weekly_20131207 n_2  
(cost=0.00..117764.18 rows=5158718 width=52) (actual time=0.003..894.061 
rows=5158718 loops=1)
               ->  Seq Scan on lookup_weekly_20131214 n_3  
(cost=0.00..117764.18 rows=5158718 width=52) (actual time=0.008..746.660 
rows=5158718 loops=1)
               ->  Seq Scan on lookup_weekly_20131221 n_4  
(cost=0.00..117764.18 rows=5158718 width=52) (actual time=0.004..750.305 
rows=5158718 loops=1)
               ->  Seq Scan on lookup_weekly_20131228 n_5  
(cost=0.00..117764.18 rows=5158718 width=52) (actual time=0.004..741.233 
rows=5158718 loops=1)
               ->  Seq Scan on lookup_weekly_20140426 n_6  (cost=0.00..91715.42 
rows=4042442 width=52) (actual time=0.010..595.792 rows=4042442 loops=1)
               ->  Seq Scan on lookup_weekly_20140503 n_7  (cost=0.00..93516.49 
rows=4118149 width=52) (actual time=0.009..598.208 rows=4118149 loops=1)
               ->  Seq Scan on lookup_weekly_20140329 n_8  (cost=0.00..88100.78 
rows=3874278 width=52) (actual time=0.004..574.846 rows=3874278 loops=1)
         ->  Hash  (cost=142.77..142.77 rows=32 width=61) (actual 
time=0.924..0.924 rows=553 loops=1)
               Buckets: 1024  Batches: 1  Memory Usage: 43kB
               ->  Append  (cost=0.00..142.77 rows=32 width=61) (actual 
time=0.031..0.752 rows=553 loops=1)
                     ->  Seq Scan on term_weekly s  (cost=0.00..0.00 rows=1 
width=520) (actual time=0.000..0.000 rows=0 loops=1)
                           Filter: (term = 'cat'::text)
                     ->  Index Scan using idx_term_weekly_20140503_3 on 
term_weekly_20140503 s_1  (cost=0.56..36.70 rows=8 width=46) (actual 
time=0.031..0.225 rows=166 loops=1)
                           Index Cond: (term = 'cat'::text)
                     ->  Index Scan using idx_term_weekly_20140510_3 on 
term_weekly_20140510 s_2  (cost=0.56..36.70 rows=8 width=46) (actual 
time=0.023..0.192 rows=152 loops=1)
                           Index Cond: (term = 'cat'::text)
                     ->  Index Scan using idx_term_weekly_20140517_3 on 
term_weekly_20140517 s_3  (cost=0.56..36.70 rows=8 width=46) (actual 
time=0.022..0.176 rows=135 loops=1)
                           Index Cond: (term = 'cat'::text)
                     ->  Index Scan using idx_term_weekly_20140524_3 on 
term_weekly_20140524 s_4  (cost=0.56..32.68 rows=7 width=46) (actual 
time=0.016..0.126 rows=100 loops=1)
                           Index Cond: (term = 'cat'::text)
Total runtime: 13410.097 ms

Thanks,
Suya
