Re: [HACKERS] Aggregates push-down to partitions

Maksim Milyutin Thu, 09 Nov 2017 10:51:12 -0800

Hi Konstantin!


09.11.17 20:14, Konstantin Knizhnik wrote:

It is still far from an ideal plan, because each worker is working with all partitions, instead of splitting the partitions between workers and calculating partial aggregates for each partition.
But if we add a foreign table (FDW) as a child of the parent table, then a parallel scan cannot be used and we get the worst possible plan:
postgres=# create foreign table derived_fdw() inherits(base) server pg_fdw options (table_name 'derived1');
CREATE FOREIGN TABLE
postgres=# explain select sum(x) from base;
                                    QUERY PLAN
----------------------------------------------------------------------------------
 Aggregate  (cost=34055.07..34055.08 rows=1 width=8)
   ->  Append  (cost=0.00..29047.75 rows=2002926 width=4)
         ->  Seq Scan on base  (cost=0.00..0.00 rows=1 width=4)
         ->  Seq Scan on derived1  (cost=0.00..14425.00 rows=1000000 width=4)
         ->  Seq Scan on derived2  (cost=0.00..14425.00 rows=1000000 width=4)
         ->  Foreign Scan on derived_fdw  (cost=100.00..197.75 rows=2925 width=4)
(6 rows)
So we sequentially pull all data to this node and compute the aggregates locally.
The ideal plan would calculate partial aggregates in parallel on all nodes and then combine the partial results.
It requires two changes:
1. Replace Aggregate->Append with Finalize_Aggregate->Append->Partial_Aggregate.
2. Concurrent execution of Append. It can also be done in two different ways: we can try to use the existing parallel workers infrastructure and replace Append with Gather. This seems to be the best approach for local partitioning. In the case of remote (FDW) partitions, it is enough to separate starting the execution (PQsendQuery in postgres_fdw) from getting the results. So it requires some changes in the FDW protocol.
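
As a rough illustration of the two changes above (this is not PostgreSQL code), here is a minimal Python sketch. The partition contents and the function names partial_sum, finalize_sum and parallel_sum are hypothetical stand-ins; threads merely play the role of parallel workers or remote nodes.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical stand-ins for the partitions derived1, derived2 and derived_fdw.
partitions = {
    "derived1": [1, 2, 3],
    "derived2": [4, 5],
    "derived_fdw": [6],
}


def partial_sum(rows):
    """Partial aggregate: computed next to the partition, returns one value."""
    return sum(rows)


def finalize_sum(partials):
    """Finalize aggregate: combines the per-partition partial results."""
    return sum(partials)


def parallel_sum(parts):
    with ThreadPoolExecutor(max_workers=len(parts)) as pool:
        # Start the work on every partition first (comparable to issuing
        # all remote queries before reading any result) ...
        futures = [pool.submit(partial_sum, rows) for rows in parts.values()]
        # ... and only then collect the partial results.
        partials = [f.result() for f in futures]
    return finalize_sum(partials)


total = parallel_sum(partitions)
```

Only one value per partition reaches the combining step, instead of every row being pulled to a single node.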
I wonder if somebody has already investigated this problem or is working in this direction.
Maybe some patches have already been proposed?
I have searched the hackers archive, but didn't find anything relevant...
Are there any suggestions about the best approach to implement this feature?

Maybe the problem you describe is solved in this thread [1] by introducing the Parallel Append node?

[1] https://www.postgresql.org/message-id/CAJ3gD9dy0K_E8r727heqXoBmWZ83HwLFwdcaSSmBQ1%2BS%2BvRuUQ%40mail.gmail.com


--
Regards,
Maksim Milyutin



--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers