Re: [HACKERS] Ordered Append Node

Markus Schiltknecht Fri, 23 Nov 2007 00:16:38 -0800

Hello Gregory,

Gregory Stark wrote:

It is kind of like a merge join but not quite. It's interleaving rows rather
than matching them up. It's more like the final merge of a sort which also
uses a heap to efficiently find the next value from the source tapes.

Well, maybe my point here is: why do you need the heap to sort? Theinputs you've got are already sorted, you only need to merge them. To methis sounds very much like the final stage of a merge sort, which wouldnot require much memory.

IMO, a merge sort could easier be implemented by a binary tree zippernode, as I had in mind. Leading to a plan like that (well, hey, this isall made up):


Zipper  (cost..., sort by public.t.i)
  ->  Zipper  (cost .., sort by public.t.i)
        -> Zipper (cost .., sort by public.t.i)
             -> Index Scan using ti1 on t1
             -> Index Scan using t12 on t2
        -> Index Scan using ti2 on t3
  ->  Zipper  (cost .., sort by public.t.i)
        -> Index Scan using ti4 on t4
        -> Index Scan using ti5 on t5

Every zipper node would simply have to keep the two top tuples from itsinputs in memory, compare them and return the best.

But maybe that's exactly how Knuth's polyphase merge sort internallyalso merge their inputs (or runs). And perhaps it makes sense to showthe user just one simple append node instead of throwing a tree ofZipper nodes at her. ;-)

Not necessarily but it is something Postgres supports and I don't think we
want to break it. Actually it's useful for partitioned tables if you build the
new partition in a separate table and then add it to the partitioned table. In
that case you may have gone through several steps of adding columns and
dropping them to get the structure to line up.


Agreed, especially because lining up the columns isn't that hard after all.

OTOH I think Postgres is way too flexible in how it allows partitioningto be done and thus it often can't optimize it properly. I'd very muchlike to teach it a stricter and simpler to use partitioning scheme thanwhat we have with constraint exclusion.

But that's pipe dreaming, and your improvement to the append node iscertainly a good step towards the right direction, keep up the good work!


Regards

Markus


---------------------------(end of broadcast)---------------------------
TIP 3: Have you checked our extensive FAQ?

              http://www.postgresql.org/docs/faq

Re: [HACKERS] Ordered Append Node

Reply via email to