Re: [HACKERS] [PERFORM] A Better External Sort?

Pailloncy Jean-Gerard Thu, 29 Sep 2005 04:14:36 -0700

Your main example seems to focus on a large table where a key column has
constrained values.  This case is interesting in proportion to the
number of possible values. If I have billions of rows, each having one
of only two values, I can think of a trivial and very fast method of
returning the table "sorted" by that key: make two sequential passes,
returning the first value on the first pass and the second value on the
second pass.  This will be faster than the method you propose.
1= No, that was not my main example. It was the simplest example, used to frame the later, more complicated examples. Please don't get hung up on it.
2= You are incorrect. Since IO is the most expensive operation we can do, any method that makes two passes through the data at top scanning speed will take at least 2x as long as any method that only takes one such pass.

You do not get the point.

By the time you have the sorted references to the tuples, you still need to fetch the tuples themselves, check their visibility, etc., and return them to the client.

So,

if there are only 2 values in the column of a big table that is larger than available RAM,

two seq scans of the table without any sorting
is the fastest solution.
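
A minimal sketch of the idea in Python, under stated assumptions: the table is a hypothetical in-memory list of (key, payload) rows standing in for the on-disk heap, `scan` is an assumed callable that restarts a sequential scan, and `is_visible` is an assumed stand-in for the visibility check. It only illustrates the access pattern and is not how the executor is written.

```python
from typing import Callable, Iterable, Iterator, Sequence, Tuple

Row = Tuple[int, str]  # (key, payload); hypothetical row layout


def two_pass_sorted(
    scan: Callable[[], Iterable[Row]],
    values: Sequence[int],
    is_visible: Callable[[Row], bool] = lambda row: True,
) -> Iterator[Row]:
    """Yield rows ordered by key using one sequential scan per distinct value.

    No sort step and no list of tuple references is built: each pass
    reads the rows in physical order and emits only the matching ones.
    """
    for wanted in sorted(values):
        for row in scan():  # one full sequential pass per key value
            if row[0] == wanted and is_visible(row):
                yield row


# Toy table with only two distinct key values (0 and 1).
table = [(1, "a"), (0, "b"), (1, "c"), (0, "d")]
result = list(two_pass_sorted(lambda: iter(table), [0, 1]))
```

With two distinct values this makes exactly two sequential passes, and rows with the same key come back in their physical order.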

Cordialement,
Jean-Gérard Pailloncy


