At 04:24 AM 2/17/2006, Ragnar wrote:
On fös, 2006-02-17 at 01:20 -0500, Ron wrote:
>
> OK, so here's _a_ way (there are others) to obtain a mapping such that
>   if a < b then f(a) < f (b) and
>   if a == b then f(a) == f(b)

> By scanning the table once, we can map say 0000001h (Hex used to ease
> typing) to the row with the minimum value and 1111111h to the row
> with the maximum value as well as mapping everything in between to
> their appropriate keys.  That same scan can be used to assign a
> pointer to each record's location.

This step is just as expensive as the original sort you want to replace/improve.

Why do you think that? An external sort involves the equivalent of multiple scans of the table being sorted, sometimes more than lgN of them (where N is the number of items in the table). Since this is physical IO we are talking about, each scan is very expensive, so 1 scan is going to take considerably less time than >= lgN scans will.
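To make the pass-count comparison concrete, here is a small sketch using the textbook pass formula for a multiway external merge sort (the page and memory sizes are illustrative, not from the discussion):

```python
import math

def external_sort_passes(n_pages, mem_pages):
    """Full read+write passes over the data for an external merge
    sort with mem_pages of work memory: one pass to build sorted
    runs, then ceil(log_fanin(runs)) merge passes."""
    runs = math.ceil(n_pages / mem_pages)      # initial sorted runs
    fan_in = mem_pages - 1                     # merge fan-in
    merge_passes = math.ceil(math.log(runs, fan_in)) if runs > 1 else 0
    return 1 + merge_passes

# e.g. a 1,000,000-page table with 1,024 pages of work memory needs
# 2 full read+write passes, versus the single read-only scan that
# builds the order-preserving key mapping.
print(external_sort_passes(1_000_000, 1_024))
```

With more runs or a smaller fan-in the merge-pass count grows logarithmically, which is where the ">= lgN scans" worst case comes from.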


If you want to keep this mapping saved as a sort of an index, or as part of each row's data, this will make the cost of inserts and updates enormous.

Not sure you've got this right either. Looks to me like we are adding a <= 32-bit quantity to each row. Once we know the mapping, incrementally updating it upon insert or update would seem to be a simple matter of a fast search for the correct ranking [interpolation search, for which we have all the needed data, is O(lglgN); hash-based search is O(1)], plus an increment/decrement of the key values greater/less than the key value of the row being inserted or updated. Given that we are updating all the keys in a specific range within a tree structure, that update can be done in O(lgm) (where m is the number of records affected).
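A minimal sketch of the incremental update idea, using a flat sorted list in place of the tree the post assumes (names and data are hypothetical). Duplicates reuse an existing key so the equality property holds; a new value takes the key of its rank and bumps every larger key:

```python
import bisect

def insert_value(values, key_of, v):
    """values: sorted list of distinct values; key_of: value -> dense
    order-preserving key.  The bump loop is O(N) on a flat list; a
    balanced tree with lazy range increments would do the same range
    update in O(lg m), as the post argues."""
    pos = bisect.bisect_left(values, v)       # fast rank search
    if pos < len(values) and values[pos] == v:
        return key_of[v]                      # equal value, equal key
    for bigger in values[pos:]:
        key_of[bigger] += 1                   # shift larger keys up
    values.insert(pos, v)
    key_of[v] = pos                           # new key = rank
    return pos

vals, keys = [], {}
for x in [50, 20, 80, 20, 60]:
    insert_value(vals, keys, x)
# keys now maps 20->0, 50->1, 60->2, 80->3
```

The interpolation or hash search mentioned in the text would replace `bisect_left` here; the structural point is that only the keys above the insertion rank ever change.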

>  We can now sort the key+pointer pairs instead of the actual data and
> use an optional final pass to rearrange the actual rows if we wish.

How are you suggesting this mapping be accessed? If the mapping is kept separate from the tuple data, as in an index, then how will you look up the key?
??? We've effectively created a data set where each record is a pointer to a DB row plus its key. We can now sort the data set by key and then do an optional final pass to rearrange the actual DB rows if we so wish. Since that final pass is very expensive, it is good that not all use scenarios will need that final pass.
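The key+pointer scheme described above can be sketched in a few lines (toy in-memory "table"; a real implementation would be sorting disk pointers):

```python
# Hypothetical sketch of the key+pointer ("tag") sort: sort small
# (key, row_pointer) pairs internally, then optionally permute the
# big rows in one final pass.
rows = [{"id": p, "val": v} for p, v in enumerate([30, 10, 20])]
keys = [(row["val"], ptr) for ptr, row in enumerate(rows)]  # key + pointer
keys.sort()                          # cheap internal sort of small pairs
order = [ptr for _, ptr in keys]     # sorted row order; no row moved yet
# optional final pass: materialize rows in sorted order (one scan,
# but with random seeks in a real on-disk table)
sorted_rows = [rows[ptr] for ptr in order]
```

If the caller only needs ranks, or keys in order, the expensive final pass that rearranges `rows` is skipped entirely, which is the point made above.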

The amount of storage required to sort this representation of the table, rather than the actual table, is so much smaller that it turns an external sorting problem into an internal sorting problem with an optional final pass that is =1= scan (albeit one scan with a lot of seeks and data movement). This is a big win. It is a variation of a well known technique. See Sedgewick, Knuth, etc.
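A back-of-envelope illustration of the storage claim (the row counts and sizes below are made-up illustrative numbers, not figures from this thread):

```python
# Why the key+pointer representation can turn an external sort
# into an internal one: it shrinks the data being sorted.
n_rows   = 10_000_000
row_size = 200            # bytes per row in the real table (assumed)
kp_size  = 4 + 8          # 32-bit key + 64-bit row pointer

table_bytes = n_rows * row_size   # ~2 GB: needs an external sort
kp_bytes    = n_rows * kp_size    # ~120 MB: sorts comfortably in RAM
print(table_bytes, kp_bytes)
```

Under these assumptions the sortable representation is ~17x smaller than the table, so a table that would otherwise spill to disk sorts entirely in memory.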


> That initial scan to set up the keys is expensive, but if we wish
> that cost can be amortized over the life of the table so we don't
> have to pay it all at once.  In addition, once we have created those
> keys, they can be saved for later searches and sorts.

What is the use case where this would work better than a
regular btree index ?
Again, ??? btree indexes address different issues. They do not in any way help create a compact representation of the original data that saves enough space to turn an external ranking or sorting problem into an internal one.


Ron


---------------------------(end of broadcast)---------------------------
TIP 3: Have you checked our extensive FAQ?

              http://www.postgresql.org/docs/faq
