Re: [HACKERS] 8.2 features status

Rick Gigger Sat, 05 Aug 2006 00:18:43 -0700

If people are going to start listing features they want here's somethings I think would be nice. I have no idea though if they would beuseful to anyone else:

1) hierarchical / recursive queries. I realize it's just beendiscussed at length but since there was some question as to whetheror not there's demand for it so I am just weighing in that I thinkthere is. I have to deal with hierarchy tables all the time and Isimply have several standard methods of dealing with them dependingon the data set / format. But they all suck. I've just gotten useto using the workarounds since there is nothing else. If you are nothearing the screams it's just because I think it's just become a factof life for most people (unless you're using oracle) that you've justgot to work around it. And everyone already has some code to do thisand they've already done it everywhere it needs to be done. And aslong as you're a little bit clever you can always work around itwithout taking a big performance hit. But it would sure be nice tohave next time I have to deal with a tree table.

2) PITR on a per database basis. I think this would be nice but I'mguessing that the work involved is big and that few people reallycare or need it, so it will probably never happen.

3) A further refinement of PITR where some sort of deamon ships smalllog segments as they are created so that the hot standby doesn't haveto be updated in 16MB increments or have to wait for some timeout tooccur. It could always be up to the minute data.

4) All the Greenplum Bizgress MPP goodness. In reality (and I don'tknow if bizgress mpp can actually do this) I'd like to have a clusterof cheap boxes. I'd like to install postgres on all of them andconfigure them in such a way that it automatically partitions andmirrors each table so that each piece of data is always on two boxesand large tables and indexes get divided up intelligently. Sort oflike a raid10 on the database level. This way any one box could dieand I would be fine. Enormous queries could be handled efficientlyand I could scale up by just dropping in new hardware.

Maybe greeenplum has done this. Maybe we will get their changes soonenough, maybe not. Maybe this sort of functionality will neverhappen. My guess is that all the little bit's a pieces of this willtrickle in over the next several years and this sort of setup will beslowly converged on over time as lot's of little things cometogether. Table spaces and constraint exclusion come to mind here asthings that could eventually evolve to contribute to a larger solution.

5) Somehow make it so I NEVER HAVE TO THINK ABOUT OR DEAL WITH VACUUMAGAIN. Once I get everything set up right everything works great butI'm sure if there's one thing I think everyone would love it would begetting postgres to the point where you don't even need to shipvacuumdb because there's no way the user could outsmart postgres'sattempts to do garbage collection on it's own.

6) genuine updatable views. such that you just add an updatablekeyword when you create the view and it's automagically updatable.I'm guessing that we'll get something like that, but its real magicwill be throwing an error to tell you when you try to make a viewupdatable and it can't figure out how to make the rules properly.

7) allow some way to extract the data files from a single databaseand insert them into another database cluster. In many cases itwould be a lot faster to copy the datafiles across the network thanit is to dump, copy dump file, reload.

8) some sort of standard "hooks" to be used for replication. I guesswhen the replication people all get their heads together and tell thecore developers what they all need something like this could evolve.

Like I said, postgres more than satisfies my "needs". I amespecially happy when you factor in the cost of the software (free),and the quality of the community support (excellent).

And you can definitely say that the "missing" list is shrinking. ButI think of it like this. There are tiers of database functionalitythat different people need:A) Correct me if I'm wrong but as great as postgres is there arestill people out there that MUST HAVE Oracle or DB2 to get done whatthey need to get done. They just do things that the others can't.They may be expensive. They may suck to use and administer but thesimple fact is that they have features that people need that are notoffered in less expensive databases.B) Very, very powerful databases but lack the biggest, mostcomplicated "enterprise" features.C) Light weight db for taking care of the basic need to store dataand query it with sql. (some would call these "toy" databases)D) databases which are experimental, unreliable or have other limitsthat make them not practical compared with the other options

I would say that with version 7.0 postgres moved from D to C (pleasedon't get offended if this is way off base, I never used 6.x but Iheard it was prone to crashes, data corruption and of course therewas that pesky row size limit). It then proceeded to move up withintier C to become the best of it's class and pushing up into level B.With 8.0 it was firmly in level B. It was fast, efficient, powerfuland began adding lots of really, really big features like PITR,savepoints, tablespaces, etc. Add ons like slony also allowed it tobe used in places where it otherwise wouldn't have measured up.

Now there are only a few features left in the B range and so thereare tons of situations that can be taken care of by postgres now thatwere out of it's reach just a few years ago. Once those features areall gone there will still be some very big, very difficult featureson the table that once completed will begin to remove any advantagethat the really big guys have. I'm thinking especially of #4 abovehere. But they will definitely take a while.

I may have tons of details wrong here but my point is that I thinkthat postgres isn't just taking stuff off a big to do list, butrather is pushing itself upwards and is now in a position to startworking on some very hard problems that once completed will put itinto a very elite class of database systems. The "missing" list fortier B type problems is shrinking down to almost nothing and itemsfrom the tier A missing list are starting to come into view.

Maybe I'm way off base here but that's how I see it. Postgres hascome a long, long way, but the problems ahead are bigger and meanerthan the ones behind.



On Aug 4, 2006, at 12:02 AM, David Fetter wrote:

On Fri, Aug 04, 2006 at 12:37:10AM -0400, Tom Lane wrote:

Bruce Momjian <[EMAIL PROTECTED]> writes:

To me new things are like PITR, Win32, savepoints, two-phase
commit, partitioned tables, tablespaces.  These are from 8.0 and
8.1.  What is there in 8.2 like that?


[ shrug... ]  Five out of your six items have no basis in the SQL
spec.  So it's not clear to me what your definition of "major
feature" is, unless maybe it's "anything except what we did for
8.2".  Can you enumerate ten things you would consider comparable to
the above features that aren't done yet?


First, I'd like to say people are doing a fantastic job here.  Kudos!

One huge thing missing from the "done" list is that crucial bit of
infrastructure and process that has shortened feedback loops--hence
the beta period--by weeks if not months: the build farm.  It's now
smoothly integrated into the development process, and as a
consequence, we can realistically have a release each year. :)

As far as big missing features go, here's a short list:

* Splitting queries among CPUs--possibly even among machines--for OLAP
  loads

* In-place upgrades (pg_upgrade)

* Several varieties of replication, which I believe we as a project
  will eventually endorse and ship

* CALL

* WITH RECURSIVE

* MERGE

* Windowing functions

* On-the-fly in-line calls out to PL/your_choice without needing to
  issue DDL

* Wild-eyed feral bits of the SQL standard like SQL/MED and SQL/XML

But all that leaves out the oldest, most honored Postgres tradition:

    Breaking New Ground.

We're definitely not done yet. :)

Cheers,
D
--
David Fetter <[EMAIL PROTECTED]> http://fetter.org/
phone: +1 415 235 3778        AIM: dfetter666
                              Skype: davidfetter

Remember to vote!

---------------------------(end ofbroadcast)---------------------------

TIP 9: In versions below 8.0, the planner will ignore your desire to
       choose an index scan if your joining column's datatypes do not
       match



---------------------------(end of broadcast)---------------------------
TIP 6: explain analyze is your friend

Re: [HACKERS] 8.2 features status

Reply via email to