Re: [HACKERS] GSoC Query

Greg Smith Mon, 29 Mar 2010 00:01:09 -0700

gaurav gupta wrote:

My idea is to add a functionality of Auto tuning and Auto Indexing/Reindexing in DB languages.

Ah, the classic request to start with the tuning and index wizards. Really fun to work on, always lots of interest in it. Here's the thing: projects in this area attract endless interest. If it were possible to write something useful in a couple of months, we'd have a hundred such programs fighting for attention. So the fact that we actually have zero of them should tell you something about the actual difficulty level of the work. You could spend the whole summer just reading research papers on this topic and maybe catch up to the late 90's by the end.

Here's the usual advice I give to students looking to make a useful contribution to any mature development project: the more boring the work sounds, the more likely it is you'll actually do something people can use. It's easy to find people who want to work on fun projects--so easy that they've all been done already. What's left is either much harder than it looks, or kind of dull to do. The idea behind intentionally picking a boring one is that you're more likely to get one that's unfinished for that reason, rather than because it's actually a year or two of work to complete. Or, in the case you're asking about, a decade or three if you were to start from scratch and were really smart. If you started working on this now rather than stopping to follow the research already done you might catch up to http://portal.acm.org/citation.cfm?id=810505 in a couple of months.

Similarly using the no. of select hits on a table we can check that if maximum no. of times it is on a non-index field we can index on that field to make select faster.

It's impractical to figure out where indexes should go without simulating what the optimizer would then do with them against a sample set of queries. You can't do anything useful just with basic statistics about the tables.
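To make that concrete, here is a minimal sketch of the "ask the optimizer" loop: propose a candidate index, then check which queries in a sample workload the planner would actually run through it. Everything in it is an assumption made for illustration: SQLite stands in for the database only because it ships with Python, the orders table, the index name, and the indexes_used helper are hypothetical, and the index is really built and then dropped rather than simulated, which a serious advisor would want to avoid.

```python
import sqlite3


def indexes_used(conn, candidate_ddl, index_name, workload):
    """Build a candidate index, ask the planner about each sample query,
    and return the queries whose plan mentions that index.
    The index is dropped again before returning."""
    conn.execute(candidate_ddl)
    try:
        used = []
        for sql in workload:
            plan = conn.execute("EXPLAIN QUERY PLAN " + sql).fetchall()
            # Each plan row is (id, parent, notused, detail); the detail
            # text names any index the planner chose.
            if any(index_name in row[3] for row in plan):
                used.append(sql)
        return used
    finally:
        conn.execute("DROP INDEX " + index_name)


# Hypothetical schema and data, only so the planner has something to plan.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, total REAL)"
)
conn.executemany(
    "INSERT INTO orders (customer_id, total) VALUES (?, ?)",
    [(i % 50, float(i)) for i in range(1000)],
)
conn.commit()

# Hypothetical sample workload.
workload = [
    "SELECT * FROM orders WHERE customer_id = 42",
    "SELECT * FROM orders WHERE total > 100",
    "SELECT * FROM orders WHERE id = 7",
]

used = indexes_used(
    conn,
    "CREATE INDEX idx_orders_customer ON orders (customer_id)",
    "idx_orders_customer",
    workload,
)
for sql in used:
    print("planner would use the candidate index for:", sql)
```

Note that this only reports whether the planner picks the index, not how much cheaper the plan becomes or what the index costs on writes; those are the parts that make the real problem hard.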

I would recommend http://msdn.microsoft.com/en-us/library/aa226167(SQL.70).aspx as a good, practical introduction to the topic of what it takes to figure out where indexes go, from someone who came up with a reasonable solution to that problem. You can find a list of the underlying research they cite (and an idea what has been done since then) at http://portal.acm.org/citation.cfm?id=673646


--
Greg Smith  2ndQuadrant US  Baltimore, MD
PostgreSQL Training, Services and Support
g...@2ndquadrant.com   www.2ndQuadrant.us


