Re: [HACKERS] Dynamic Partitioning using Segment Visibility Maps

Markus Schiltknecht Sat, 05 Jan 2008 11:05:48 -0800

Hi,

Robert Treat wrote:

Personally I cant say it complicates things, because it isn't clear how itwill be managed. :-)

Well, management of relations is easy enough, known to the DBA and mostimportantly: it already exists. Having to set up something which is*not* tied to a relation complicates things just because it's anadditional concept.

But as I've pointed out, maybe what we have in mind isn't that differentat all. Just have a sentinel relation mean a set of segments, i.e. allread-only segments of a table. Then again, a table - in a way - is notmuch else than a set of segments. So where's the real difference?

To satisfy all the different requirements of partitioning with segments
based partitioning, we'd have to allow a table to span multiple table
spaces. I'm not very keen on going that way.


Why?

Uh.. if a table (RELKIND_RELATION) can only span one table space, as itis now, all of its segments are in the same table space. I don't quitecall that partitioning. Well, sure, you could call it so, but then, eachand every Postgres table is already partitioned in 1G segments.

It all depends on the definitions, but in my world, horizontalpartitioning for databases involves multiple table spaces (and is quiteuseless without that). Calling anything else partitioning is confusing, IMO.

So the one thing that always scares me about these "define it all and let thedatabase sort it out" methods is they seem to lead to cases where the systemends up rewriting the data to fit into some new partition layout.

That holds true no matter if you shuffle between segments or relations.To be able to let the DBA define an exact split point, the database*will* have to shuffle tuples around. Why does that scare you? It's aregular database system's maintenance procedure.

One thingthat is nice about the current partitioning scheme is you can control theimpact of this behavior in these scenarios, but moving around small portionsof the table at a time.

Uh.. I'm not quite following. What "current partitioning scheme" are youreferring to?

Why should that not be possible with other schemes? Moving the splitpoint between two partitions involves moving tuples around, no matter ifyou are going to move them between segments or between relationsbuilding the partitions.

More to the point (I think) is that people define access to the data based onthe meaning of the data, not how it is stored on disk. For example, in sometables we only need to be active on 1 months worth of data... how that islaid out on disk (# partitions, which tablespaces) is a means to the end ofworking actively on 1 months worth of data. I can't think of many cases wherepeople would actually say the want to work actively on the most recent GB ofdata.

Agreed. I'd say that's why the DBA needs to be able to define the splitpoint between partitions: only he knows the meaning of the data.

To me, both of SVM and
SE look much more like an optimization for certain special cases and
don't have much to do with partitioning.


Even if this were true, it might still be a useful optimization.

Possibly, yes. To me, the use case seems pretty narrow, though. Forexample it doesn't affect index scans much.

One table Iam thinking of in particular in my system has one query we need to run acrosspartitions, which ends up doing a slew of bitmap index scans for all thepartitions. If using segment exclusion on it meant that I could get a globalindex to help that query, I'd be happy.

As proposed, Segment Exclusion works only on exactly one table. Thus, ifyou already have your data partitioned into multiple relations, it mostprobably won't affect your setup much. It certainly has nothing to dowith what I understand by 'global index' (that's an index spanningmultiple tables, right?).


Regards

Markus


---------------------------(end of broadcast)---------------------------
TIP 6: explain analyze is your friend

Re: [HACKERS] Dynamic Partitioning using Segment Visibility Maps

Reply via email to