Re: [HACKERS] Dynamic Partitioning using Segment Visibility Maps

Markus Schiltknecht Sun, 06 Jan 2008 02:51:36 -0800

Hi,

Robert Treat wrote:

On Saturday 05 January 2008 14:02, Markus Schiltknecht wrote:

To satisfy all the different requirements of partitioning with segments
based partitioning, we'd have to allow a table to span multiple table
spaces. I'm not very keen on going that way.

Why?

Uh.. if a table (RELKIND_RELATION) can only span one table space, as it
is now, all of its segments are in the same table space. I don't quite
call that partitioning. Well, sure, you could call it so, but then, each
and every Postgres table is already partitioned in 1G segments.


It all depends on the definitions, but in my world, horizontal
partitioning for databases involves multiple table spaces (and is quite
useless without that). Calling anything else partitioning is confusing,
IMO.

I'm not following this. If we can work out a scheme, I see no reason not toallow a single table to span multiple tablespaces. Do you see a problem withthat?

Uhm... well, no. I was just pointing out that it's a requirement. Itdepends on how you define things, but I'm seeing it that way:


table -- 1:n -- partition -- 1:1 -- table space -- 1:n -- segments

What I'm advocating is making partitions available to the DBA as somekind of a relation, she can query separately and move around betweentable spaces.

Why should that not be possible with other schemes? Moving the split
point between two partitions involves moving tuples around, no matter if
you are going to move them between segments or between relations
building the partitions.
The difference is that, if I currently have a table split by month, Ican "re-partition" it into weekly segments, and only shuffle one months dataat a time minimize impact on the system while I shuffle it. This can even beused to do dynamic management, where data from the current month is archivedby day, data from the past year by week, and data beyond that done monthly.

This should be possible for both schemes, I see no connection to whatwe've discussed. SE doesn't magically give you this level of control youare requesting here. Quite the opposite: referring to CLUSTERing tomakes me wonder, if that's not going to shuffle way too many tuples around.

What I'm saying is, that SE doesn't partition the segments intodifferent table spaces. Thus I don't consider it "database partitioning"in the first place. As I currently understand it, it's:


table -- 1:1 -- table space -- 1:n -- partitions -- 1:n -- segments

On many other databases, if you change the partition scheme, it requiresexclusive locks and a shuffleing of all of the data, even data whosepartitions arent being redefined. Even worse are systems like mysql, whereyou need to rewrite the indexes as well. To me, these requirements alwaysseem like show stoppers; I generally can't afford to lock a table while thedatabase rewrites a billion rows of data.


I fully agree here. How do you plan to solve that problem on top of SE?

In a more general sense, a global index is a an index that spans multiplepartitions, as opposed to a local index, which is an index on specificpartitions; postgresql current supports the latter, not the former.
In any case, my thinking is if we had the segment exclusion technique, I couldconvert that partitioned table into a regular table again,


... on a single table space ...

use segmentexclusion to handle what is currently handled by partitions,


... except, that there is no partitioning (!?!) (between table spaces)

and createa "global index" across all the other data for that other, currently killer,query.

I thought the table you are referring to is bigger than your fastesttable space? That would even make it impossible.

See where I'm coming from? And why I'm stating that SE is anoptimization (for seq scans), but not partitioning?


Regards

Markus


---------------------------(end of broadcast)---------------------------
TIP 4: Have you searched our list archives?

              http://archives.postgresql.org

Re: [HACKERS] Dynamic Partitioning using Segment Visibility Maps

Reply via email to