Re: [HACKERS] Declarative partitioning grammar

Jeff Cohen Mon, 14 Jan 2008 19:23:07 -0800


On Jan 14, 2008, at 1:49 AM, Markus Schiltknecht wrote:

I don't think the separation into list, hash and range partitioningis adequate. What is the system supposed to do, if you try toinsert a row which doesn't fit any of the values in your list ordoesn't fit any of the ranges you defined?


Hi Markus,

If you don't define a "default" partition to handle outliers, theinsert should fail with an error.

I prefer a partitioning grammar which doesn't interfere withconstraints. We all know how to define constraints. Please don'tintroduce a new, ambiguous way. A partitioning definition should beable to tell the target partition for *every* row which satisfiesthe constraints (the real ones, not ambiguous ones).
IMO, a single DDL command should only touch a single split point,i.e. split a table into two partitions, move the split point orremove the split point (joining the partitions again). Those arethe only basic commands you need to be able to handle partitioning.

I can certainly appreciate the simplicity of this approach. It letsus use a generic check constraint to perform partitioning, so it ismore general than partitioning using hash, list, and range. However,it achieves this generality at the expense of usability for typicalcustomer cases. For example, let's look at the case of a table of 1year of sales data, where we want to create 12 partitions -- one foreach month.

With the generic approach, you start with a single table, and startby splitting it into two six-month partitions:


ALTER TABLE sales
  SPLIT where sales_date > date '2007-06-01'
   INTO
    (
     PARTITION first_half
     PARTITION second_half
     );

We could implement this approach using check constraints and tableinheritance: the partition second_half is a child table wheresales_date > date '2007-06-01', and the partition first_half has thecomplementary constraint NOT(sales_date > date '2007-06-01').


Next, you split each partition:

ALTER TABLE sales
  SPLIT PARTITION first_half where sales_date > date '2007-03-01'
   INTO
    (
     PARTITION first_quarter
     PARTITION second_quarter
     );

So now the child table for first_half itself has two children. Asyou continue this process you construct a binary tree of tableinheritance using 12 ALTER statements.

In the "long" grammar you can create and partition the table in onestatement:


CREATE TABLE sales
...
PARTITION BY sales_date
(
start (date '2007-01-01') end (date '2008-01-01')
every (interval '1 month')
);

Sorry, but for my taste, the proposed grammar is too long percommand, not flexible enough and instead ambiguous for split pointsas well as for constraints. To me it looks like repeating themistakes of others.

Thanks for your feedback. Partitioning the table using series ofsplits is a clever solution for situations where the partitioningoperation cannot be described using simple equality (like list,hash)or ordered comparison (range). But for many common business cases,the "long" grammar is easier to specify.


kind regards,

Jeff


---------------------------(end of broadcast)---------------------------
TIP 1: if posting/reading through Usenet, please send an appropriate
      subscribe-nomail command to [EMAIL PROTECTED] so that your
      message can get through to the mailing list cleanly

Re: [HACKERS] Declarative partitioning grammar

Reply via email to