Re: [HACKERS] Modifying TOAST thresholds

2007-04-27 Thread Chris Browne
[EMAIL PROTECTED] (Tom Lane) writes:
> Chris Browne <[EMAIL PROTECTED]> writes:
>> [EMAIL PROTECTED] (Bruce Momjian) writes:
>>> I have seen no one do peroformance testing of this, so it seems it
>>> will have to wait for 8.4.
>
>> I didn't have time...
>
>> (e.g. - we've got a case where dropping the threshold to ~900 bytes
>> would give us a big win for certain databases and tables.)
>
> How do you know?  Seems like you've got a readymade test case there.

I did some testing with Known Scenario, and found, indeed, that there
was a significant gain to be had.  I documented it at least partially
on March 21...



Unfortunately, the sample query that I used to validate usefulness
isn't one I can share :-(.

More importantly, it's only one test case, and is strongly influenced
by some *very* strong regularity to the patterns of updates that take
place to the table that I looked at.  It's not nearly good enough to
treat as a generalizable case.
-- 
output = ("cbbrowne" "@" "linuxfinances.info")
http://linuxdatabases.info/info/spiritual.html
MICROS~1 is to quality software what MacDonalds is to gourmet cooking

---(end of broadcast)---
TIP 9: In versions below 8.0, the planner will ignore your desire to
   choose an index scan if your joining column's datatypes do not
   match


Re: [HACKERS] Modifying TOAST thresholds

2007-04-27 Thread Tom Lane
Chris Browne <[EMAIL PROTECTED]> writes:
> [EMAIL PROTECTED] (Bruce Momjian) writes:
>> I have seen no one do peroformance testing of this, so it seems it
>> will have to wait for 8.4.

> I didn't have time...

> (e.g. - we've got a case where dropping the threshold to ~900 bytes
> would give us a big win for certain databases and tables.)

How do you know?  Seems like you've got a readymade test case there.

regards, tom lane

---(end of broadcast)---
TIP 4: Have you searched our list archives?

   http://archives.postgresql.org


Re: [HACKERS] Modifying TOAST thresholds

2007-04-27 Thread Chris Browne
[EMAIL PROTECTED] (Bruce Momjian) writes:
> I have seen no one do peroformance testing of this, so it seems it
> will have to wait for 8.4.

I didn't have time...

I'll see if I can find a decent place to document how to tweak the
threshold, as that seems like it could be worth doing in cases where
it is known that there is benefit to TOASTing smaller tuples for one's
well-understood workload.

(e.g. - we've got a case where dropping the threshold to ~900 bytes
would give us a big win for certain databases and tables.)
-- 
let name="cbbrowne" and tld="linuxfinances.info" in name ^ "@" ^ tld;;
http://linuxfinances.info/info/unix.html
recursion, n:
See recursion.

---(end of broadcast)---
TIP 2: Don't 'kill -9' the postmaster


Re: [HACKERS] Modifying TOAST thresholds

2007-04-26 Thread Bruce Momjian

I have seen no one do peroformance testing of this, so it seems it will
have to wait for 8.4.

---

Gregory Stark wrote:
> "Tom Lane" <[EMAIL PROTECTED]> writes:
> 
> > What I would definitely like to see for 8.3 is some performance testing
> > done to determine whether we ought to change the current defaults.
> > (Both TOAST_TUPLES_PER_PAGE and EXTERN_TUPLES_PER_PAGE ought to be looked
> > at.)
> 
> It will take some thinking before it's even clear what we should be measuring.
> The "optimal" value will depend heavily on the usage pattern so the best value
> for the default will be something hand-wavy like "the smallest tuple size
> where the cost of a select including the column is greater than the time saved
> on a select not including the column" or something like that.
> 
> -- 
>   Gregory Stark
>   EnterpriseDB  http://www.enterprisedb.com
> 
> 
> ---(end of broadcast)---
> TIP 7: You can help support the PostgreSQL project by donating at
> 
> http://www.postgresql.org/about/donate

-- 
  Bruce Momjian  <[EMAIL PROTECTED]>  http://momjian.us
  EnterpriseDB   http://www.enterprisedb.com

  + If your life is a hard drive, Christ can be your backup. +

---(end of broadcast)---
TIP 3: Have you checked our extensive FAQ?

   http://www.postgresql.org/docs/faq


Re: [HACKERS] Modifying TOAST thresholds

2007-04-04 Thread Luke Lonergan
Not just EDB :-)

- Luke

Msg is shrt cuz m on ma treo

 -Original Message-
From:   Chris Browne [mailto:[EMAIL PROTECTED]
Sent:   Wednesday, April 04, 2007 10:07 PM Eastern Standard Time
To: pgsql-hackers@postgresql.org
Subject:Re: [HACKERS] Modifying TOAST thresholds

[EMAIL PROTECTED] (Tom Lane) writes:
> Bruce Momjian <[EMAIL PROTECTED]> writes:
>> The big question is whether this is for 8.3 or 8.4.
>
> What I would definitely like to see for 8.3 is some performance testing
> done to determine whether we ought to change the current defaults.
> (Both TOAST_TUPLES_PER_PAGE and EXTERN_TUPLES_PER_PAGE ought to be looked
> at.)
>
> Whether it's possible to get the storage parameter in there depends on
> how soon someone produces a patch.  Given that we understand this area
> fairly well, I personally would be willing to give it a pass on the
> "feature freeze" rule, as long as we have the patch by say mid-April.

If I had to end of April, I'd volunteer.  Alas, I will be unavailable
between tomorrow and mid-April.

It seems to me that changing the denominator from 4 to something
somewhat higher (16 comes to mind) would probably be "the patch," and
would likely be useful for a fair number of cases.  I'd really like to
see something like this (e.g. - something that would start TOASTing
for, say, tuples > 500 bytes) in 8.3; this seems to me to have the
potential to be a significant optimization that would be VERY cheap to
implement.  "Low hanging fruit," so to speak.

EDB might like to use this as a route into a tunable "column store"
some time later; I don't think that should stop us from an improvement
today...
-- 
"cbbrowne","@","linuxfinances.info"
http://www3.sympatico.ca/cbbrowne/rdbms.html
"MSDOS didn't get as bad as it  is overnight -- it took over ten years
of careful development."  -- <[EMAIL PROTECTED]>

---(end of broadcast)---
TIP 4: Have you searched our list archives?

   http://archives.postgresql.org



Re: [HACKERS] Modifying TOAST thresholds

2007-04-04 Thread Tom Lane
Chris Browne <[EMAIL PROTECTED]> writes:
> [EMAIL PROTECTED] (Tom Lane) writes:
>> What I would definitely like to see for 8.3 is some performance testing
>> done to determine whether we ought to change the current defaults.
>> (Both TOAST_TUPLES_PER_PAGE and EXTERN_TUPLES_PER_PAGE ought to be looked
>> at.)

> It seems to me that changing the denominator from 4 to something
> somewhat higher (16 comes to mind) would probably be "the patch," and
> would likely be useful for a fair number of cases.  I'd really like to
> see something like this (e.g. - something that would start TOASTing
> for, say, tuples > 500 bytes) in 8.3; this seems to me to have the
> potential to be a significant optimization that would be VERY cheap to
> implement.  "Low hanging fruit," so to speak.

So let's see some performance measurements to back up that intuition.

regards, tom lane

---(end of broadcast)---
TIP 7: You can help support the PostgreSQL project by donating at

http://www.postgresql.org/about/donate


Re: [HACKERS] Modifying TOAST thresholds

2007-04-04 Thread Bruce Momjian

Patch rejected, since we have decided we need to have this as part of
CREATE/ALTER table, rather than a GUC.

---

Chris Browne wrote:
> [EMAIL PROTECTED] (Tom Lane) writes:
> > "Simon Riggs" <[EMAIL PROTECTED]> writes:
> >> Well it certainly seems worth separating them. It does seem possible
> >> that recursive toasting effected some of the earlier results we looked
> >> at.
> >
> >> Would you like me to do this, or will you?
> >
> > I'm willing to do the code changes to separate TOAST_THRESHOLD from
> > the toast chunk size, but I do not have the time or facilities to do
> > any performance testing for different parameter choices.  Anyone want
> > to work on that?
> >
> >> I'd like to get some mechanism for reducing WAL volume into 8.3, whether
> >> its configurable toast or WAL reduction for UPDATEs. If for no other
> >> reason than making backup and availability solutions more manageable.
> >
> > I think the WAL-reduction proposal needs more time and thought than is
> > feasible before 8.3.  OTOH, tuning the TOAST parameters seems like
> > something we understand well enough already, we just need to put some
> > cycles into testing different alternatives.  I would have no objection
> > to someone working on that during April and delivering a final patch
> > sometime before beta.
> 
> Here's a "drafty" patch that *tries* to do this using a GUC variable;
> it passes some interactive testing.  It probably needs an
> assign_hook() function to do further validation (probably to make sure
> that sizes are rightly aligned on both 32 and 64 bit platforms); feel
> free to consider me incompetent at this stage at generating such...
> 
> I would *very* much like to see something of this sort in 8.3; that
> would be of definite value to some of our applications which store
> data that is a bit too small to meet the present
> TOAST_TUPLE_THRESHOLD.  (E.g. - it's worth noting that common
> SOAP-like XML requests are in the 700-1000 byte range; such values are
> generally nicely compressible and are often not likely to be used in
> summary-oriented queries on mainline tables...)
> 
> I don't think I can come up with a performance "test suite" this week,
> and will be unavailable from April 6-14th; if others were to find this
> valuable, and volunteer to set up some sort of test in the interim,
> that would be super.  Absent that, I should be able to do some work on
> this in the latter half of April.
> 
> set toast_default_threshold TO 128;
> create table sample (id serial primary key, txt text);
> insert into sample (txt) values 
> ('12347890347902317849018732417094739027847891230978041237890341278904123789012347980123479081234789041378901234789041237980123478901423789014237980123478901342789014237890142378901423');
> create table sample (id serial primary key, txt text);
> insert into sample (txt) values 
> ('12347890347902317849018732417094739027847891230978041237890341278904123789012347980123479081234789041378901234789041237980123478901423789014237980123478901342789014237890142378901423');
> insert into sample (txt) values 
> ('12347890347902317849018732417094739027847891230978041237890341278904123789012347980123479081234789041378901234789041237980123478901423789014237980123478901342789014237890142378901423');
> insert into sample (txt) values 
> ('12347890347902317849018732417094739027847891230978041237890341278904123789012347980123479081234789041378901234789041237980123478901423789014237980123478901342789014237890142378901423');
> insert into sample (txt) values 
> ('12347890347902317849018732417094739027847891230978041237890341278904123789012347980123479081234789041378901234789041237980123478901423789014237980123478901342789014237890142378901423');
> insert into sample (txt) values 
> ('12347890347902317849018732417094739027847891230978041237890341278904123789012347980123479081234789041378901234789041237980123478901423789014237980123478901342789014237890142378901423');
> insert into sample (txt) values 
> ('12347890347902317849018732417094739027847891230978041237890341278904123789012347980123479081234789041378901234789041237980123478901423789014237980123478901342789014237890142378901423');
> insert into sample (txt) values 
> ('12347890347902317849018732417094739027847891230978041237890341278904123789012347980123479081234789041378901234789041237980123478901423789014237980123478901342789014237890142378901423');
> insert into sample (txt) values 
> ('12347890347902317849018732417094739027847891230978041237890341278904123789012347980123479081234789041378901234789041237980123478901423789014237980123478901342789014237890142378901423');
> insert into sample (txt) values 
> ('12347890347902317849018732417094739027847891230978041237890341278904123789012347980123479081234789041378901234789041237980123478901423789014237980123478901342789014237890142378901423');
> insert into sample (txt) values 
> ('1234789034790231784901873241709473902784789123097

Re: [HACKERS] Modifying TOAST thresholds

2007-04-04 Thread Chris Browne
[EMAIL PROTECTED] (Tom Lane) writes:
> Bruce Momjian <[EMAIL PROTECTED]> writes:
>> The big question is whether this is for 8.3 or 8.4.
>
> What I would definitely like to see for 8.3 is some performance testing
> done to determine whether we ought to change the current defaults.
> (Both TOAST_TUPLES_PER_PAGE and EXTERN_TUPLES_PER_PAGE ought to be looked
> at.)
>
> Whether it's possible to get the storage parameter in there depends on
> how soon someone produces a patch.  Given that we understand this area
> fairly well, I personally would be willing to give it a pass on the
> "feature freeze" rule, as long as we have the patch by say mid-April.

If I had to end of April, I'd volunteer.  Alas, I will be unavailable
between tomorrow and mid-April.

It seems to me that changing the denominator from 4 to something
somewhat higher (16 comes to mind) would probably be "the patch," and
would likely be useful for a fair number of cases.  I'd really like to
see something like this (e.g. - something that would start TOASTing
for, say, tuples > 500 bytes) in 8.3; this seems to me to have the
potential to be a significant optimization that would be VERY cheap to
implement.  "Low hanging fruit," so to speak.

EDB might like to use this as a route into a tunable "column store"
some time later; I don't think that should stop us from an improvement
today...
-- 
"cbbrowne","@","linuxfinances.info"
http://www3.sympatico.ca/cbbrowne/rdbms.html
"MSDOS didn't get as bad as it  is overnight -- it took over ten years
of careful development."  -- <[EMAIL PROTECTED]>

---(end of broadcast)---
TIP 4: Have you searched our list archives?

   http://archives.postgresql.org


Re: [HACKERS] Modifying TOAST thresholds

2007-04-04 Thread Gregory Stark
"Tom Lane" <[EMAIL PROTECTED]> writes:

> What I would definitely like to see for 8.3 is some performance testing
> done to determine whether we ought to change the current defaults.
> (Both TOAST_TUPLES_PER_PAGE and EXTERN_TUPLES_PER_PAGE ought to be looked
> at.)

It will take some thinking before it's even clear what we should be measuring.
The "optimal" value will depend heavily on the usage pattern so the best value
for the default will be something hand-wavy like "the smallest tuple size
where the cost of a select including the column is greater than the time saved
on a select not including the column" or something like that.

-- 
  Gregory Stark
  EnterpriseDB  http://www.enterprisedb.com


---(end of broadcast)---
TIP 7: You can help support the PostgreSQL project by donating at

http://www.postgresql.org/about/donate


Re: [HACKERS] Modifying TOAST thresholds

2007-04-04 Thread Bruce Momjian
Simon Riggs wrote:
> On Wed, 2007-04-04 at 16:26 -0400, Tom Lane wrote:
> > Bruce Momjian <[EMAIL PROTECTED]> writes:
> > > Simon Riggs wrote:
> > >> Having both default GUC and individual table-level WITH parameters seems
> > >> like the best way to me.
> > 
> > > Agreed.
> > 
> > There's an extremely good reason not to have a GUC variable, which is
> > that changes in it would fail to reflect into decisions about whether to
> > create TOAST tables.  When I first brought up the point I didn't see a
> > way around it, but now that I do, I don't think we should expose a
> > failure mode just to have a GUC.
> 
> It depends how it works. If the GUC was a default that was applied only
> at CREATE TABLE time, then it would be safe.

Well, if the GUC applies at CREATE TABLE, it is storing the GUC in
pg_class, which was my point.

> Changing default_with_oids didn't cause all tables to stop/start using
> oids. Why would it?

oid status is stored in pg_class.

> > > OK, but we need to throw a clear message when the TOAST table needs to
> > > be created by the administrator.
> > 
> > No, we just need to not have a GUC for this.  There's no GUC for default
> > fill factor; have you seen anyone complain about that?
> 
> I'd rather set it once than many times, thats all.

Let's find the optimal value for the default, and then you use that all
the time and just change it when you want to when you create the column.

> I certainly care more about temp_tablespaces than I do about this GUC...
> that is something I'll be moaning about if that gets deferred.

I don't see how they are related.


-- 
  Bruce Momjian  <[EMAIL PROTECTED]>  http://momjian.us
  EnterpriseDB   http://www.enterprisedb.com

  + If your life is a hard drive, Christ can be your backup. +

---(end of broadcast)---
TIP 4: Have you searched our list archives?

   http://archives.postgresql.org


Re: [HACKERS] Modifying TOAST thresholds

2007-04-04 Thread Simon Riggs
On Wed, 2007-04-04 at 16:26 -0400, Tom Lane wrote:
> Bruce Momjian <[EMAIL PROTECTED]> writes:
> > Simon Riggs wrote:
> >> Having both default GUC and individual table-level WITH parameters seems
> >> like the best way to me.
> 
> > Agreed.
> 
> There's an extremely good reason not to have a GUC variable, which is
> that changes in it would fail to reflect into decisions about whether to
> create TOAST tables.  When I first brought up the point I didn't see a
> way around it, but now that I do, I don't think we should expose a
> failure mode just to have a GUC.

It depends how it works. If the GUC was a default that was applied only
at CREATE TABLE time, then it would be safe.

Changing default_with_oids didn't cause all tables to stop/start using
oids. Why would it?

> > OK, but we need to throw a clear message when the TOAST table needs to
> > be created by the administrator.
> 
> No, we just need to not have a GUC for this.  There's no GUC for default
> fill factor; have you seen anyone complain about that?

I'd rather set it once than many times, thats all.

I certainly care more about temp_tablespaces than I do about this GUC...
that is something I'll be moaning about if that gets deferred.

> > The big question is whether this is for 8.3 or 8.4.
> 
> What I would definitely like to see for 8.3 is some performance testing
> done to determine whether we ought to change the current defaults.
> (Both TOAST_TUPLES_PER_PAGE and EXTERN_TUPLES_PER_PAGE ought to be looked
> at.)
> 
> Whether it's possible to get the storage parameter in there depends on
> how soon someone produces a patch.  Given that we understand this area
> fairly well, I personally would be willing to give it a pass on the
> "feature freeze" rule, as long as we have the patch by say mid-April.

I meant to say a clear "yes" to that, but I've other business stuff for
two weeks in mid-April so I'll need to rely on colleagues to take up the
challenge.

-- 
  Simon Riggs 
  EnterpriseDB   http://www.enterprisedb.com



---(end of broadcast)---
TIP 9: In versions below 8.0, the planner will ignore your desire to
   choose an index scan if your joining column's datatypes do not
   match


Re: [HACKERS] Modifying TOAST thresholds

2007-04-04 Thread Tom Lane
Bruce Momjian <[EMAIL PROTECTED]> writes:
> Simon Riggs wrote:
>> Having both default GUC and individual table-level WITH parameters seems
>> like the best way to me.

> Agreed.

There's an extremely good reason not to have a GUC variable, which is
that changes in it would fail to reflect into decisions about whether to
create TOAST tables.  When I first brought up the point I didn't see a
way around it, but now that I do, I don't think we should expose a
failure mode just to have a GUC.

> OK, but we need to throw a clear message when the TOAST table needs to
> be created by the administrator.

No, we just need to not have a GUC for this.  There's no GUC for default
fill factor; have you seen anyone complain about that?

> The big question is whether this is for 8.3 or 8.4.

What I would definitely like to see for 8.3 is some performance testing
done to determine whether we ought to change the current defaults.
(Both TOAST_TUPLES_PER_PAGE and EXTERN_TUPLES_PER_PAGE ought to be looked
at.)

Whether it's possible to get the storage parameter in there depends on
how soon someone produces a patch.  Given that we understand this area
fairly well, I personally would be willing to give it a pass on the
"feature freeze" rule, as long as we have the patch by say mid-April.

regards, tom lane

---(end of broadcast)---
TIP 9: In versions below 8.0, the planner will ignore your desire to
   choose an index scan if your joining column's datatypes do not
   match


Re: [HACKERS] Modifying TOAST thresholds

2007-04-04 Thread Bruce Momjian
Simon Riggs wrote:
> On Mon, 2007-04-02 at 22:23 -0400, Tom Lane wrote:
> > Chris Browne <[EMAIL PROTECTED]> writes:
> > > [EMAIL PROTECTED] (Tom Lane) writes:
> > >> ... tuning the TOAST parameters seems like
> > >> something we understand well enough already, we just need to put some
> > >> cycles into testing different alternatives.  I would have no objection
> > >> to someone working on that during April and delivering a final patch
> > >> sometime before beta.
> > 
> > > Here's a "drafty" patch that *tries* to do this using a GUC variable;
> > > it passes some interactive testing.
> 
> Having both default GUC and individual table-level WITH parameters seems
> like the best way to me.

Agreed.

> > I came across a couple of issues while fooling with decoupling
> > TOAST_TUPLE_THRESHOLD from TOAST_MAX_CHUNK_SIZE:
> > 
> > * Should TOAST_TUPLE_TARGET be configurable separately from
> > TOAST_TUPLE_THRESHOLD?  It certainly doesn't make sense for the target
> > to be larger, but perhaps it is sane to want it to be smaller.
> 
> I can't see I'd ever set them differently in practice. Sounds like too
> many people would get confused and set them wrong anyhow.

OK.

> > * There's a hardwired assumption in the system that
> > TOAST_TUPLE_THRESHOLD is invariant: we do not create a toast table at
> > all when we can prove that the maximum tuple width is less than
> > TOAST_TUPLE_THRESHOLD (see needs_toast_table() in toasting.c).
> > Clearly this will not do if TOAST_TUPLE_THRESHOLD can be changed.
> > Should we abandon the notion altogether, and create a toast table
> > anytime the table contains any toastable types?  
> 
> That will create many more catalog entries than we have now, which seems
> not that great a side-effect.

OK, but we need to throw a clear message when the TOAST table needs to
be created by the administrator.

> > Or should we revel
> > in configurability, and allow CREATE TABLE/ALTER TABLE behavior to vary
> > depending on the current threshold setting?  We'd have to fix the
> > toaster routines to not try to push stuff out-of-line when there is no
> > out-of-line to push to ... but I think we probably had better do that
> > anyway for robustness, if we're allowing any variability at all in these
> > numbers.
> 
> Sounds like the best plan.

Agreed.  If you add a GUC without pg_class storage of the value at
CREATE TABLE/ALTER TABLE time, the GUC has to be checked at INSERT time,
meaning if the GUC changes, you might need a TOAST table during an
INSERT, which is going to fail.

The big question is whether this is for 8.3 or 8.4.  I think adding a
GUC just for 8.3, without pg_class storage, will be a problem because
the GUC behavior will change once pg_class storage exists, i.e. GUC will
control at CREATE TABLE/ALTER TABLE rather than at INSERT time.

-- 
  Bruce Momjian  <[EMAIL PROTECTED]>  http://momjian.us
  EnterpriseDB   http://www.enterprisedb.com

  + If your life is a hard drive, Christ can be your backup. +

---(end of broadcast)---
TIP 9: In versions below 8.0, the planner will ignore your desire to
   choose an index scan if your joining column's datatypes do not
   match


Re: [HACKERS] Modifying TOAST thresholds

2007-04-04 Thread Simon Riggs
On Mon, 2007-04-02 at 22:23 -0400, Tom Lane wrote:
> Chris Browne <[EMAIL PROTECTED]> writes:
> > [EMAIL PROTECTED] (Tom Lane) writes:
> >> ... tuning the TOAST parameters seems like
> >> something we understand well enough already, we just need to put some
> >> cycles into testing different alternatives.  I would have no objection
> >> to someone working on that during April and delivering a final patch
> >> sometime before beta.
> 
> > Here's a "drafty" patch that *tries* to do this using a GUC variable;
> > it passes some interactive testing.

Having both default GUC and individual table-level WITH parameters seems
like the best way to me.

> I came across a couple of issues while fooling with decoupling
> TOAST_TUPLE_THRESHOLD from TOAST_MAX_CHUNK_SIZE:
> 
> * Should TOAST_TUPLE_TARGET be configurable separately from
> TOAST_TUPLE_THRESHOLD?  It certainly doesn't make sense for the target
> to be larger, but perhaps it is sane to want it to be smaller.

I can't see I'd ever set them differently in practice. Sounds like too
many people would get confused and set them wrong anyhow.

> * There's a hardwired assumption in the system that
> TOAST_TUPLE_THRESHOLD is invariant: we do not create a toast table at
> all when we can prove that the maximum tuple width is less than
> TOAST_TUPLE_THRESHOLD (see needs_toast_table() in toasting.c).
> Clearly this will not do if TOAST_TUPLE_THRESHOLD can be changed.
> Should we abandon the notion altogether, and create a toast table
> anytime the table contains any toastable types?  

That will create many more catalog entries than we have now, which seems
not that great a side-effect.

> Or should we revel
> in configurability, and allow CREATE TABLE/ALTER TABLE behavior to vary
> depending on the current threshold setting?  We'd have to fix the
> toaster routines to not try to push stuff out-of-line when there is no
> out-of-line to push to ... but I think we probably had better do that
> anyway for robustness, if we're allowing any variability at all in these
> numbers.

Sounds like the best plan.

-- 
  Simon Riggs 
  EnterpriseDB   http://www.enterprisedb.com



---(end of broadcast)---
TIP 7: You can help support the PostgreSQL project by donating at

http://www.postgresql.org/about/donate


Re: [HACKERS] Modifying TOAST thresholds

2007-04-03 Thread Luke Lonergan
Tom,

On 4/3/07 7:15 AM, "Tom Lane" <[EMAIL PROTECTED]> wrote:

> BTW, it strikes me that a GUC variable is quite the wrong way to go
> about this.  The right way is a table storage parameter, a la FILLFACTOR,
> so that it can be set on a per-table basis.  That would also give us a
> chance to fix my concern about needs_toast_table: the case where we
> might need a toast table that we didn't need before is where the toast
> threshold is lowered via ALTER TABLE SET, and we could reasonably make
> that command recheck the situation.

This also seems to also support further development along the vertical
partitioning path.  Any thought on support of per-column TOAST tables?

- Luke



---(end of broadcast)---
TIP 1: if posting/reading through Usenet, please send an appropriate
   subscribe-nomail command to [EMAIL PROTECTED] so that your
   message can get through to the mailing list cleanly


Re: [HACKERS] Modifying TOAST thresholds

2007-04-03 Thread Tom Lane
Chris Browne <[EMAIL PROTECTED]> writes:
> Here's a "drafty" patch that *tries* to do this using a GUC variable;
> it passes some interactive testing.

BTW, it strikes me that a GUC variable is quite the wrong way to go
about this.  The right way is a table storage parameter, a la FILLFACTOR,
so that it can be set on a per-table basis.  That would also give us a
chance to fix my concern about needs_toast_table: the case where we
might need a toast table that we didn't need before is where the toast
threshold is lowered via ALTER TABLE SET, and we could reasonably make
that command recheck the situation.

regards, tom lane

---(end of broadcast)---
TIP 5: don't forget to increase your free space map settings


Re: [HACKERS] Modifying TOAST thresholds

2007-04-03 Thread Zeugswetter Andreas ADI SD

> > ... should we revel
> > in configurability, and allow CREATE TABLE/ALTER TABLE behavior to 
> > vary depending on the current threshold setting?  We'd have to fix
the 
> > toaster routines to not try to push stuff out-of-line when there is
no 
> > out-of-line to push to ... but I think we probably had better do
that 
> > anyway for robustness, if we're allowing any variability at all in 
> > these numbers.
> 
> Actually, upon looking closely at the toast code, it already 
> does the right thing when there's no toast table.  Good on 
> someone for getting that right.  But we still need to think 
> about whether it's sane for CREATE/ALTER TABLE to condition 
> the create-a-toast-table decision on a parameter that may now 
> be changeable.

I think it is ok to decide during creation with current settings.
When a user wants a toast table that has not been created we can direct
them to use some dummy "alter table ... set storage ..." and create a
toast 
table if it does not exist (and the new settings opt for one).

And a new threshold has immediate consequences for inline compression,
so a change is not ignored. 

Andreas

---(end of broadcast)---
TIP 1: if posting/reading through Usenet, please send an appropriate
   subscribe-nomail command to [EMAIL PROTECTED] so that your
   message can get through to the mailing list cleanly


Re: [HACKERS] Modifying TOAST thresholds

2007-04-02 Thread Chris Browne
[EMAIL PROTECTED] (Tom Lane) writes:
> Chris Browne <[EMAIL PROTECTED]> writes:
>> [EMAIL PROTECTED] (Tom Lane) writes:
>>> ... tuning the TOAST parameters seems like
>>> something we understand well enough already, we just need to put some
>>> cycles into testing different alternatives.  I would have no objection
>>> to someone working on that during April and delivering a final patch
>>> sometime before beta.
>
>> Here's a "drafty" patch that *tries* to do this using a GUC variable;
>> it passes some interactive testing.
>
> I came across a couple of issues while fooling with decoupling
> TOAST_TUPLE_THRESHOLD from TOAST_MAX_CHUNK_SIZE:
>
> * Should TOAST_TUPLE_TARGET be configurable separately from
> TOAST_TUPLE_THRESHOLD?  It certainly doesn't make sense for the target
> to be larger, but perhaps it is sane to want it to be smaller.

In the longer run, it would be desirable for there to be by-table
configurability.  Ergo my use of the word "default" in the variable
name; that default can remain relevant even in a future 8.4
enhancement.

I'm not sure what to prefer with regards to TOAST_TUPLE_TARGET; as you
say, it oughtn't be larger than the THRESHOLD value, but I'm not sure
how to rationally set it to a specific lower value.

> * There's a hardwired assumption in the system that
> TOAST_TUPLE_THRESHOLD is invariant: we do not create a toast table at
> all when we can prove that the maximum tuple width is less than
> TOAST_TUPLE_THRESHOLD (see needs_toast_table() in toasting.c).
> Clearly this will not do if TOAST_TUPLE_THRESHOLD can be changed.
> Should we abandon the notion altogether, and create a toast table
> anytime the table contains any toastable types?  Or should we revel
> in configurability, and allow CREATE TABLE/ALTER TABLE behavior to vary
> depending on the current threshold setting?  We'd have to fix the
> toaster routines to not try to push stuff out-of-line when there is no
> out-of-line to push to ... but I think we probably had better do that
> anyway for robustness, if we're allowing any variability at all in these
> numbers.
>
> Comments?

In the 8.3 context, it seems to me that "simplicity rulez."

In some future version, it would be attractive to have this all
variable on a table by table basis; it would at present seem
preferable for the default behaviour to be as little divergent from
past behaviour as possible.

I think I'd be willing to live with the logic that there's no toast
table defined if it was "proven" at create time that we couldn't need
TOAST.  That would conform with present behaviour, and remains simple.

The other logical option would be to always create the TOAST table if
there exist extendible columns.

Those two seem to be the options that are most rational to choose
between.  I'm happy to defer to well-argued opinions on the matter...
-- 
(reverse (concatenate 'string "moc.enworbbc" "@" "enworbbc"))
http://cbbrowne.com/info/lsf.html
"Please, Captain.  Not in front of the Klingons."
-- Leonard Nimoy as Spock in Star Trek V, The Final Frontier

---(end of broadcast)---
TIP 1: if posting/reading through Usenet, please send an appropriate
   subscribe-nomail command to [EMAIL PROTECTED] so that your
   message can get through to the mailing list cleanly


Re: [HACKERS] Modifying TOAST thresholds

2007-04-02 Thread Tom Lane
I wrote:
> ... should we revel
> in configurability, and allow CREATE TABLE/ALTER TABLE behavior to vary
> depending on the current threshold setting?  We'd have to fix the
> toaster routines to not try to push stuff out-of-line when there is no
> out-of-line to push to ... but I think we probably had better do that
> anyway for robustness, if we're allowing any variability at all in these
> numbers.

Actually, upon looking closely at the toast code, it already does the
right thing when there's no toast table.  Good on someone for getting
that right.  But we still need to think about whether it's sane for
CREATE/ALTER TABLE to condition the create-a-toast-table decision on
a parameter that may now be changeable.

regards, tom lane

---(end of broadcast)---
TIP 5: don't forget to increase your free space map settings


Re: [HACKERS] Modifying TOAST thresholds

2007-04-02 Thread Tom Lane
I wrote:
> Gregory Stark <[EMAIL PROTECTED]> writes:
>> Is there any reason to experiment with this? I would have thought we would
>> divorce TOAST_MAX_CHUNK_SIZE from TOAST_THRESHOLD and hard code it as the 
>> same
>> expression that's there now. Ie, the largest size that can fit in a page.

> No, right now it's the largest size that you can fit 4 on a page.  It's
> not obvious to me that 4 is optimal once it's divorced from TOAST_THRESHOLD.
> It seems possible that the correct number is 1, and even if it's useful
> to keep the tuples smaller than that, there's no reason to assume 4 is
> the best number per page.

I've just committed changes that make it trivial to experiment with the
number of toast tuples per page:

#define EXTERN_TUPLES_PER_PAGE  4   /* tweak only this */

/* Note: sizeof(PageHeaderData) includes the first ItemId on the page */
#define EXTERN_TUPLE_MAX_SIZE   \
MAXALIGN_DOWN((BLCKSZ - \
   MAXALIGN(sizeof(PageHeaderData) + (EXTERN_TUPLES_PER_PAGE-1) 
* sizeof(ItemIdData))) \
  / EXTERN_TUPLES_PER_PAGE)

#define TOAST_MAX_CHUNK_SIZE\
(EXTERN_TUPLE_MAX_SIZE -\
 MAXALIGN(offsetof(HeapTupleHeaderData, t_bits)) -  \
 sizeof(Oid) -  \
 sizeof(int32) -\
 VARHDRSZ)

Anyone who's got time to run performance experiments, have at it ...

regards, tom lane

---(end of broadcast)---
TIP 3: Have you checked our extensive FAQ?

   http://www.postgresql.org/docs/faq


Re: [HACKERS] Modifying TOAST thresholds

2007-04-02 Thread Tom Lane
Chris Browne <[EMAIL PROTECTED]> writes:
> [EMAIL PROTECTED] (Tom Lane) writes:
>> ... tuning the TOAST parameters seems like
>> something we understand well enough already, we just need to put some
>> cycles into testing different alternatives.  I would have no objection
>> to someone working on that during April and delivering a final patch
>> sometime before beta.

> Here's a "drafty" patch that *tries* to do this using a GUC variable;
> it passes some interactive testing.

I came across a couple of issues while fooling with decoupling
TOAST_TUPLE_THRESHOLD from TOAST_MAX_CHUNK_SIZE:

* Should TOAST_TUPLE_TARGET be configurable separately from
TOAST_TUPLE_THRESHOLD?  It certainly doesn't make sense for the target
to be larger, but perhaps it is sane to want it to be smaller.

* There's a hardwired assumption in the system that
TOAST_TUPLE_THRESHOLD is invariant: we do not create a toast table at
all when we can prove that the maximum tuple width is less than
TOAST_TUPLE_THRESHOLD (see needs_toast_table() in toasting.c).
Clearly this will not do if TOAST_TUPLE_THRESHOLD can be changed.
Should we abandon the notion altogether, and create a toast table
anytime the table contains any toastable types?  Or should we revel
in configurability, and allow CREATE TABLE/ALTER TABLE behavior to vary
depending on the current threshold setting?  We'd have to fix the
toaster routines to not try to push stuff out-of-line when there is no
out-of-line to push to ... but I think we probably had better do that
anyway for robustness, if we're allowing any variability at all in these
numbers.

Comments?

regards, tom lane

---(end of broadcast)---
TIP 6: explain analyze is your friend


Re: [HACKERS] Modifying TOAST thresholds

2007-04-02 Thread Tom Lane
Bruce Momjian <[EMAIL PROTECTED]> writes:
> Tom, are you going to do this for 8.3?

Right, I promised to do that --- will work on it now.

regards, tom lane

---(end of broadcast)---
TIP 7: You can help support the PostgreSQL project by donating at

http://www.postgresql.org/about/donate


Re: [HACKERS] Modifying TOAST thresholds

2007-04-02 Thread Bruce Momjian

Tom, are you going to do this for 8.3?

---

Tom Lane wrote:
> In another thread I wrote:
> > ... One thing I was just thinking about is that it's silly to have
> > the threshold constrained so strongly by a desire that tuples in toast
> > tables not be toastable.  It would be trivial to tweak the heapam.c
> > routines so that they simply don't invoke the toaster when relkind is
> > 't', and then we could have independent choices of toast-tuple size and
> > main-tuple size.  This would be particularly good because in the current
> > scheme you can't modify toast-tuple size without an initdb, but if that
> > were decoupled there'd be no reason not to allow changes in the
> > main-tuple thresholds.
> 
> After thinking about this more I'm convinced that the above is a good
> idea, eg in heap_insert change
> 
> if (HeapTupleHasExternal(tup) || tup->t_len > TOAST_TUPLE_THRESHOLD)
> heaptup = toast_insert_or_update(relation, tup, NULL, use_wal);
> else
> heaptup = tup;
> 
> to
> 
> if (relation->rd_rel->relkind == RELKIND_TOASTVALUE)
> {
> /* toast table entries should never be recursively toasted */
> Assert(!HeapTupleHasExternal(tup));
> heaptup = tup;
> }
> else if (HeapTupleHasExternal(tup) || tup->t_len > TOAST_TUPLE_THRESHOLD)
> heaptup = toast_insert_or_update(relation, tup, NULL, use_wal);
> else
> heaptup = tup;
> 
> I also think that we ought to add TOAST_MAX_CHUNK_SIZE to the set of
> compiled-in parameters that are recorded in pg_control and checked for
> compatibility at startup (like BLCKSZ) --- this will prevent anyone from
> shooting themselves in the foot while experimenting.
> 
> Any objections?
> 
> regards, tom lane
> 
> ---(end of broadcast)---
> TIP 5: don't forget to increase your free space map settings

-- 
  Bruce Momjian  <[EMAIL PROTECTED]>  http://momjian.us
  EnterpriseDB   http://www.enterprisedb.com

  + If your life is a hard drive, Christ can be your backup. +

---(end of broadcast)---
TIP 1: if posting/reading through Usenet, please send an appropriate
   subscribe-nomail command to [EMAIL PROTECTED] so that your
   message can get through to the mailing list cleanly


Re: [HACKERS] Modifying TOAST thresholds

2007-04-02 Thread Chris Browne
[EMAIL PROTECTED] (Tom Lane) writes:
> "Simon Riggs" <[EMAIL PROTECTED]> writes:
>> Well it certainly seems worth separating them. It does seem possible
>> that recursive toasting effected some of the earlier results we looked
>> at.
>
>> Would you like me to do this, or will you?
>
> I'm willing to do the code changes to separate TOAST_THRESHOLD from
> the toast chunk size, but I do not have the time or facilities to do
> any performance testing for different parameter choices.  Anyone want
> to work on that?
>
>> I'd like to get some mechanism for reducing WAL volume into 8.3, whether
>> its configurable toast or WAL reduction for UPDATEs. If for no other
>> reason than making backup and availability solutions more manageable.
>
> I think the WAL-reduction proposal needs more time and thought than is
> feasible before 8.3.  OTOH, tuning the TOAST parameters seems like
> something we understand well enough already, we just need to put some
> cycles into testing different alternatives.  I would have no objection
> to someone working on that during April and delivering a final patch
> sometime before beta.

Here's a "drafty" patch that *tries* to do this using a GUC variable;
it passes some interactive testing.  It probably needs an
assign_hook() function to do further validation (probably to make sure
that sizes are rightly aligned on both 32 and 64 bit platforms); feel
free to consider me incompetent at this stage at generating such...

I would *very* much like to see something of this sort in 8.3; that
would be of definite value to some of our applications which store
data that is a bit too small to meet the present
TOAST_TUPLE_THRESHOLD.  (E.g. - it's worth noting that common
SOAP-like XML requests are in the 700-1000 byte range; such values are
generally nicely compressible and are often not likely to be used in
summary-oriented queries on mainline tables...)

I don't think I can come up with a performance "test suite" this week,
and will be unavailable from April 6-14th; if others were to find this
valuable, and volunteer to set up some sort of test in the interim,
that would be super.  Absent that, I should be able to do some work on
this in the latter half of April.

set toast_default_threshold TO 128;
create table sample (id serial primary key, txt text);
insert into sample (txt) values 
('12347890347902317849018732417094739027847891230978041237890341278904123789012347980123479081234789041378901234789041237980123478901423789014237980123478901342789014237890142378901423');
create table sample (id serial primary key, txt text);
insert into sample (txt) values 
('12347890347902317849018732417094739027847891230978041237890341278904123789012347980123479081234789041378901234789041237980123478901423789014237980123478901342789014237890142378901423');
insert into sample (txt) values 
('12347890347902317849018732417094739027847891230978041237890341278904123789012347980123479081234789041378901234789041237980123478901423789014237980123478901342789014237890142378901423');
insert into sample (txt) values 
('12347890347902317849018732417094739027847891230978041237890341278904123789012347980123479081234789041378901234789041237980123478901423789014237980123478901342789014237890142378901423');
insert into sample (txt) values 
('12347890347902317849018732417094739027847891230978041237890341278904123789012347980123479081234789041378901234789041237980123478901423789014237980123478901342789014237890142378901423');
insert into sample (txt) values 
('12347890347902317849018732417094739027847891230978041237890341278904123789012347980123479081234789041378901234789041237980123478901423789014237980123478901342789014237890142378901423');
insert into sample (txt) values 
('12347890347902317849018732417094739027847891230978041237890341278904123789012347980123479081234789041378901234789041237980123478901423789014237980123478901342789014237890142378901423');
insert into sample (txt) values 
('12347890347902317849018732417094739027847891230978041237890341278904123789012347980123479081234789041378901234789041237980123478901423789014237980123478901342789014237890142378901423');
insert into sample (txt) values 
('12347890347902317849018732417094739027847891230978041237890341278904123789012347980123479081234789041378901234789041237980123478901423789014237980123478901342789014237890142378901423');
insert into sample (txt) values 
('12347890347902317849018732417094739027847891230978041237890341278904123789012347980123479081234789041378901234789041237980123478901423789014237980123478901342789014237890142378901423');
insert into sample (txt) values 
('12347890347902317849018732417094739027847891230978041237890341278904123789012347980123479081234789041378901234789041237980123478901423789014237980123478901342789014237890142378901423');
insert into sample (txt) values 
('123478903479023178490187324170947390278478912309780412378903412789041237890123479801234790812347890413789012347890412379801234789014237890142379801234789013427890142378901423

Re: [HACKERS] Modifying TOAST thresholds

2007-03-30 Thread Christopher Browne
[EMAIL PROTECTED] (Tom Lane) wrote:
> I'm willing to do the code changes to separate TOAST_THRESHOLD from
> the toast chunk size, but I do not have the time or facilities to do
> any performance testing for different parameter choices.  Anyone
> want to work on that?

What have you got in mind there?  I might be able to do something,
assuming that it doesn't cause heartburn that I'll be "offline" April
6-14th.

Are we simply talking about having the option of #defining a different
threshold at which items get thrown out to TOAST?  Or one of the more
sophisticated options?  Or is the idea more simply that we might
consider having the default set somewhat lower than it is at present?

And I guess a good question is, what's going to get regarded as a
meaningful test?  I've got a couple local test cases I could draw
from, unfortunately, the interaction with TOAST will more than likely
be pretty trivial, showing off "Yeah, cutting the threshold was a good
idea."  And that may not be fair to everyone's case.

[The good news is, of course, that if the end "deliverable" is a
single #define parameter that's used as the denominator to the
fraction, delivery during "beta" time is, indeed, quite trivial...]
-- 
output = ("cbbrowne" "@" "gmail.com")
http://linuxdatabases.info/info/slony.html
Outside of a dog,  a book is man's best friend. Inside  of a dog, it's
too dark to read. -Groucho Marx

---(end of broadcast)---
TIP 1: if posting/reading through Usenet, please send an appropriate
   subscribe-nomail command to [EMAIL PROTECTED] so that your
   message can get through to the mailing list cleanly


Re: [HACKERS] Modifying TOAST thresholds

2007-03-29 Thread Simon Riggs
On Thu, 2007-03-29 at 12:05 -0400, Tom Lane wrote:
> I think the WAL-reduction proposal needs more time and thought than is
> feasible before 8.3.

Agreed.

We really need to focus on the major features.

-- 
  Simon Riggs 
  EnterpriseDB   http://www.enterprisedb.com



---(end of broadcast)---
TIP 3: Have you checked our extensive FAQ?

   http://www.postgresql.org/docs/faq


Re: [HACKERS] Modifying TOAST thresholds

2007-03-29 Thread Tom Lane
"Simon Riggs" <[EMAIL PROTECTED]> writes:
> Well it certainly seems worth separating them. It does seem possible
> that recursive toasting effected some of the earlier results we looked
> at.

> Would you like me to do this, or will you?

I'm willing to do the code changes to separate TOAST_THRESHOLD from
the toast chunk size, but I do not have the time or facilities to do
any performance testing for different parameter choices.  Anyone want
to work on that?

> I'd like to get some mechanism for reducing WAL volume into 8.3, whether
> its configurable toast or WAL reduction for UPDATEs. If for no other
> reason than making backup and availability solutions more manageable.

I think the WAL-reduction proposal needs more time and thought than is
feasible before 8.3.  OTOH, tuning the TOAST parameters seems like
something we understand well enough already, we just need to put some
cycles into testing different alternatives.  I would have no objection
to someone working on that during April and delivering a final patch
sometime before beta.

regards, tom lane

---(end of broadcast)---
TIP 7: You can help support the PostgreSQL project by donating at

http://www.postgresql.org/about/donate


Re: [HACKERS] Modifying TOAST thresholds

2007-03-29 Thread Simon Riggs
On Wed, 2007-03-28 at 14:08 -0400, Tom Lane wrote:
> Gregory Stark <[EMAIL PROTECTED]> writes:
> > "Tom Lane" <[EMAIL PROTECTED]> writes:
> >> I also think that we ought to add TOAST_MAX_CHUNK_SIZE to the set of
> >> compiled-in parameters that are recorded in pg_control and checked for
> >> compatibility at startup (like BLCKSZ) --- this will prevent anyone from
> >> shooting themselves in the foot while experimenting.
> 
> > Is there any reason to experiment with this? I would have thought we would
> > divorce TOAST_MAX_CHUNK_SIZE from TOAST_THRESHOLD and hard code it as the 
> > same
> > expression that's there now. Ie, the largest size that can fit in a page.
> 
> No, right now it's the largest size that you can fit 4 on a page.  It's
> not obvious to me that 4 is optimal once it's divorced from TOAST_THRESHOLD.
> It seems possible that the correct number is 1, and even if it's useful
> to keep the tuples smaller than that, there's no reason to assume 4 is
> the best number per page.

Well it certainly seems worth separating them. It does seem possible
that recursive toasting effected some of the earlier results we looked
at.

Would you like me to do this, or will you?

I'll look again at the possibility for setting TOAST_THRESHOLD and
re-cast the test patch I have for production use. But either way it's
going to be a couple of days after freeze now.

I'd like to get some mechanism for reducing WAL volume into 8.3, whether
its configurable toast or WAL reduction for UPDATEs. If for no other
reason than making backup and availability solutions more manageable.

-- 
  Simon Riggs 
  EnterpriseDB   http://www.enterprisedb.com



---(end of broadcast)---
TIP 4: Have you searched our list archives?

   http://archives.postgresql.org


Re: [HACKERS] Modifying TOAST thresholds

2007-03-28 Thread Tom Lane
Gregory Stark <[EMAIL PROTECTED]> writes:
> "Tom Lane" <[EMAIL PROTECTED]> writes:
>> I also think that we ought to add TOAST_MAX_CHUNK_SIZE to the set of
>> compiled-in parameters that are recorded in pg_control and checked for
>> compatibility at startup (like BLCKSZ) --- this will prevent anyone from
>> shooting themselves in the foot while experimenting.

> Is there any reason to experiment with this? I would have thought we would
> divorce TOAST_MAX_CHUNK_SIZE from TOAST_THRESHOLD and hard code it as the same
> expression that's there now. Ie, the largest size that can fit in a page.

No, right now it's the largest size that you can fit 4 on a page.  It's
not obvious to me that 4 is optimal once it's divorced from TOAST_THRESHOLD.
It seems possible that the correct number is 1, and even if it's useful
to keep the tuples smaller than that, there's no reason to assume 4 is
the best number per page.

regards, tom lane

---(end of broadcast)---
TIP 1: if posting/reading through Usenet, please send an appropriate
   subscribe-nomail command to [EMAIL PROTECTED] so that your
   message can get through to the mailing list cleanly


Re: [HACKERS] Modifying TOAST thresholds

2007-03-28 Thread Gregory Stark

"Tom Lane" <[EMAIL PROTECTED]> writes:

> I also think that we ought to add TOAST_MAX_CHUNK_SIZE to the set of
> compiled-in parameters that are recorded in pg_control and checked for
> compatibility at startup (like BLCKSZ) --- this will prevent anyone from
> shooting themselves in the foot while experimenting.

Is there any reason to experiment with this? I would have thought we would
divorce TOAST_MAX_CHUNK_SIZE from TOAST_THRESHOLD and hard code it as the same
expression that's there now. Ie, the largest size that can fit in a page.

That doesn't mean it shouldn't go in pg_control of course but it would just be
a function of BLCKSIZE and the architecture alignment and not dependent on any
user configurable value.

-- 
  Gregory Stark
  EnterpriseDB  http://www.enterprisedb.com


---(end of broadcast)---
TIP 9: In versions below 8.0, the planner will ignore your desire to
   choose an index scan if your joining column's datatypes do not
   match


[HACKERS] Modifying TOAST thresholds

2007-03-28 Thread Tom Lane
In another thread I wrote:
> ... One thing I was just thinking about is that it's silly to have
> the threshold constrained so strongly by a desire that tuples in toast
> tables not be toastable.  It would be trivial to tweak the heapam.c
> routines so that they simply don't invoke the toaster when relkind is
> 't', and then we could have independent choices of toast-tuple size and
> main-tuple size.  This would be particularly good because in the current
> scheme you can't modify toast-tuple size without an initdb, but if that
> were decoupled there'd be no reason not to allow changes in the
> main-tuple thresholds.

After thinking about this more I'm convinced that the above is a good
idea, eg in heap_insert change

if (HeapTupleHasExternal(tup) || tup->t_len > TOAST_TUPLE_THRESHOLD)
heaptup = toast_insert_or_update(relation, tup, NULL, use_wal);
else
heaptup = tup;

to

if (relation->rd_rel->relkind == RELKIND_TOASTVALUE)
{
/* toast table entries should never be recursively toasted */
Assert(!HeapTupleHasExternal(tup));
heaptup = tup;
}
else if (HeapTupleHasExternal(tup) || tup->t_len > TOAST_TUPLE_THRESHOLD)
heaptup = toast_insert_or_update(relation, tup, NULL, use_wal);
else
heaptup = tup;

I also think that we ought to add TOAST_MAX_CHUNK_SIZE to the set of
compiled-in parameters that are recorded in pg_control and checked for
compatibility at startup (like BLCKSZ) --- this will prevent anyone from
shooting themselves in the foot while experimenting.

Any objections?

regards, tom lane

---(end of broadcast)---
TIP 5: don't forget to increase your free space map settings