Re: [PATCH] Exorcise zero-dimensional arrays (Was: Re: [HACKERS] Should array_length() Return NULL)

2013-04-04 Thread Boszormenyi Zoltan

On 2013-04-03 20:58, Gavin Flower wrote:

On 04/04/13 05:36, David E. Wheeler wrote:

On Apr 3, 2013, at 9:30 AM, Tom Lane t...@sss.pgh.pa.us wrote:


Fortran ... Basic ... actually I'd have thought that zero was a
minority position.  Fashions change I guess.

I say we turn the default lower bound up to 11.

David


In keeping with the level of irrationality in this thread, maybe we should set it to an 
irrational number like the square root of 2, or transcend ourselves and make it a 
transcendental number like pi!  :-)


I suppose using the square root of minus one would be considered too 
imaginative???  :-)


Nah, that would make arrays have 2 dimensions as a minimum... :-)




Cheers,
Gavin



--
--
Zoltán Böszörményi
Cybertec Schönig & Schönig GmbH
Gröhrmühlgasse 26
A-2700 Wiener Neustadt, Austria
Web: http://www.postgresql-support.de
 http://www.postgresql.at/



Re: [HACKERS] Proposal for Allow postgresql.conf values to be changed via SQL [review]

2013-04-04 Thread Amit Kapila
On Thursday, April 04, 2013 2:52 AM Robert Haas wrote:
 On Wed, Apr 3, 2013 at 2:54 PM, Tom Lane t...@sss.pgh.pa.us wrote:
  Robert Haas robertmh...@gmail.com writes:
  On Tue, Apr 2, 2013 at 12:19 PM, Peter Eisentraut pete...@gmx.net
 wrote:
  It's weird that SET LOCAL and SET SESSION actually *set* the value, and
  the second key word determines how long the setting will last.  SET
  PERSISTENT doesn't actually set the value.  I predict that this will be
  a new favorite help-it-doesn't-work FAQ.
 
  I think this is another argument against this particular syntax.  I
  have always thought that something along the lines of ALTER SYSTEM
  would be more appropriate.  ALTER DATABASE .. SET and ALTER ROLE ..
  SET don't change the value immediately either, and nobody gets
  confused about that to my knowledge.  But I can see where SET
  PERSISTENT could cause that sort of confusion.
 
  Yeah, I think I argued for using the SET syntax to start with, but
  I'm coming around to the position that SET PERSISTENT is too much
  unlike the behavior of other varieties of SET.  ALTER is sounding
  more attractive to me now.  Not sure about ALTER SYSTEM in
 particular
  though --- it's not clear that that has any real merit other than
  already existing as a keyword.  (Not that that's negligible.)
  ALTER CONFIGURATION is another alternative using an existing keyword
  that might be worth considering.
 
 Yeah, I thought about something like that.  Aside from saving on
 keywords, the reason I like ALTER SYSTEM or similar is that I suspect
 there will be other system-wide things that we may want to let people
 ALTER in the future, so I think that route might avoid an unnecessary
 proliferation of top-level commands.  I am not, however, deadly
 attached to the idea, if someone's got a good reason for preferring
 something else.

I think a second parameter in the SET command telling the scope should be
fine. As far as I can see, Oracle also has similar syntax for its ALTER
SYSTEM command (ALTER SYSTEM ... SCOPE [MEMORY | SPFILE | BOTH]).
Description in short:
SPFILE indicates that the change is made in the server parameter file. The
new setting takes effect when the database is next shut down and started up
again.
MEMORY indicates that the change is made in memory, takes effect
immediately, and persists until the database is shut down.

The only reason to show the above example is that a second parameter telling
the scope exists in other databases as well.


However, if you are not convinced by the above reasoning, then the ALTER
syntax can be as follows:
ALTER SYSTEM SET configuration_parameter {TO | =} {value | 'value'};

With Regards,
Amit Kapila.





-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Page replacement algorithm in buffer cache

2013-04-04 Thread Amit Kapila
On Thursday, April 04, 2013 7:19 AM Greg Smith wrote:
 On 4/2/13 11:54 AM, Robert Haas wrote:
  But, having said that, I still think the best idea is what Andres
  proposed, which pretty much matches my own thoughts: the bgwriter
  needs to populate the free list, so that buffer allocations don't
 have
  to wait for linear scans of the buffer array.
 
 I was hoping this one would make it to a full six years of being on the
 TODO list before it came up again, missed it by a few weeks.  The
 funniest part is that Amit even submitted a patch on this theme a few
 months ago without much feedback:
 http://www.postgresql.org/message-id/6C0B27F7206C9E4CA54AE035729E9C382852FF97@szxeml509-mbs
   That stalled where a few things have, on a) needing more regression
 test workloads, and b) wondering just what the deal with large
 shared_buffers setting degrading performance was.

For b), below are links to results where performance decreased due to large shared_buffers.

http://www.postgresql.org/message-id/attachment/27489/Results.htm
http://www.postgresql.org/message-id/6C0B27F7206C9E4CA54AE035729E9C38285442C5@szxeml509-mbx


As per my observation, it occurs when I/O starts. The dip could be due to
fluctuation, to OS scheduling, or to dirty pages being evicted sooner than
they would be otherwise.

I think further investigation would be more meaningful if the results could
be reproduced by someone other than me.

One idea for proceeding along this line could be to start with this patch and
then, based on the results, do further experiments to make it more useful.

With Regards,
Amit Kapila.



-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Drastic performance loss in assert-enabled build in HEAD

2013-04-04 Thread Nicolas Barbier
2013/4/3 Tom Lane t...@sss.pgh.pa.us:

 Kevin Grittner kgri...@ymail.com writes:

 To be honest, I don't think I've personally seen a single use case
 for matviews where they could be used if you couldn't count on an
 error if attempting to use them without the contents reflecting a
 materialization of the associated query at *some* point in time.

 Well, if we remove the WITH NO DATA clause from CREATE MATERIALIZED
 VIEW, that minimum requirement is satisfied no?

An argument against that is that computing the contents may be very expensive.

 Granting that throwing an error is actually of some use to some people,
 I would not think that people would want to turn it on via a command
 that throws away the existing view contents altogether, nor turn it off
 with a full-throated REFRESH.  There are going to need to be ways to
 incrementally update matviews, and ways to disable/enable access that
 are not tied to a complete rebuild, not to mention being based on
 user-determined rather than hard-wired criteria for what's too stale.
 So I don't think this is a useful base to build on.

Am I correct when I think that you are saying here that the “zero
pages == unscannable” logic is not very future-proof? In that case I
concur, and I also think that this knowledge leaks into way too many
other places (the VACUUM bug mentioned by Kevin is a good example).

 If you feel that scannability disable is an absolute must for version 0,
 let's invent a matview reloption or some such to implement it and let
 users turn it on and off as they wish.  That seems a lot more likely
 to still be useful two years from now.

(In the context of making an unlogged matview unscannable after a crash:)

Is it imaginable that such a reloption could (in a future
implementation) be changed during or right after crash recovery? For
example, by storing the set of “truncated by crash recovery” relations
in a shared catalog table, which is then inspected when connecting to
a database to continue the truncation (in the case of a matview by
making it unscannable)?

 And if you're absolutely convinced that unlogged matviews mustn't work as I
 suggest, we can lose those from 9.3, too.

+1. Having unlogged matviews without having incremental updates yet
isn’t super useful anyway.

 What I'd actually rather see us spending time on right now is making
 some provision for incremental updates, which I will boldly propose
 could be supported by user-written triggers on the underlying tables
 if we only diked out the prohibitions against INSERT/UPDATE/DELETE on
 matviews, and allowed them to operate on a matview's contents just like
 it was a table.  Now admittedly that would foreclose allowing matviews
 to be updatable in the updatable-view sense, but that's a feature I
 would readily give up if it meant users could build incremental update
 mechanisms this year and not two years down the road.

Please make the syntax for updating the “extent” (physical
representation) of a matview different from updating the view’s
logical contents. Examples:

(1) Require to use a special function to update the extent:

SELECT pg_mv_maintain('INSERT INTO example_matview ...');

While parsing the INSERT, the parser would know that it must interpret
“example_matview” as the matview’s extent. As the extent and the view are
currently the same, nothing must be done except allowing the INSERT only
when it is parsed in the context of pg_mv_maintain, and otherwise saying
that matviews aren’t updatable yet (“NOTICE: did you mean to update the
extent? In that case use pg_mv_maintain”).

(2) Use a different schema (cf. TOAST) for the extent, e.g., view
“public.example_matview” vs. extent “pg_mv_extent.example_matview”. I
imagine future implementations to possibly require multiple extents
anyway, e.g., for storing the “not yet applied changesets” or other
intermediate things.

 Why exactly do you think that what I'm suggesting would cause a dump and
 reload to not regenerate the data?

Expensiveness: Matviews are used in cases where the data is expensive
to compute.

 We *need* to get rid of that aspect of things.  If you must have
 scannability state in version 0, okay, but it has to be a catalog property
 not this.

+1

Nicolas

--
A. Because it breaks the logical sequence of discussion.
Q. Why is top posting bad?


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] CREATE EXTENSION BLOCKS

2013-04-04 Thread Dimitri Fontaine
Hi,

I thought we were more specific in our documentation about the extension
object itself not living in a schema, but I agree we still have room
for progress here.

David E. Wheeler da...@justatheory.com writes:
 +Note that only the extension objects will be placed into the named
 +schema; the extension itself is a database-global object.

I think you're patching the right place, but I'm not sure about the term
database-global object, which I can't find by grepping in sgml/ref.

Regards,
-- 
Dimitri Fontaine
http://2ndQuadrant.fr PostgreSQL : Expertise, Formation et Support


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] corrupt pages detected by enabling checksums

2013-04-04 Thread Simon Riggs
On 4 April 2013 02:39, Andres Freund and...@2ndquadrant.com wrote:

 Ok, I think I see the bug. And I think its been introduced in the
 checkpoints patch.

Well spotted. (I think you mean checksums patch).

 If by now the first backend has proceeded to PageSetLSN() we are writing
 different data to disk than the one we computed the checksum of
 before. Boom.

Right, so nothing else we were doing was wrong; that's why we couldn't
spot a bug. The problem is that we aren't replaying enough WAL because
the checksum on the WAL record is broken.

 I think the whole locking interactions in MarkBufferDirtyHint() need to
 be thought over pretty carefully.

When we write out a buffer with checksums enabled, we take a copy of
the buffer so that the checksum is consistent, even while other
backends may be writing hints to the same buffer.

I missed out on doing that with XLOG_HINT records, so the WAL CRC can
be incorrect because the data is scanned twice; normally that would be
OK because we have an exclusive lock on the block, but with hints we
only have share lock. So what we need to do is take a copy of the
buffer before we do XLogInsert().

Simple patch to do this attached for discussion. (Not tested).

We might also do this by modifying the WAL record to take the whole
block and bypass the BkpBlock mechanism entirely. But that's more work
and doesn't seem like it would be any cleaner. I figure let's solve the
problem first, then discuss which approach is best.

-- 
 Simon Riggs   http://www.2ndQuadrant.com/
 PostgreSQL Development, 24x7 Support, Training & Services


copy_before_XLOG_HINT.v1.patch
Description: Binary data

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Page replacement algorithm in buffer cache

2013-04-04 Thread Robert Haas
On Wed, Apr 3, 2013 at 9:49 PM, Greg Smith g...@2ndquadrant.com wrote:
 On 4/2/13 11:54 AM, Robert Haas wrote:
 But, having said that, I still think the best idea is what Andres
 proposed, which pretty much matches my own thoughts: the bgwriter
 needs to populate the free list, so that buffer allocations don't have
 to wait for linear scans of the buffer array.

 I was hoping this one would make it to a full six years of being on the TODO
 list before it came up again, missed it by a few weeks.  The funniest part
 is that Amit even submitted a patch on this theme a few months ago without
 much feedback:
 http://www.postgresql.org/message-id/6C0B27F7206C9E4CA54AE035729E9C382852FF97@szxeml509-mbs
 That stalled where a few things have, on a) needing more regression test
 workloads, and b) wondering just what the deal with large shared_buffers
 setting degrading performance was.

Those are impressive results.  I think we should seriously consider
doing something like that for 9.4.  TBH, although more workloads to
test is always better, I don't think this problem is so difficult that
we can't have some confidence in a theoretical analysis.  If I read
the original thread correctly (and I haven't looked at the patch
itself), the proposed patch would actually invalidate buffers before
putting them on the freelist.  That effectively amounts to reducing
shared_buffers, so workloads that are just on the edge of what can fit
in shared_buffers will be harmed, and those that benefit incrementally
from increased shared_buffers will be as well.

What I think we should do instead is collect the buffers that we think
are evictable and stuff them onto the freelist without invalidating
them.  When a backend allocates from the freelist, it can double-check
that the buffer still has usage_count 0.  The odds should be pretty
good.  But even if we sometimes notice that the buffer has been
touched again after being put on the freelist, we haven't expended all
that much extra effort, and that effort happened mostly in the
background.  Consider a scenario where only 10% of the buffers have
usage count 0 (which is not unrealistic).  We scan 5000 buffers and
put 500 on the freelist.  Now suppose that, due to some accident of
the workload, 75% of those buffers get touched again before they're
allocated off the freelist (which I believe to be a pessimistic
estimate for most workloads).  Now, that means that only 125 of those
500 buffers will succeed in satisfying an allocation request.  That's
still a huge win, because it means that each backend only has to examine
an average of 4 buffers before it finds one to allocate.  If it had
needed to do the freelist scan itself, it would have had to touch 40
buffers before finding one to allocate.

In real life, I think the gains are apt to be, if anything, larger.
IME, it's common for most or all of the buffer pool to be pinned at
usage count 5.  So you could easily have a situation where the arena
scan has to visit millions of buffers to find one to allocate.  If
that's happening in the background instead of the foreground, it's a
huge win.  Also, note that there's nothing to prevent the arena scan
from happening in parallel with allocations off of the freelist - so
while foreground processes are emptying the freelist, the background
process can be looking for more things to add to it.
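
To make the scheme concrete, here is a rough, self-contained sketch (hypothetical
names, single-threaded, with locking and the clock-sweep fallback omitted -- this is
not PostgreSQL's actual buffer-manager code): the background writer lists evictable
buffers without invalidating them, and an allocating backend re-checks usage_count
before accepting one.

#include <stddef.h>

#define NBUFFERS 16384

typedef struct BufDesc
{
    int     buf_id;
    int     usage_count;    /* clock-sweep usage counter */
    int     free_next;      /* next free-list entry, or -1 if not listed */
} BufDesc;

static BufDesc buffers[NBUFFERS];   /* free_next must be initialized to -1 */
static int     freelist_head = -1;

/* Background writer: list still-valid buffers with usage_count == 0. */
static void
bgwriter_fill_freelist(int start, int count)
{
    int     i;

    for (i = 0; i < count; i++)
    {
        BufDesc *buf = &buffers[(start + i) % NBUFFERS];

        if (buf->usage_count == 0 && buf->free_next == -1)
        {
            buf->free_next = freelist_head;
            freelist_head = buf->buf_id;
        }
    }
}

/* Backend: pop entries until one is still evictable. */
static BufDesc *
backend_get_victim(void)
{
    while (freelist_head != -1)
    {
        BufDesc *buf = &buffers[freelist_head];

        freelist_head = buf->free_next;
        buf->free_next = -1;

        if (buf->usage_count == 0)
            return buf;         /* still untouched since being listed */
        /* touched again in the meantime: skip it, cheaply */
    }
    return NULL;                /* caller falls back to the clock sweep */
}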

--
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


[HACKERS] Number of spinlocks

2013-04-04 Thread Heikki Linnakangas
With --disable-spinlocks, we need to know the number of spinlocks in the 
system at startup, so that we can reserve enough semaphores to mimic the 
spinlocks. It's calculated in SpinlockSemas():



/*
 * Report number of semaphores needed to support spinlocks.
 */
int
SpinlockSemas(void)
{
/*
 * It would be cleaner to distribute this logic into the affected 
modules,
 * similar to the way shmem space estimation is handled.
 *
 * For now, though, we just need a few spinlocks (10 should be plenty)
 * plus one for each LWLock and one for each buffer header. Plus one
 * for each XLog insertion slot in xlog.c.
 */
return NumLWLocks() + NBuffers + 10 + NUM_XLOGINSERT_SLOTS;
}


Ten spinlocks might've been plenty in 2001 when that comment was 
written, but we have reached that number. I grepped the sources for 
SpinLockInit, and found that we use:


1 spinlock for each LWLock
1 spinlock for each buffer
1 spinlock for each wal sender process slot (max_wal_senders in total)

1 spinlock for each partitioned hash table (2 in predicate.c, 2 in 
lock.c, 1 in buf_table.c)

1 spinlock in XLogCtl->info_lck
1 spinlock for WAL receiver (WalRcv->mutex)
1 spinlock for hot standby xid tracking (procArray->known_assigned_xids_lck)
1 spinlock for shared memory allocator (ShmemLock)
1 spinlock for shared inval messaging (shmInvalBuffer->msgnumLock)
1 spinlock for the proc array freelist (ProcStructLock)
1 spinlock for fast-path lock mechanism (FastPathStrongRelationLocks->mutex)
1 spinlock for the checkpointer (CheckpointerShmem->ckpt_lck)

That's a fixed number of 13 spinlocks, plus 1 for each LWLock, buffer, 
and wal sender.


I'll go and adjust SpinlockSemas() to take the walsenders into account, 
and bump the fixed number from 10 to 30. That should be enough headroom 
for the next ten years.
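
For reference, a minimal sketch of what the adjusted estimate might look like
(an assumption about the eventual patch, not the committed code), keeping the
per-LWLock and per-buffer terms from the function quoted above, adding one
semaphore per WAL sender slot, and bumping the fixed allowance to 30:

int
SpinlockSemas(void)
{
    /*
     * Sketch only: one semaphore per LWLock, per buffer header, per WAL
     * sender slot and per XLog insertion slot, plus a fixed allowance of
     * 30 for the miscellaneous spinlocks listed above.
     */
    return NumLWLocks() + NBuffers + max_wal_senders +
        NUM_XLOGINSERT_SLOTS + 30;
}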


- Heikki


--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] corrupt pages detected by enabling checksums

2013-04-04 Thread Andres Freund
On 2013-04-04 13:30:40 +0100, Simon Riggs wrote:
 On 4 April 2013 02:39, Andres Freund and...@2ndquadrant.com wrote:

  Ok, I think I see the bug. And I think its been introduced in the
  checkpoints patch.

 Well spotted. (I think you mean checksums patch).

Heh, yes. I was slightly tired at that point ;)

  If by now the first backend has proceeded to PageSetLSN() we are writing
  different data to disk than the one we computed the checksum of
  before. Boom.

 Right, so nothing else we were doing was wrong; that's why we couldn't
 spot a bug. The problem is that we aren't replaying enough WAL because
 the checksum on the WAL record is broken.

Well, there might be other bugs we can't see yet... But let's hope not.

  I think the whole locking interactions in MarkBufferDirtyHint() need to
  be thought over pretty carefully.

 When we write out a buffer with checksums enabled, we take a copy of
 the buffer so that the checksum is consistent, even while other
 backends may be writing hints to the same buffer.

 I missed out on doing that with XLOG_HINT records, so the WAL CRC can
 be incorrect because the data is scanned twice; normally that would be
 OK because we have an exclusive lock on the block, but with hints we
 only have share lock. So what we need to do is take a copy of the
 buffer before we do XLogInsert().

 Simple patch to do this attached for discussion. (Not tested).

 We might also do this by modifying the WAL record to take the whole
 block and bypass the BkpBlock mechanism entirely. But that's more work
 and doesn't seem like it would be any cleaner. I figure let's solve the
 problem first, then discuss which approach is best.

Unfortunately I find that approach unacceptably ugly. I don't think it's
ok to make an already *very* complicated function (XLogInsert()) in one
of the most complicated files (xlog.c) even more complicated. It
literally took me years to understand a large percentage of it.
I even think the XLOG_HINT escape hatch should be taken out and be
replaced by something outside of XLogInsert().

I don't think that approach would work with as few lines anyway, since you
need to properly pass it into XLogCheckBuffer() et al, which checks the
buffer tags and such - which it needs to properly compute the struct
BkpBlock. Also we iterate over rdata->page for the CRC computation.

I think the route you quickly sketched is more realistic. That would
remove all knowledge about XLOG_HINT from generic code, which is a very
good thing; I spent like 15 minutes yesterday wondering whether the early
return in there might be the cause of the bug...

Something like:

XLogRecPtr
XLogSaveBufferForHint(Buffer buffer)
{
    XLogRecPtr  recptr = InvalidXLogRecPtr;
    XLogRecPtr  lsn;
    XLogRecData rdata[2];
    BkpBlock    bkp;

    /*
     * make sure no checkpoint can happen while we decide about logging
     * something or not, since we don't hold any lock preventing that
     * otherwise.
     */
    Assert(MyPgXact->delayChkpt);

    /* update RedoRecPtr */
    GetRedoRecPtr();

    /* setup phony rdata element */
    rdata[0].buffer = buffer;
    rdata[0].buffer_std = true; /* is that actually ok? */

    /* force full pages on, otherwise checksums won't work? */
    if (XLogCheckBuffer(rdata, true, &lsn, &bkp))
    {
        char        copied_buffer[BLCKSZ];

        /*
         * copy buffer so we don't have to worry about concurrent hint bit
         * or lsn updates. We assume pd_lower/upper cannot be changed
         * without an exclusive lock, so the contents of bkp are not racy.
         */
        memcpy(copied_buffer, BufferGetBlock(buffer), BLCKSZ);

        rdata[0].data = (char *) &bkp;
        rdata[0].len = sizeof(BkpBlock);
        rdata[0].buffer = InvalidBuffer;
        rdata[0].buffer_std = false;
        rdata[0].next = &(rdata[1]);

        rdata[1].data = copied_buffer;
        rdata[1].len = BLCKSZ;
        rdata[1].buffer = InvalidBuffer;
        rdata[1].buffer_std = false;
        rdata[1].next = NULL;

        recptr = XLogInsert(RM_XLOG_ID, XLOG_HINT, rdata);
    }

    return recptr;
}

That should work?

Greetings,

Andres Freund

--
 Andres Freund http://www.2ndQuadrant.com/
 PostgreSQL Development, 24x7 Support, Training & Services


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


[HACKERS] Why there is a PG_GETARG_UINT32 and PG_RETURN_UINT32?

2013-04-04 Thread Rodrigo Barboza
Hi guys.
I am wondering when I can use the PG_GETARG_UINT32 and PG_RETURN_UINT32.
If postgres has no unsigned int type, what is the use of these macros?


Re: [PATCH] Exorcise zero-dimensional arrays (Was: Re: [HACKERS] Should array_length() Return NULL)

2013-04-04 Thread Merlin Moncure
On Wed, Apr 3, 2013 at 11:11 PM, Tom Lane t...@sss.pgh.pa.us wrote:
 In any case, the whole exercise is pointless if we don't change the
 visible behavior of array_dims et al.  So I think the idea that this
 would be without visible consequence is silly.  What's up for argument
 is just how much incompatibility is acceptable.

The only reasonable answer for this (a provably used, non-security,
non-standards violating, non-gross functionality breakage case) is
*zero*.  Our historically cavalier attitude towards compatibility
breakage has been an immense disservice to our users and encourages
very bad upgrade habits and is, IMNSHO, embarrassing.

Changing the way array_dims works for a minor functionality
enhancement is gratuitous and should be done, if at all, via a loudly
advertised deprecation/replacement cycle with a guarding GUC (yes, I
hate them too, but not nearly as much as the expense of qualifying
vast code bases against random compatibility breakages every release).

merlin


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Multi-pass planner

2013-04-04 Thread Robert Haas
On Wed, Apr 3, 2013 at 9:40 PM, Greg Stark st...@mit.edu wrote:
 I used to advocate a similar idea. But when questioned on list I tried to
 work out the details and ran into some problem coming up with a concrete
 plan.

 How do you compare a plan that you think has a 99% chance of running in 1ms
 but a 1% chance of taking 1s against a plan that has a 90% chance of 1ms and
 a 10% chance of taking 100ms? Which one is actually riskier? They might even
 both have the same 95% percentile run-time.

 And additionally there are different types of unknowns. Do you want to treat
 plans where we have a statistical sample that gives us a probabilistic
 answer the same as plans where we think our model just has a 10% chance of
 being wrong? The model is going to either be consistently right or
 consistently wrong for a given query but the sample will vary from run to
 run. (Or vice versa depending on the situation).

One idea that someone threw up against a wall recently during an
EnterpriseDB development meeting - I think it was Kevin Grittner but
it might have been Noah Misch, or some combination - was to have a GUC
for estimate_worstcase_fraction.  So, when computing the cost of a
path, we'd compute our current expected-case estimate, and also a
worst-case estimate, and then compute the final cost as:

estimated_cost = estimate_worstcase_fraction * worst_case_estimate +
(1 - estimate_worstcase_fraction) * expected_case_estimate

I think Kevin and I both have the intuition that even a rather small
value for estimate_worstcase_fraction - like 0.01 or 0.001 or even
smaller - would prevent a pretty significant fraction of the problems
people encounter in this area.  But users could change it, in general
or for particular queries, if it ended up being wrong in their
environment.
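
As a sketch (hypothetical names, not an existing planner API), the blend is just a
weighted sum, and even a small weight makes a plan with a catastrophic worst case
lose to a slightly more expensive but safer one:

typedef struct CostPair
{
    double      expected;   /* expected-case total cost */
    double      worst;      /* worst-case total cost */
} CostPair;

static double
blended_cost(CostPair c, double estimate_worstcase_fraction)
{
    return estimate_worstcase_fraction * c.worst +
        (1.0 - estimate_worstcase_fraction) * c.expected;
}

With estimate_worstcase_fraction = 0.01, a plan costing 100 expected but 1,000,000
worst case blends to about 10,099, so it loses to a plan costing 200 expected and
400 worst case (about 202).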

--
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] CREATE EXTENSION BLOCKS

2013-04-04 Thread David E. Wheeler
On Apr 4, 2013, at 5:16 AM, Dimitri Fontaine dimi...@2ndquadrant.fr wrote:

 David E. Wheeler da...@justatheory.com writes:
 +Note that only the extension objects will be placed into the named
 +schema; the extension itself is a database-global object.
 
 I think you're patching the right place, but I'm not sure about the term
 database-global object, which I can't find by grepping in sgml/ref.

Yeah, I wasn't sure, either, but figured someone here would know what to call 
those sorts of things.

Thanks,

David



-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] corrupt pages detected by enabling checksums

2013-04-04 Thread Simon Riggs
On 4 April 2013 15:53, Andres Freund and...@2ndquadrant.com wrote:

 Unfortunately I find that approach unacceptably ugly.

Yeh. If we can confirm its a fix we can discuss a cleaner patch and
that is much better.

-- 
 Simon Riggs   http://www.2ndQuadrant.com/
 PostgreSQL Development, 24x7 Support, Training & Services


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] psql crash fix

2013-04-04 Thread Bruce Momjian
On Tue, Apr  2, 2013 at 08:48:53PM -0400, Bruce Momjian wrote:
 I found that psql will crash if given a PSQLRC value containing a tilde:
 
   $ PSQLRC=~/x psql test
   *** glibc detected *** psql: free(): invalid pointer: 
 0x7fffb7c933ec ***
 
 This is on Debian Squeeze 6.0.7.  The fix is to pstrdup() the value
 returned by getenv(), so it can be free()'ed later --- you can't free
 getenv()-returned values:
 
As typically implemented, getenv() returns a pointer to a string
within the environment list.  The caller must take care not to
modify this string, since that would change the environment of
the process.
 
 This bug exists in 9.2 and git head.  I also removed the return value
 from expand_tilde() as no caller was using it.

Applied.
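
The fix described above boils down to roughly this pattern (a sketch using psql's
expand_tilde() and process_psqlrc_file() helpers, not the committed hunk):

    char       *envrc = getenv("PSQLRC");

    if (envrc != NULL && strlen(envrc) > 0)
    {
        /*
         * expand_tilde() may free and replace the string it is given, so it
         * must get a freshly allocated copy, never the getenv() result.
         */
        char       *fname = pstrdup(envrc);

        expand_tilde(&fname);
        process_psqlrc_file(fname);
    }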

-- 
  Bruce Momjian  br...@momjian.ushttp://momjian.us
  EnterpriseDB http://enterprisedb.com

  + It's impossible for everything to be true. +


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Regex with 32k different chars causes a backend crash

2013-04-04 Thread Heikki Linnakangas

On 04.04.2013 03:32, Noah Misch wrote:

On Wed, Apr 03, 2013 at 08:09:15PM +0300, Heikki Linnakangas wrote:

--- a/src/include/regex/regguts.h
+++ b/src/include/regex/regguts.h
@@ -148,6 +148,7 @@
  typedef short color;  /* colors of characters */
  typedef int pcolor;   /* what color promotes to */

+#define MAX_COLOR  32767   /* max value that fits in 'color' datatype */


This should use SHRT_MAX, no?  (Not that any supported platform differs here.)


I considered that, but I got all confused on whether limits.h needs to 
be included and where, if we use that. So I just used a constant 32767 
in the end. Committed that way.


I opened a ticket in the TCL bug tracker for this: 
https://sourceforge.net/tracker/?func=detail&aid=3610026&group_id=10894&atid=110894. 



- Heikki


--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


[HACKERS] puzzling JSON bug

2013-04-04 Thread Andrew Dunstan

David Wheeler has presented me with a nasty bug case.

If I do this:

   select '{"members": { "add": [3, 4]}}'::json #> '{members,add}';

then I get a crash.

If I comment out the pfree() at json.c:parse_object_field() lines 378-9 
then I get back the right result but instead get a warning like this:


   WARNING:  problem in alloc set ExprContext: bogus aset link in block
   0x1efaa80, chunk 0x1efb1f0


I'm not quite sure where I should go looking for what I've done wrong here.

cheers

andrew


--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Hash Join cost estimates

2013-04-04 Thread Stephen Frost
Tom, all,

* Tom Lane (t...@sss.pgh.pa.us) wrote:
 So that's at least going in the right direction.

I agree that this is going in the right direction; it certainly would
make the plan that I *expect* to be chosen more likely, however..

I've been fiddling with this on the very much larger overall database
where this test case came from and have found that hashing the large
table can actually be *faster* and appears to cause a more consistent
and constant amount of disk i/o (which is good).

The test case exhibits a bit of why this is the case- the per-tuple hash
lookup is way closer to the per-tuple cost of building the hash table
than I'd expect.

per-tuple cost to build the hash table (41M tuples): 0.33us
per-tuple cost to scan/do hash lookups (41M tuples): 0.29us
  (with a small hash table of only 41K entries)

The total difference being: 1233.854 vs. 1428.424, or only 194.570ms in
favor of scanning the big table instead of hashing it.

These numbers are just from those posted with my original email:

http://explain.depesz.com/s/FEq
http://explain.depesz.com/s/FOU

I've seen much worse though- I have one case where hash-the-big-table
took 5s and hash-the-small-table took almost 10s (total times).  I'm
trying to see if I can pull that out and isolate how it's different (and
see if it was just due to other load on the box).

What I'm trying to get at in this overall email is: why in the world is
it so expensive to do hash lookups?  I would have expected the cost of
the hash table to be *much* more than the cost to do a hash lookup, and
that doing hash lookups against a small hash table would be fast enough
to put serious pressure on the i/o subsystem.  Instead, the building of
the hash table actually puts more pressure and can end up being more
efficient overall.  We have a process that basically does this a whole
bunch and the hash-the-big-table operation takes about 4.7 hrs, while
the hash-the-small-table approach went well past 5 hours and was only
about 70% complete.

Thoughts?

Thanks,

Stephen


signature.asc
Description: Digital signature


Re: [HACKERS] Drastic performance loss in assert-enabled build in HEAD

2013-04-04 Thread Kevin Grittner
Early versions of the matview patch had a relisvalid flag in
pg_class to determine whether the relation was scannable.  The name
was chosen based on a similarity to the purpose of indisvalid,
although the proliferation of new bools for related issues left me
wondering if a char would be a better idea.  Based on on-list
reviews I removed that in favor of basing the state on a
zero-length heap file, in an attempt to work better with unlogged
matviews.  I can easily look back through my development branch to
find the commits which made this change and revert them if this
approach is preferred.

I realize this would need to be Real Soon Now, but before reverting
to the earlier code I want to allow a *little* more time for
opinions.

Responses to particular points below.


Tom Lane t...@sss.pgh.pa.us wrote:

 Granting that throwing an error is actually of some use to some people,
 I would not think that people would want to turn it on via a command
 that throws away the existing view contents altogether, nor turn it off
 with a full-throated REFRESH.

There is no doubt that in future versions we will want to be able
to disallow scans based on other criteria than just whether the
data is valid as of *some* point in time.  I can't imagine a
circumstance under which we would want to allow scans if it
isn't.  So, at least for this release and as a default for future
releases, I think it makes sense that a matview last CREATEd or
REFRESHed WITH NO DATA should not be scannable.  Additional knobs
for the users to control this can and should be added in future
releases.

 There are going to need to be ways to incrementally update
 matviews, and ways to disable/enable access that are not tied to
 a complete rebuild, not to mention being based on user-determined
 rather than hard-wired criteria for what's too stale.

Absolutely.  So far as I know, nobody has ever suggested or
expected otherwise.  This paper provides a useful summary of
techniques for incremental updates, with references to more
detailed papers:

http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.40.2254rep=rep1type=pdf

I expect to be working on implementing the most obvious special
cases and a catch-all general solution for 9.4.  Efficient
incremental update seems to depend on the ability to have at least
one new hidden system column, so once we get to a good point for
discussing 9.4 issues, I'll be on about that.

 If you feel that scannability disable is an absolute must for version 0,
 let's invent a matview reloption or some such to implement it and let
 users turn it on and off as they wish.  That seems a lot more likely
 to still be useful two years from now.

What about the idea of a relisvalid bool or some char column in
pg_class?

 And if you're absolutely convinced that unlogged matviews mustn't
 work as I suggest, we can lose those from 9.3, too.

I was loath to do that, but as Nicolas points out, they really
aren't that interesting without incremental update.  Perhaps it is
better to drop those until next release and have a more
sophisticated way to deal with invalidation of those -- or as you
suggested, just punt them to be empty.  (I would hate to do that,
but it might be the lesser of evils.)

 What I'd actually rather see us spending time on right now is making
 some provision for incremental updates, which I will boldly propose
 could be supported by user-written triggers on the underlying tables
 if we only diked out the prohibitions against INSERT/UPDATE/DELETE on
 matviews, and allowed them to operate on a matview's contents just like
 it was a table.  Now admittedly that would foreclose allowing matviews
 to be updatable in the updatable-view sense, but that's a feature I
 would readily give up if it meant users could build incremental update
 mechanisms this year and not two years down the road.

I think that should be the last resort, after we have a more
automated declarative way of maintaining the majority of cases.
Since I don't see too much there where using that technique with
matviews would give you more than you could do right now with
tables and triggers, hand-coding the incremental maintenance is
very low on my list of priorities.  In any event, I don't want to
rush into any such thing this close to release; that seems to me to
be clearly 9.4 material.

 Individual judges have a dashboard showing such things as the
 number of court cases which have gone beyond certain thresholds
 without action

 If you need 100% accuracy, which is what this scenario appears to be
 demanding, matviews are not for you (yet).  The existing implementation
 certainly is nowhere near satisfying this scenario.

No, we're talking about timelines measured in weeks or months.  A
nightly update is acceptable, although there are occasional gripes
by a judge that they would like to see the results of cleaning up
such cases sooner than the next morning.  An hour latency would
probably make them happy.  If async incremental update could give a

Re: [HACKERS] pg_dump selectively ignores extension configuration tables

2013-04-04 Thread Vibhor Kumar

On Mar 25, 2013, at 12:01 PM, Vibhor Kumar vibhor.ku...@enterprisedb.com 
wrote:

 
 On Mar 25, 2013, at 10:48 AM, Joe Conway m...@joeconway.com wrote:
 
 On 03/25/2013 08:12 AM, Vibhor Kumar wrote:
 Since, nobody has picked this one.
 
 If there is no objection,then I can test this patch against 9.1  9.2.
 
 Here are diffs for 9.1 and 9.2. The previous email was against 9.3 dev.
 
 Thanks Joe!
 
 will test both for 9.1 and 9.2

I did some testing on this patch with the 9.1 and 9.2 source code. Testing
included the following:
1. Configured PostGIS with 9.1 and 9.2.
2. Verified all switches of pg_dump with the regression db.
3. Checked other extensions, to verify whether this impacts those.

Everything is working as expected and I haven't found any issues during my
tests with this patch.

While testing this patch, some thoughts came to mind that I wanted to share
on this thread.
Would it be possible to have two switches for extensions in pg_dump:
1. An extension dump including the user data in extension tables.
2. A user-data-only dump from extensions.


Thanks  Regards,
Vibhor Kumar
EnterpriseDB Corporation
The Enterprise PostgreSQL Company
Blog:http://vibhork.blogspot.com



-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Hash Join cost estimates

2013-04-04 Thread Tom Lane
Stephen Frost sfr...@snowman.net writes:
 I've been fiddling with this on the very much larger overall database
 where this test case came from and have found that hashing the large
 table can actually be *faster* and appears to cause a more consistent
 and constant amount of disk i/o (which is good).

Interesting.

 What I'm trying to get at in this overall email is: why in the world is
 it so expensive to do hash lookups?

perf or oprofile reveal anything?

Also, I assume that the cases you are looking at are large enough that
even the small table doesn't fit in a single hash batch?  It could
well be that the answer has to do with some bogus or at least
unintuitive behavior of the batching process, and it isn't really at all
a matter of individual hash lookups being slow.

(You never did mention what work_mem setting you're testing, anyway.)

regards, tom lane


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Hash Join cost estimates

2013-04-04 Thread Stephen Frost
* Tom Lane (t...@sss.pgh.pa.us) wrote:
  What I'm trying to get at in this overall email is: why in the world is
  it so expensive to do hash lookups?
 
 perf or oprofile reveal anything?

Working on a test case actually- I've got one now:
http://snowman.net/~sfrost/test_case2.sql

In this example, hashing the large table is actually 2 seconds *faster*
than hashing the small table (again, all on my laptop).

 Also, I assume that the cases you are looking at are large enough that
 even the small table doesn't fit in a single hash batch?  

No, quite the opposite, sorry for not mentioning that before.  Either
side fits completely into memory w/ a single batch.  The EXPLAIN ANALYZE
output that I posted before shows that, either way, there's only one
batch involved.

 (You never did mention what work_mem setting you're testing, anyway.)

The test case above (where I got a 2s faster run time by hashing
the big table) used a work_mem of 1GB.

Thanks!

Stephen


signature.asc
Description: Digital signature


Re: [HACKERS] Multi-pass planner

2013-04-04 Thread Dimitri Fontaine
Robert Haas robertmh...@gmail.com writes:
 for estimate_worstcase_fraction.  So, when computing the cost of a
 path, we'd compute our current expected-case estimate, and also a
 worst-case estimate, and then compute the final cost as:

There also was the idea for the executor to be able to handle alternate
plans and some heuristic to determine that the actual cost of running a
plan is much higher than what's been estimated, so much so as to switch
to starting from scratch with the other plan instead.

Regards,
-- 
Dimitri Fontaine
http://2ndQuadrant.fr PostgreSQL : Expertise, Formation et Support


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Hash Join cost estimates

2013-04-04 Thread Stephen Frost
* Tom Lane (t...@sss.pgh.pa.us) wrote:
 perf or oprofile reveal anything?

Here's what we get from oprofile (perhaps not too surprising):

Hash the small table / scan the big table:
samples  cum. samples  %        cum. %   linenr info               image name    symbol name
167374   167374        47.9491  47.9491  nodeHash.c:915            postgres      ExecScanHashBucket
85041    252415        24.3624  72.3115  mcount.c:60               libc-2.15.so  __mcount_internal
28370    280785         8.1274  80.4389  _mcount.S:33              libc-2.15.so  mcount
15856    296641         4.5424  84.9814  (no location information)  [vdso] (tgid:30643 range:0x7fffe6fff000-0x7fffe6ff)  [vdso] (tgid:30643 range:0x7fffe6fff000-0x7fffe6ff)
6291     302932         1.8022  86.7836  xact.c:682                postgres      TransactionIdIsCurrentTransactionId
4555     307487         1.3049  88.0885  instrument.c:70           postgres      InstrStopNode
3849     311336         1.1027  89.1912  heapam.c:711              postgres      heapgettup_pagemode
3567     314903         1.0219  90.2130  nodeHashjoin.c:63         postgres      ExecHashJoin

Hash the big table / scan the small table:
samples  cum. samples  %        cum. %   linenr info               image name    symbol name
112060   112060        39.2123  39.2123  mcount.c:60               libc-2.15.so  __mcount_internal
36547    148607        12.7886  52.0009  nodeHash.c:709            postgres      ExecHashTableInsert
33570    182177        11.7469  63.7477  _mcount.S:33              libc-2.15.so  mcount
16383    198560         5.7328  69.4805  (no location information)  [vdso] (tgid:30643 range:0x7fffe6fff000-0x7fffe6ff)  [vdso] (tgid:30643 range:0x7fffe6fff000-0x7fffe6ff)
13200    211760         4.6190  74.0995  (no location information)  no-vmlinux   /no-vmlinux
6345     218105         2.2203  76.3197  xact.c:682                postgres      TransactionIdIsCurrentTransactionId
5250     223355         1.8371  78.1568  nodeHash.c:915            postgres      ExecScanHashBucket
4797     228152         1.6786  79.8354  heapam.c:711              postgres      heapgettup_pagemode
4661     232813         1.6310  81.4664  aset.c:563                postgres      AllocSetAlloc
4588     237401         1.6054  83.0718  instrument.c:70           postgres      InstrStopNode
3550     240951         1.2422  84.3140  memcpy-ssse3-back.S:60    libc-2.15.so  __memcpy_ssse3_back
3013     243964         1.0543  85.3684  aset.c:1109               postgres      AllocSetCheck

Looking at the 'Hash the small table / scan the big table', opannotate
claims that this is by far the worst offender:

147588 42.2808 :hashTuple = hashTuple->next;

While most of the time in the 'Hash the big table / scan the small
table' is in:

 34572 12.0975 :hashTuple->next = hashtable->buckets[bucketno];

Neither of those strike me as terribly informative though.  To be
honest, I've not really played w/ oprofile all that much.  Now that I've
got things set up to support this, I'd be happy to provide more info if
anyone has suggestions on how to get something more useful.

It does look like reducing bucket depth, as I outlined before through
the use of a 2-level hashing system, might help speed up
ExecScanHashBucket, as it would hopefully have very few (eg: 1-2)
entries to consider instead of more.  Along those same lines, I really
wonder if we're being too generous wrt the bucket-depth goal of '10'
instead of, say, '1', especially when we've got plenty of work_mem
available.

Thanks,

Stephen


signature.asc
Description: Digital signature


Re: [HACKERS] Hash Join cost estimates

2013-04-04 Thread Stephen Frost
* Stephen Frost (sfr...@snowman.net) wrote:
 85041    252415        24.3624  72.3115  mcount.c:60   libc-2.15.so  __mcount_internal
 28370    280785         8.1274  80.4389  _mcount.S:33  libc-2.15.so  mcount
[...]

And as a side-note, I'm rebuilding w/o profiling, asserts, etc and will
run it again, though I don't really expect the top-hitters to change.

Thanks,

Stephen


signature.asc
Description: Digital signature


Re: [HACKERS] puzzling JSON bug

2013-04-04 Thread Tom Lane
Andrew Dunstan and...@dunslane.net writes:
 David Wheeler has presented me with a nasty bug case.
 If I do this:

  select '{"members": { "add": [3, 4]}}'::json #> '{members,add}';

 then I get a crash.

 If I comment out the pfree() at json.c:parse_object_field() lines 378-9 
 then I get back the right result but instead get a warning like this:

 WARNING:  problem in alloc set ExprContext: bogus aset link in block
 0x1efaa80, chunk 0x1efb1f0

 I'm not quite sure where I should go looking for what I've done wrong here.

Routine array-overrun memory stomp.  The chunk header data for fname's
alloc chunk is being overwritten here:

Watchpoint 2: *(int *) 1075253088

Old value = 1074925616
New value = -1
0x50fe14 in get_array_start (state=0x40170e88) at jsonfuncs.c:688
688 _state->array_level_index[lex_level] = -1;

It appears that lex_level is 2 but only enough room for 2 entries has
been allocated in array_level_index[].

regards, tom lane


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] puzzling JSON bug

2013-04-04 Thread Andrew Dunstan


On 04/04/2013 03:39 PM, Tom Lane wrote:

Andrew Dunstan and...@dunslane.net writes:

David Wheeler has presented me with a nasty bug case.
If I do this:
 select '{"members": { "add": [3, 4]}}'::json #> '{members,add}';
then I get a crash.
If I comment out the pfree() at json.c:parse_object_field() lines 378-9
then I get back the right result but instead get a warning like this:
 WARNING:  problem in alloc set ExprContext: bogus aset link in block
 0x1efaa80, chunk 0x1efb1f0
I'm not quite sure where I should go looking for what I've done wrong here.

Routine array-overrun memory stomp.  The chunk header data for fname's
alloc chunk is being overwritten here:

Watchpoint 2: *(int *) 1075253088

Old value = 1074925616
New value = -1
0x50fe14 in get_array_start (state=0x40170e88) at jsonfuncs.c:688
688 _state->array_level_index[lex_level] = -1;

It appears that lex_level is 2 but only enough room for 2 entries has
been allocated in array_level_index[].



OK, many thanks, will fix.

cheers

andrew



--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] corrupt pages detected by enabling checksums

2013-04-04 Thread Jeff Davis
Andres,

Thank you for diagnosing this problem!

On Thu, 2013-04-04 at 16:53 +0200, Andres Freund wrote:
 I think the route you quickly sketched is more realistic. That would
 remove all knowledge about XLOG_HINT from generic code, which is a very
 good thing; I spent like 15 minutes yesterday wondering whether the early
 return in there might be the cause of the bug...

I like this approach. It may have some performance impact though,
because there are a couple extra spinlocks taken, and an extra memcpy.
The code looks good to me except that we should be consistent about the
page hole -- XLogCheckBuffer is calculating it, but then we copy the
entire page. I don't think anything can change the size of the page hole
while we have a shared lock on the buffer, so it seems OK to skip the
page hole during the copy.
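
A sketch of what skipping the hole could look like (assuming the standard page
layout from storage/bufpage.h and that pd_lower/pd_upper cannot move while at
least a shared lock is held; an illustration, not the actual patch):

/* requires storage/bufpage.h and <string.h> */
static void
copy_page_skipping_hole(char *dst, Page src)
{
    PageHeader  hdr = (PageHeader) src;
    uint16      lower = hdr->pd_lower;  /* end of the line pointer array */
    uint16      upper = hdr->pd_upper;  /* start of the tuple data */

    memcpy(dst, src, lower);                            /* header + line pointers */
    memset(dst + lower, 0, upper - lower);              /* zero the hole */
    memcpy(dst + upper, src + upper, BLCKSZ - upper);   /* tuples + special space */
}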

Another possible approach is to drop the lock on the buffer and
re-acquire it in exclusive mode after we find out we'll need to do
XLogInsert. It means that MarkBufferDirtyHint may end up blocking for
longer, but would avoid the memcpy. I haven't really thought through the
details.

Regards,
Jeff Davis




-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Hash Join cost estimates

2013-04-04 Thread Stephen Frost
* Stephen Frost (sfr...@snowman.net) wrote:
 It does look like reducing bucket depth, as I outlined before through
 the use of a 2-level hashing system, might help speed up
 ExecScanHashBucket, as it would hopefully have very few (eg: 1-2)
 entries to consider instead of more.  Along those same lines, I really
 wonder if we're being too generous wrt the bucket-depth goal of '10'
 instead of, say, '1', especially when we've got plenty of work_mem
 available.

Rerunning using a minimally configured build (only --enable-openssl
and --enable-debug passed to configure) with NTUP_PER_BUCKET set to '1'
results in a couple of interesting things-

First, the planner actually picks the plan to hash the small table and
seqscan the big one.  That also, finally, turns out to be *faster* for
this test case.

explain analyze results here:
Hash small table / seqscan big table: http://explain.depesz.com/s/nP1
Hash big table / seqscan small table: http://explain.depesz.com/s/AUv

Here's the oprofile reports:

Hash small table / seqscan big table:
samples  cum. samples  %        cum. %   linenr info               image name    symbol name
39023    39023         52.8574  52.8574  nodeHash.c:915            postgres      ExecScanHashBucket
3743     42766          5.0700  57.9273  xact.c:682                postgres      TransactionIdIsCurrentTransactionId
3110     45876          4.2126  62.1399  nodeHashjoin.c:63         postgres      ExecHashJoin
2561     48437          3.4689  65.6088  heapam.c:711              postgres      heapgettup_pagemode
2427     50864          3.2874  68.8962  heapam.c:300              postgres      heapgetpage
2395     53259          3.2441  72.1403  heaptuple.c:1028          postgres      slot_deform_tuple
2395     55654          3.2441  75.3843  heaptuple.c:1135          postgres      slot_getattr
2383     58037          3.2278  78.6122  nodeHash.c:786            postgres      ExecHashGetHashValue
1811     59848          2.4530  81.0652  tqual.c:1044              postgres      HeapTupleSatisfiesMVCC
1796     61644          2.4327  83.4979  execScan.c:111            postgres      ExecScan
1298     62942          1.7582  85.2561  hashfunc.c:517            postgres      hash_uint32
1274     64216          1.7257  86.9817  execProcnode.c:356        postgres      ExecProcNode
1011     65227          1.3694  88.3511  heapam.c:1453             postgres      heap_getnext
905      66132          1.2258  89.5770  execTuples.c:333          postgres      ExecStoreTuple
858      66990          1.1622  90.7392  fmgr.c:1291               postgres      FunctionCall1Coll
835      67825          1.1310  91.8702  execQual.c:668            postgres      ExecEvalScalarVarFast
834      68659          1.1297  92.      mcxt.c:126                postgres      MemoryContextReset
818      69477          1.1080  94.1078  nodeSeqscan.c:48          postgres      SeqNext

Hash big table / seqscan small table:
samples  cum. samples  %        cum. %   linenr info               image name    symbol name
38612    38612         41.2901  41.2901  nodeHash.c:709            postgres      ExecHashTableInsert
7435     46047          7.9507  49.2408  (no location information)  no-vmlinux   /no-vmlinux
4900     50947          5.2399  54.4806  aset.c:563                postgres      AllocSetAlloc
3803     54750          4.0668  58.5474  xact.c:682                postgres      TransactionIdIsCurrentTransactionId
3335     58085          3.5663  62.1137  heapam.c:711              postgres      heapgettup_pagemode
2532     60617          2.7076  64.8213  nodeHash.c:786            postgres      ExecHashGetHashValue
2523     63140          2.6980  67.5193  memcpy-ssse3-back.S:60    libc-2.15.so  __memcpy_ssse3_back
2518     65658          2.6926  70.2119  heaptuple.c:1028          postgres      slot_deform_tuple
2378     68036          2.5429  72.7549  heapam.c:300              postgres      heapgetpage
2374     70410          2.5387  75.2935  heaptuple.c:1135          postgres      slot_getattr
1852     72262          1.9805  77.2740  nodeHash.c:915            postgres      ExecScanHashBucket
1831     74093          1.9580  79.2320  tqual.c:1044              postgres      HeapTupleSatisfiesMVCC
1732     75825          1.8521  81.0841  heapam.c:1453             postgres      heap_getnext
1320     77145          1.4116  82.4957  nodeHash.c:76             postgres      MultiExecHash
1219     78364          1.3035  

Re: [HACKERS] Multi-pass planner

2013-04-04 Thread Robert Haas
On Thu, Apr 4, 2013 at 2:53 PM, Dimitri Fontaine dimi...@2ndquadrant.fr wrote:
 Robert Haas robertmh...@gmail.com writes:
 for estimate_worstcase_fraction.  So, when computing the cost of a
 path, we'd compute our current expected-case estimate, and also a
 worst-case estimate, and then compute the final cost as:

 There also was the idea for the executor to be able to handle alternate
 plans and some heuristic to determine that the actual cost of running a
 plan is much higher than what's been estimated, so much so as to switch
 to starting from scratch with the other plan instead.

Yeah.  The thing is, if the plan has any side effects, that's not
really an option.  And even if it doesn't, it may throw away a lot of
work.  One thing we could do is switch from a unparameterized nested
loop to a hash join if the outer side turns out to be much larger than
expected, but that's only going to benefit a pretty narrow set of use
cases.  Which is why I think a planner-based approach is probably more
promising.

--
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Hash Join cost estimates

2013-04-04 Thread Tom Lane
Stephen Frost sfr...@snowman.net writes:
 Looking with opannotate, there's two main hotspots in
 ExecScanHashBucket:

  12846 17.4001 :hashTuple = hashtable->buckets[hjstate->hj_CurBucketNo];
 and
  22100 29.9348 :hashTuple = hashTuple->next;

Those are, of course, pretty trivial statements; so the way I read this
is that the fundamental slowdown comes from the hash table being large
compared to the CPU's cache, so that you're incurring lots of cache
misses at these particular fetches.  (You might be able to confirm that
if you can set oprofile to count cache misses rather than wall clock
time.)

 I'm certainly curious about those, but I'm also very interested in the
 possibility of making NTUP_PER_BUCKET much smaller, or perhaps variable
 depending on the work_mem setting.

Not sure about that.  That would make the hash-bucket-header array
larger without changing the size of the rest of the hash table, thus
probably making the CPU cache situation worse not better (which would
manifest as more time at the first of these two statements relative to
the second).

Can you add some instrumentation code to get us information about the
average/max length of the bucket chains?  And maybe try to figure out
how many distinct hash values per bucket, which would give us a clue
whether your two-level-list idea is worth anything.
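
For the chain-length half of that, a rough sketch of the kind of instrumentation
meant (assuming the HashJoinTable/HashJoinTuple layout in executor/hashjoin.h,
called once after the hash table is built, e.g. under a debug #ifdef; counting
distinct hash values per bucket would need an extra pass comparing hashvalue
within each chain):

static void
report_bucket_chain_lengths(HashJoinTable hashtable)
{
    long        total = 0;
    int         maxlen = 0;
    int         i;

    for (i = 0; i < hashtable->nbuckets; i++)
    {
        int             len = 0;
        HashJoinTuple   tuple = hashtable->buckets[i];

        while (tuple != NULL)
        {
            len++;
            tuple = tuple->next;
        }
        total += len;
        if (len > maxlen)
            maxlen = len;
    }

    elog(LOG, "hash buckets: %d, avg chain length: %.2f, max chain length: %d",
         hashtable->nbuckets,
         (double) total / hashtable->nbuckets,
         maxlen);
}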

regards, tom lane


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [PATCH] Exorcise zero-dimensional arrays (Was: Re: [HACKERS] Should array_length() Return NULL)

2013-04-04 Thread Robert Haas
On Thu, Apr 4, 2013 at 11:10 AM, Merlin Moncure mmonc...@gmail.com wrote:
 The only reasonable answer for this (a provably used, non-security,
 non-standards violating, non-gross functionality breakage case) is
 *zero*.

+1.

--
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Clang compiler warning on 9.3 HEAD

2013-04-04 Thread Alvaro Herrera
Will Leinweber wrote:
 On ref 8507907 when compiling with clang on OS X, I got this warning which
 seems like a possible bug.
 
 I thought to report this because I imagine clang isn't frequently used
 day-to-day by most.

Ugh.  My fault.  Yes, this is a bug.

I don't see any nice way to convert ObjectType to ObjectClass or the
other way around; it seems to me the only simple way to fix this is to
add a new function EventTriggerSupportsObjectClass(), which takes
ObjectClass instead of ObjectType.  Patch attached.

Now, it annoys me that we now have three places that know about object
types supported by event triggers: there's a large struct of command tag
substrings (event_trigger_support), then there's these two functions.
It might be better to add ObjectType and ObjectClass entries to the
struct, so that only the struct needs to know about that.  The problem
is that these two functions would have to walk the struct every time,
instead of being a simple switch.

-- 
Álvaro Herrera                http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Training & Services
*** a/src/backend/catalog/dependency.c
--- b/src/backend/catalog/dependency.c
***
*** 210,216  deleteObjectsInList(ObjectAddresses *targetObjects, Relation *depRel,
  			ObjectAddress *thisobj = targetObjects->refs + i;
  
  			if ((!(flags & PERFORM_DELETION_INTERNAL)) &&
! 				EventTriggerSupportsObjectType(getObjectClass(thisobj)))
  			{
  EventTriggerSQLDropAddObject(thisobj);
  			}
--- 210,216 
  			ObjectAddress *thisobj = targetObjects->refs + i;
  
  			if ((!(flags & PERFORM_DELETION_INTERNAL)) &&
! 				EventTriggerSupportsObjectClass(getObjectClass(thisobj)))
  			{
  EventTriggerSQLDropAddObject(thisobj);
  			}
*** a/src/backend/commands/event_trigger.c
--- b/src/backend/commands/event_trigger.c
***
*** 912,917  EventTriggerSupportsObjectType(ObjectType obtype)
--- 912,939 
  }
  
  /*
+  * Do event triggers support this object class?
+  */
+ bool
+ EventTriggerSupportsObjectClass(ObjectClass objclass)
+ {
+ 	switch (objclass)
+ 	{
+ 		case OCLASS_DATABASE:
+ 		case OCLASS_TBLSPACE:
+ 		case OCLASS_ROLE:
+ 			/* no support for global objects */
+ 			return false;
+ 		case OCLASS_EVENT_TRIGGER:
+ 			/* no support for event triggers on event triggers */
+ 			return false;
+ 		default:
+ 			break;
+ 	}
+ 	return true;
+ }
+ 
+ /*
   * Prepare event trigger state for a new complete query to run, if necessary;
   * returns whether this was done.  If it was, EventTriggerEndCompleteQuery must
   * be called when the query is done, regardless of whether it succeeds or fails
*** a/src/include/commands/event_trigger.h
--- b/src/include/commands/event_trigger.h
***
*** 13,18 
--- 13,19 
  #ifndef EVENT_TRIGGER_H
  #define EVENT_TRIGGER_H
  
+ #include "catalog/dependency.h"
  #include "catalog/objectaddress.h"
  #include "catalog/pg_event_trigger.h"
  #include "nodes/parsenodes.h"
***
*** 41,46  extern Oid AlterEventTriggerOwner(const char *name, Oid newOwnerId);
--- 42,48 
  extern void AlterEventTriggerOwner_oid(Oid, Oid newOwnerId);
  
  extern bool EventTriggerSupportsObjectType(ObjectType obtype);
+ extern bool EventTriggerSupportsObjectClass(ObjectClass objclass);
  extern void EventTriggerDDLCommandStart(Node *parsetree);
  extern void EventTriggerDDLCommandEnd(Node *parsetree);
  extern void EventTriggerSQLDrop(Node *parsetree);

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] corrupt pages detected by enabling checksums

2013-04-04 Thread Andres Freund
On 2013-04-04 12:59:36 -0700, Jeff Davis wrote:
 Andres,
 
 Thank you for diagnosing this problem!
 
 On Thu, 2013-04-04 at 16:53 +0200, Andres Freund wrote:
  I think the route you quickly sketched is more realistic. That would
  remove all knowledge about XLOG_HINT from generic code which is a very
  good thing, I spent like 15 minutes yesterday wondering whether the early
  return in there might be the cause of the bug...
 
 I like this approach. It may have some performance impact though,
 because there are a couple extra spinlocks taken, and an extra memcpy.

I don't think it's really slower. Earlier the code took WalInsertLock
every time, even if we ended up not logging anything. That's far more
expensive than a single spinlock. And the copy should also only be made
in the case where we need to log. So I think we end up ahead of the
current state.

 The code looks good to me except that we should be consistent about the
 page hole -- XLogCheckBuffer is calculating it, but then we copy the
 entire page. I don't think anything can change the size of the page hole
 while we have a shared lock on the buffer, so it seems OK to skip the
 page hole during the copy.

I don't think it can change either, but I doubt that there's a
performance advantage in not copying the hole. I'd guess the simpler
code ends up faster.
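
(For concreteness, the copy-then-log approach under discussion might look
roughly like the sketch below; it is simplified, assumes the 9.3-era
single-record XLogInsert/XLogRecData API and the usual backend headers, and
is not the actual patch.)

/*
 * Simplified sketch: with only a share lock on the buffer, hint bits can
 * change underneath us while the WAL CRC is being computed, so log a
 * private copy of the page rather than the shared buffer itself.
 */
static XLogRecPtr
log_hint_page_copy(Buffer buffer)
{
	char		copied_page[BLCKSZ];
	XLogRecData rdata;

	memcpy(copied_page, BufferGetPage(buffer), BLCKSZ);

	rdata.data = copied_page;
	rdata.len = BLCKSZ;
	rdata.buffer = InvalidBuffer;	/* the copy is self-contained */
	rdata.buffer_std = false;
	rdata.next = NULL;

	return XLogInsert(RM_XLOG_ID, XLOG_HINT, &rdata);
}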

 Another possible approach is to drop the lock on the buffer and
 re-acquire it in exclusive mode after we find out we'll need to do
 XLogInsert. It means that MarkBufferDirtyHint may end up blocking for
 longer, but would avoid the memcpy. I haven't really thought through the
 details.

That sounds like it would be prone to deadlocks. So I would dislike to
go there.

Will write up a patch tomorrow.

Greetings,

Andres Freund

-- 
 Andres Freund http://www.2ndQuadrant.com/
 PostgreSQL Development, 24x7 Support, Training & Services


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [PATCH] Exorcise zero-dimensional arrays (Was: Re: [HACKERS] Should array_length() Return NULL)

2013-04-04 Thread Tom Lane
Robert Haas robertmh...@gmail.com writes:
 On Thu, Apr 4, 2013 at 11:10 AM, Merlin Moncure mmonc...@gmail.com wrote:
 The only reasonable answer for this (a provably used, non-security,
 non-standards violating, non-gross functionality breakage case) is
 *zero*.

 +1.

Well, if we're going to take that hard a line on it, then we can't
change anything about array data storage or the existing functions'
behavior; which leaves us with either doing nothing at all, or
inventing new functions that have saner behavior while leaving the
old ones in place.

regards, tom lane


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Hash Join cost estimates

2013-04-04 Thread k...@rice.edu
On Thu, Apr 04, 2013 at 04:16:12PM -0400, Stephen Frost wrote:
 * Stephen Frost (sfr...@snowman.net) wrote:
  It does look like reducing bucket depth, as I outlined before through
  the use of a 2-level hashing system, might help speed up
  ExecScanHashBucket, as it would hopefully have very few (eg: 1-2)
  entries to consider instead of more.  Along those same lines, I really
  wonder if we're being too generous wrt the bucket-depth goal of '10'
  instead of, say, '1', especially when we've got plenty of work_mem
  available.
 
 Rerunning using a minimally configured build (only --enable-openssl
 and --enable-debug passed to configure) with NTUP_PER_BUCKET set to '1'
 results in a couple of interesting things-
 
 First, the planner actually picks the plan to hash the small table and
 seqscan the big one.  That also, finally, turns out to be *faster* for
 this test case.
 
 ...
 
 I'm certainly curious about those, but I'm also very interested in the
 possibility of making NTUP_PER_BUCKET much smaller, or perhaps variable
 depending on the work_mem setting.  It's only used in
 ExecChooseHashTableSize, so while making it variable or depending on
 work_mem could slow planning down a bit, it's not a per-tuple cost item.
 
+1 for adjusting this based on work_mem value.
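
As a strawman only (NTUP_PER_BUCKET is a fixed compile-time constant used
in ExecChooseHashTableSize today), a work_mem-dependent target could be as
simple as:

/*
 * Strawman sketch: pick a tuples-per-bucket target from work_mem (in kB)
 * instead of the fixed NTUP_PER_BUCKET.  The thresholds are invented for
 * illustration, not measured.
 */
static int
choose_ntup_per_bucket(int work_mem_kb)
{
	if (work_mem_kb >= 256 * 1024)	/* 256MB or more: aim for ~1 per bucket */
		return 1;
	if (work_mem_kb >= 16 * 1024)	/* at least 16MB */
		return 4;
	return 10;						/* historical default */
}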

Ken


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] CREATE EXTENSION BLOCKS

2013-04-04 Thread Tom Lane
David E. Wheeler da...@justatheory.com writes:
 On Apr 3, 2013, at 11:41 AM, Alvaro Herrera alvhe...@2ndquadrant.com wrote:
 Right -- an extension is not considered to live within a schema, they
 are database-global.  The objects might live in a particular schema (if
 it is relocatable), and there's support to move those to a different
 schema, but this doesn't affect the extension itself.

 Thanks. I humbly submit this patch to help prevent silly questions like this 
 in the future.

I think this should be addressed in extend.sgml not only on the CREATE
EXTENSION reference page.  After thinking awhile I came up with the
attached wording.  Further wordsmithing anyone?

regards, tom lane

diff --git a/doc/src/sgml/extend.sgml b/doc/src/sgml/extend.sgml
index 672d0df..bc1cd59 100644
*** a/doc/src/sgml/extend.sgml
--- b/doc/src/sgml/extend.sgml
***
*** 359,364 
--- 359,370 
  extension.)  Also notice that while a table can be a member of an
  extension, its subsidiary objects such as indexes are not directly
  considered members of the extension.
+ Another important point is that schemas can belong to extensions, but not
+ vice versa: an extension as such has an unqualified name and does not
+ exist <quote>within</quote> any schema.  The extension's member objects,
+ however, will belong to schemas whenever appropriate for their object
+ types.  It may or may not be appropriate for an extension to own the
+ schema(s) its member objects are within.
 </para>
  
 <sect2>
diff --git a/doc/src/sgml/ref/create_extension.sgml b/doc/src/sgml/ref/create_extension.sgml
index 4f3b9a5..9c9bf6f 100644
*** a/doc/src/sgml/ref/create_extension.sgml
--- b/doc/src/sgml/ref/create_extension.sgml
*** CREATE EXTENSION [ IF NOT EXISTS ] repl
*** 94,99 
--- 94,105 
  If not specified, and the extension's control file does not specify a
  schema either, the current default object creation schema is used.
 </para>
+<para>
+ Remember that the extension itself is not considered to be within any
+ schema: extensions have unqualified names that must be unique
+ database-wide.  But objects belonging to the extension can be within
+ schemas.
+</para>
 </listitem>
   </varlistentry>
  

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] corrupt pages detected by enabling checksums

2013-04-04 Thread Jeff Janes
On Thu, Apr 4, 2013 at 5:30 AM, Simon Riggs si...@2ndquadrant.com wrote:

 On 4 April 2013 02:39, Andres Freund and...@2ndquadrant.com wrote:

  If by now the first backend has proceeded to PageSetLSN() we are writing
  different data to disk than the one we computed the checksum of
  before. Boom.

 Right, so nothing else we were doing was wrong, that's why we couldn't
 spot a bug. The problem is that we aren't replaying enough WAL because
 the checksum on the WAL record is broken.


This brings up a pretty frightening possibility to me, unrelated to data
checksums.  If a bit gets twiddled in the WAL file due to a hardware issue
or a cosmic ray, and then a crash happens, automatic recovery will stop
early at the failed WAL checksum with an innocuous-looking message.  The
system will start up but will be invisibly inconsistent, and will proceed
to overwrite that portion of the WAL file which contains the old data (real
data that would have been necessary to reconstruct, once the corruption is
finally realized) with an end-of-recovery checkpoint record and continue
to chew up real data from there.

I don't know a solution here, though, other than trusting your hardware.
 Changing timelines upon ordinary crash recovery, not just media recovery,
seems excessive but also seems to be exactly what timelines were invented
for, right?

Back to the main topic here, Jeff Davis mentioned earlier "You'd still
think this would cause incorrect results, but"  I didn't realize the
significance of that until now.  It does produce incorrect query results.
 I was just bailing out before detecting them.  Once I specify
ignore_checksum_failure=1
my test harness complains bitterly about the data not being consistent with
what the Perl program knows it is supposed to be.


 I missed out on doing that with XLOG_HINT records, so the WAL CRC can
 be incorrect because the data is scanned twice; normally that would be
 OK because we have an exclusive lock on the block, but with hints we
 only have share lock. So what we need to do is take a copy of the
 buffer before we do XLogInsert().

 Simple patch to do this attached for discussion. (Not tested).


 We might also do this by modifying the WAL record to take the whole
 block and bypass the BkpBlock mechanism entirely. But that's more work
 and doesn't seem like it would be any cleaner. I figure lets solve the
 problem first then discuss which approach is best.



I've tested your patch it and it seems to do the job.  It has survived far
longer than unpatched ever did, with neither checksum warnings, nor
complaints of inconsistent data.  (I haven't analyzed the code much, just
the results, and leave the discussion of the best approach to others)


 Thanks,

Jeff


Re: [HACKERS] Clang compiler warning on 9.3 HEAD

2013-04-04 Thread Tom Lane
Alvaro Herrera alvhe...@2ndquadrant.com writes:
 Now, it annoys me that we now have three places that know about object
 types supported by event triggers: there's a large struct of command tag
 substrings (event_trigger_support), then there's these two functions.
 It might be better to add ObjectType and ObjectClass entries to the
 struct, so that only the struct needs to know about that.  The problem
 is that these two functions would have to walk the struct every time,
 instead of being a simple switch.

Yeah.  For the moment I think efficiency trumps having the information
in just one place, ie I agree with doing it like this.  If we find we're
constantly forgetting to fix the functions when we need to, it'd be time
enough to revisit the decision.

One thing you could do that might make it less likely we'd miss things
is to have the switches explicitly cover all the possible enum members,
instead of defaulting the majority of them as you do here.  I know when
I think about adding a new object type, I tend to choose the most
similar existing type and grep for references to it.  Likely you could
even draft the code so that most compilers would warn about an enum
value not covered in the switch.
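
A minimal, self-contained illustration of that technique, using an invented
DemoClass enum rather than the real ObjectClass:

#include <stdbool.h>

/* Invented enum standing in for ObjectClass, just to show the technique. */
typedef enum DemoClass
{
	DEMO_DATABASE,
	DEMO_TABLESPACE,
	DEMO_ROLE,
	DEMO_TABLE
} DemoClass;

/*
 * No "default" label: with -Wswitch, gcc and clang warn as soon as a new
 * DemoClass member is added but not handled here.
 */
static bool
demo_supports_class(DemoClass c)
{
	switch (c)
	{
		case DEMO_DATABASE:
		case DEMO_TABLESPACE:
		case DEMO_ROLE:
			return false;		/* "global" objects: not supported */
		case DEMO_TABLE:
			return true;
	}
	return true;				/* not reached; keeps -Wreturn-type quiet */
}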

regards, tom lane


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


[HACKERS] matview scannability rehash (was Re: Drastic performance loss in assert-enabled build in HEAD)

2013-04-04 Thread Noah Misch
Subject updated to account for the wider topics now appearing.

On Wed, Apr 03, 2013 at 05:49:18PM -0400, Tom Lane wrote:
 What I'd actually rather see us spending time on right now is making
 some provision for incremental updates, which I will boldly propose
 could be supported by user-written triggers on the underlying tables
 if we only diked out the prohibitions against INSERT/UPDATE/DELETE on
 matviews, and allowed them to operate on a matview's contents just like
 it was a table.  Now admittedly that would foreclose allowing matviews
 to be updatable in the updatable-view sense, but that's a feature I
 would readily give up if it meant users could build incremental update
 mechanisms this year and not two years down the road.

I can't see taking MVs in the direction of allowing arbitrary DML; that's what
tables are for.  Users wishing to do that should keep using current methods:

  CREATE VIEW mv_impl AS SELECT ...;
  CREATE TABLE mv AS SELECT * FROM mv_impl;
  -- REFRESH analogue: fancier approaches exist
  BEGIN; TRUNCATE mv; INSERT INTO mv SELECT * FROM mv_impl; COMMIT;

If anything, I'd help these users by introducing mechanisms for obtaining a
TRUNCATE;INSERT with the bells and whistles of REFRESH MATERIALIZED VIEW.
Namely, bulk index rebuilds, skipping WAL, and frozen tuples.

  ... Making sure that
  the heap has at least one page if data has been generated seems
  like a not-entirely-unreasonable way to do that, although there
  remains at least one vacuum bug to fix if we keep it, in addition
  to Tom's concerns.
 
 No.  This is an absolute disaster.  It's taking something we have always
 considered to be an irrelevant implementation detail and making it into
 user-visible DDL state, despite the fact that it doesn't begin to satisfy
 basic transactional behaviors.  We *need* to get rid of that aspect of
 things.  If you must have scannability state in version 0, okay, but
 it has to be a catalog property not this.

In large part, this ended up outside the catalogs due to key limitations of
the startup process.  This isn't the first time we've arranged an unusual
dance for this reason, and it probably won't be the last.  Sure, we could take
out unlogged MVs to evade the problem, but re-adding them will mean either
restoring relfilenode-based bookkeeping or attacking the startup process
limitation directly.  There exist fundamental challenges to a direct attack,
like the potential inconsistency of system catalogs themselves.  We could
teach pg_relation_is_scannable() that unlogged MVs are always non-scannable
during recovery, then somehow update system catalogs in all databases at the
end of recovery.  Not a project for 9.3, and I wouldn't be surprised to see it
mushroom in complexity.  The currently-committed approach is a good one given
the applicable constraints.

A slight variation on the committed approach would be to add a _scannable
relation fork.  The fork would either be absent (not scannable if an MV) or
empty (possibly-scannable).  Create it in CREATE MATERIALIZED VIEW sans WITH
NO DATA and in REFRESH MATERIALIZED VIEW.  Remove it in TRUNCATE.  When the
startup process reinitializes an unlogged relation, it would also remove any
_scannable fork.  This has a few advantages over the current approach: VACUUM
won't need a special case, and pg_upgrade will be in a better position to blow
away all traces if we introduce a better approach.  The disadvantage is an
extra inode per healthy MV.  (Though it does not lead to a 9.3 solution, I'll
note that an always-present relation metapage would help here.)

Note that I said possibly-scannable -- the relfilenode-based indicator
(whether the committed approach or something else) doesn't need to remain the
only input to the question of scannability.  If 9.5 introduces the concept of
age-based scannability expiration, the applicable timestamp could go in
pg_class, and pg_relation_is_scannable() could check both that and the
relfilenode-based indicator.


Concerning the original $SUBJECT, I would look at fixing the performance
problem by having pg_relation_is_scannable() use an algorithm like this:

1. Grab the pg_class entry from syscache.  If it's not found, return NULL.
2. If it's not a matview, return false.
3. Lock the matview and try to open a relcache entry.  Return NULL on failure.
4. Return the scannability as reported by the relcache.

This boils down to the CASE statement you noted upthread, except putting the
fast-exit logic in the function rather than in its caller(s).
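
In code form, that would be roughly the following (a sketch only, usual
backend headers assumed; the RelationIsScannable() test at the end stands
in for however the relcache reports scannability):

Datum
pg_relation_is_scannable(PG_FUNCTION_ARGS)
{
	Oid			relid = PG_GETARG_OID(0);
	HeapTuple	tuple;
	Form_pg_class classForm;
	Relation	rel;
	bool		result;

	/* 1. Grab the pg_class entry from syscache; NULL if it's gone. */
	tuple = SearchSysCache1(RELOID, ObjectIdGetDatum(relid));
	if (!HeapTupleIsValid(tuple))
		PG_RETURN_NULL();
	classForm = (Form_pg_class) GETSTRUCT(tuple);

	/* 2. Anything that is not a matview is simply "false". */
	if (classForm->relkind != RELKIND_MATVIEW)
	{
		ReleaseSysCache(tuple);
		PG_RETURN_BOOL(false);
	}
	ReleaseSysCache(tuple);

	/* 3. Lock the matview and build a relcache entry; NULL on failure. */
	rel = try_relation_open(relid, AccessShareLock);
	if (rel == NULL)
		PG_RETURN_NULL();

	/* 4. Report scannability as the relcache sees it. */
	result = RelationIsScannable(rel);
	relation_close(rel, AccessShareLock);

	PG_RETURN_BOOL(result);
}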

nm

-- 
Noah Misch
EnterpriseDB http://www.enterprisedb.com


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Drastic performance loss in assert-enabled build in HEAD

2013-04-04 Thread Noah Misch
On Thu, Apr 04, 2013 at 12:28:01PM +0200, Nicolas Barbier wrote:
 2013/4/3 Tom Lane t...@sss.pgh.pa.us:
  And if you're absolutely convinced that unlogged matviews mustn't work as I
  suggest, we can lose those from 9.3, too.
 
 +1. Having unlogged matviews without having incremental updates yet
 isn't super useful anyway.

I would have surmised the opposite: since an unlogged MV requires a full
refresh at unpredictable moments, logged MVs will be preferred where a refresh
is prohibitively expensive.  Why might unlogged-MV applications desire
incremental updates more acutely than logged-MV applications?

-- 
Noah Misch
EnterpriseDB http://www.enterprisedb.com


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] matview scannability rehash (was Re: Drastic performance loss in assert-enabled build in HEAD)

2013-04-04 Thread Tom Lane
Noah Misch n...@leadboat.com writes:
 On Wed, Apr 03, 2013 at 05:49:18PM -0400, Tom Lane wrote:
 No.  This is an absolute disaster.  It's taking something we have always
 considered to be an irrelevant implementation detail and making it into
 user-visible DDL state, despite the fact that it doesn't begin to satisfy
 basic transactional behaviors.  We *need* to get rid of that aspect of
 things.  If you must have scannability state in version 0, okay, but
 it has to be a catalog property not this.

 In large part, this ended up outside the catalogs due to key limitations of
 the startup process.

Yeah, I realize that there's no other (easy) way to make unlogged
matviews reset to an invalid state on crash, but that doesn't make this
design choice less of a disaster.  It boxes us into something that's
entirely unable to support transitions between scannable and unscannable
states by any means short of a complete rewrite of the matview contents;
which seems fundamentally incompatible with any sort of incremental
update scenario.  And I remain of the opinion that it's going to box us
into not being able to fix the problems because of pg_upgrade
on-disk-compatibility issues.  We will be far better off to drop
unlogged matviews until we can come up with a better design.  If that's
so far off that no one can see it happening, well, that's tough.
Leaving the door open for incremental maintenance is more important.

 A slight variation on the committed approach would be to add a _scannable
 relation fork.

Not very transaction-safe, I think (consider crash midway through a
transaction that adds or removes the fork), and in any case orders of
magnitude more expensive than looking at a pg_class field.  This really
needs to be catalog state, not filesystem state.

regards, tom lane


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] matview scannability rehash (was Re: Drastic performance loss in assert-enabled build in HEAD)

2013-04-04 Thread Alvaro Herrera
Tom Lane wrote:
 Noah Misch n...@leadboat.com writes:

  A slight variation on the committed approach would be to add a _scannable
  relation fork.
 
 Not very transaction-safe, I think (consider crash midway through a
 transaction that adds or removes the fork), and in any case orders of
 magnitude more expensive than looking at a pg_class field.  This really
 needs to be catalog state, not filesystem state.

We could revive the pg_class_nt patch proposed a decade ago, perhaps ...

-- 
Álvaro Herrera                http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Training & Services


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] patch to add \watch to psql

2013-04-04 Thread Tom Lane
Will Leinweber w...@heroku.com writes:
 Here is an updated patch that addresses several of the points brought up so
 far, such as the sleep, internationalization banner, and zero wait check,
 and it removes the premature input check.

I whacked this around some more, added basic docs, and committed it.

 Unfortunately rl_clear_screen() is not included at all in libedit, causing
 compilation to fail, and I was completely unable to find a way to
 distinguish libedit from readline on OS X. It tries extraordinarily hard to
 pretend that it's readline. Instead falling back to simple control
 characters to clear the screen worked very well on both linux and OS X.

I took that out; "works on the two cases I tried" does not mean portable.

It's possible we could do something involving having configure check for
rl_clear_screen() etc, but that seems like more work than is justified,
not to mention that the results would then be platform-dependent
*by design*.  Frankly I kinda prefer the behavior without a screen clear
anyway; though this may just prove that I'm not accustomed to using the
original watch.  Anyway, that's open to a followup patch if anybody is
sufficiently set on doing it differently.

regards, tom lane


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] patch to add \watch to psql

2013-04-04 Thread Will Leinweber
On Thu, Apr 4, 2013 at 5:04 PM, Tom Lane t...@sss.pgh.pa.us wrote:


 I whacked this around some more, added basic docs, and committed it.


Thanks!


   Unfortunately rl_clear_screen() is not included at all in libedit, causing
  compilation to fail, and I was completely unable to find a way to
  distinguish libedit from readline on OS X. It tries extraordinarily hard to
  pretend that it's readline. Instead falling back to simple control
  characters to clear the screen worked very well on both Linux and OS X.

 I took that out; works on the two cases I tried does not mean portable.


Completely understandable.


I'm very excited to have helped contribute something, however small this
was, to the project.

Thanks again,
Will


Re: [HACKERS] corrupt pages detected by enabling checksums

2013-04-04 Thread Jeff Davis
On Thu, 2013-04-04 at 14:21 -0700, Jeff Janes wrote:

 This brings up a pretty frightening possibility to me, unrelated to
 data checksums.  If a bit gets twiddled in the WAL file due to a
 hardware issue or a cosmic ray, and then a crash happens, automatic
 recovery will stop early with the failed WAL checksum with
 an innocuous looking message.  The system will start up but will be
 invisibly inconsistent, and will proceed to overwrite that portion of
 the WAL file which contains the old data (real data that would have
 been necessary to reconstruct, once the corruption is finally realized
 ) with an end-of-recovery checkpoint record and continue to chew up
 real data from there.

I've been worried about that for a while, and I may have even seen
something like this happen before. We could perhaps do some checks, but
in general it seems hard to solve without writing flushing some data to
two different places. For example, you could flush WAL, and then update
an LSN stored somewhere else indicating how far the WAL has been
written. Recovery could complain if it gets an error in the WAL before
that point.
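
No such mechanism exists today, but the shape of it might be roughly the
following (file name and placement purely hypothetical; assumes
access/xlogdefs.h for XLogRecPtr):

#include <fcntl.h>
#include <unistd.h>
#include "access/xlogdefs.h"

/*
 * Hypothetical sketch: after a successful WAL flush, persist the flushed
 * LSN in a second, separately fsync'd location so that recovery can tell
 * "end of WAL" apart from "corrupt WAL".
 */
static void
record_flushed_lsn(XLogRecPtr flushed_lsn)
{
	int			fd = open("pg_durable_lsn", O_WRONLY | O_CREAT, 0600);

	if (fd < 0)
		return;					/* real code would ereport() instead */
	(void) write(fd, &flushed_lsn, sizeof(flushed_lsn));
	(void) fsync(fd);
	(void) close(fd);
}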

But obviously, that makes WAL flushes expensive (in many cases, about
twice as expensive).

Maybe it's not out of the question to offer that as an option if nobody
has a better idea. Performance-conscious users could place the extra LSN
on an SSD or NVRAM or something; or maybe use commit_delay or async
commits. It would only need to store a few bytes.

Streaming replication mitigates the problem somewhat, by being a second
place to write WAL.

Regards,
Jeff Davis




-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] corrupt pages detected by enabling checksums

2013-04-04 Thread Jeff Davis
On Thu, 2013-04-04 at 22:39 +0200, Andres Freund wrote:
 I don't think it's really slower. Earlier the code took WalInsertLock
 every time, even if we ended up not logging anything. That's far more
 expensive than a single spinlock. And the copy should also only be made
 in the case where we need to log. So I think we end up ahead of the
 current state.

Good point.

  The code looks good to me except that we should be consistent about the
  page hole -- XLogCheckBuffer is calculating it, but then we copy the
  entire page. I don't think anything can change the size of the page hole
  while we have a shared lock on the buffer, so it seems OK to skip the
  page hole during the copy.
 
 I don't think it can change either, but I doubt that there's a
 performance advantage by not copying the hole. I'd guess the simpler
 code ends up faster.

I was thinking more about the WAL size, but I don't have a strong
opinion.

Regards,
Jeff Davis



-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] corrupt pages detected by enabling checksums

2013-04-04 Thread Tom Lane
Jeff Davis pg...@j-davis.com writes:
 On Thu, 2013-04-04 at 14:21 -0700, Jeff Janes wrote:
 This brings up a pretty frightening possibility to me, unrelated to
 data checksums.  If a bit gets twiddled in the WAL file due to a
 hardware issue or a cosmic ray, and then a crash happens, automatic
 recovery will stop early with the failed WAL checksum with
 an innocuous looking message.  The system will start up but will be
 invisibly inconsistent, and will proceed to overwrite that portion of
 the WAL file which contains the old data (real data that would have
 been necessary to reconstruct, once the corruption is finally realized
 ) with an end-of-recovery checkpoint record and continue to chew up
 real data from there.

 I've been worried about that for a while, and I may have even seen
 something like this happen before. We could perhaps do some checks, but
 in general it seems hard to solve without writing flushing some data to
 two different places. For example, you could flush WAL, and then update
 an LSN stored somewhere else indicating how far the WAL has been
 written. Recovery could complain if it gets an error in the WAL before
 that point.

 But obviously, that makes WAL flushes expensive (in many cases, about
 twice as expensive).

 Maybe it's not out of the question to offer that as an option if nobody
 has a better idea. Performance-conscious users could place the extra LSN
 on an SSD or NVRAM or something; or maybe use commit_delay or async
 commits. It would only need to store a few bytes.

At least on traditional rotating media, this is only going to perform
well if you dedicate two drives to the purpose.  At which point you
might as well just say let's write two copies of WAL.  Or maybe three
copies, so that you can take a vote when they disagree.  While this is
not so unreasonable on its face for ultra-high-reliability requirements,
I can't escape the feeling that we'd just be reinventing software RAID.
There's no reason to think that we can deal with this class of problems
better than the storage system can.

 Streaming replication mitigates the problem somewhat, by being a second
 place to write WAL.

Yeah, if you're going to do this at all it makes more sense for the
redundant copies to be on other machines.  So the questions that that
leads to are how smart is our SR code about dealing with a master that
tries to re-send WAL regions it already sent, and what a slave ought to
do in such a situation if the new data doesn't match.

regards, tom lane


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [PATCH] Exorcise zero-dimensional arrays (Was: Re: [HACKERS] Should array_length() Return NULL)

2013-04-04 Thread Brendan Jurd
On 5 April 2013 07:43, Tom Lane t...@sss.pgh.pa.us wrote:
 Well, if we're going to take that hard a line on it, then we can't
 change anything about array data storage or the existing functions'
 behavior; which leaves us with either doing nothing at all, or
 inventing new functions that have saner behavior while leaving the
 old ones in place.

And then we are in the awkward position of trying to explain the
differences in behaviour between the old and new functions ...
presumably with a dash of "for historical reasons" and a sprinkling of
"to preserve compatibility" in every other paragraph.

The other suggestion that had been tossed around elsewhere upthread
was inventing a new type that serves the demand for a straightforward
mutable list, which has exactly one dimension, and which may be
sensibly empty.  Those few who are interested in dimensions >= 2 could
keep on using arrays, with all their backwards-compatible silliness
intact, and everybody else could migrate to lists at their leisure.

I don't hate the latter idea from a user perspective, but from a
developer perspective I suspect there are valid objections to be made.

Cheers,
BJ


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] corrupt pages detected by enabling checksums

2013-04-04 Thread Jeff Davis
On Thu, 2013-04-04 at 21:06 -0400, Tom Lane wrote:
 I can't escape the feeling that we'd just be reinventing software RAID.
 There's no reason to think that we can deal with this class of problems
 better than the storage system can.

The goal would be to reliably detect a situation where WAL that has been
flushed successfully was later corrupted; not necessarily to recover
when we find it. Unless it's something like ZFS, I don't think most
software RAID will reliably detect corruption.

A side benefit is that it would also help catch bugs like this in the
future.

I'm not advocating for this particular solution, but I do concur with
Jeff Janes that it's a little bit scary and that we can entertain some
ideas about how to mitigate it.

Regards,
Jeff Davis




-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [PATCH] Exorcise zero-dimensional arrays (Was: Re: [HACKERS] Should array_length() Return NULL)

2013-04-04 Thread Tom Lane
Brendan Jurd dire...@gmail.com writes:
 The other suggestion that had been tossed around elsewhere upthread
 was inventing a new type that serves the demand for a straightforward
 mutable list, which has exactly one dimension, and which may be
 sensibly empty.  Those few who are interested in dimensions = 2 could
 keep on using arrays, with all their backwards-compatible silliness
 intact, and everybody else could migrate to lists at their leisure.

 I don't hate the latter idea from a user perspective, but from a
 developer perspective I suspect there are valid objections to be made.

The real problem with that is that the existing arrays have glommed onto
the syntax that is both most natural and SQL-spec-required.  I don't
think there is a lot of room to shove in a different kind of critter
there.  (There's been a remarkable lack of attention to the question
of spec compliance in this thread, btw.  Surely the standard has
something to say on the matter of zero-length arrays?)

regards, tom lane


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [PATCH] Exorcise zero-dimensional arrays (Was: Re: [HACKERS] Should array_length() Return NULL)

2013-04-04 Thread Brendan Jurd
On 5 April 2013 13:04, Tom Lane t...@sss.pgh.pa.us wrote:
 (There's been a remarkable lack of attention to the question
 of spec compliance in this thread, btw.  Surely the standard has
 something to say on the matter of zero-length arrays?)

From 4.10 in my draft copy of Foundation, arrays are one of two
collection types (the other being multisets), and:

  A collection is a composite value comprising zero or more elements,
each a value of some data type DT

  The number of elements in C is the cardinality of C

  An array is a collection A in which each element is associated with
exactly one ordinal position in A. If n is
the cardinality of A, then the ordinal position p of an element is an
integer in the range 1 (one) ≤ p ≤ n.

The language specifically allows for zero elements, and does not
contemplate multiple dimensions.  The specification for the array
constructor syntax (6.36) and array element reference by subscript
(6.23) also make it fairly clear that only 1-D arrays were being
considered.

I'd say we've already gone way off-menu by having multidims.  A more
compliant approach would have been to implement arrays as 1-D only,
and then maybe have a separate thing (matrices?) for multidims.

While I was in there I noticed CARDINALITY, which would be pretty easy
to add and would at least provide a more productive way to get the
real length of an array without disrupting existing functionality:

<cardinality expression> ::=
    CARDINALITY <left paren> <collection value expression> <right paren>

Cheers,
BJ


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [PATCH] Exorcise zero-dimensional arrays (Was: Re: [HACKERS] Should array_length() Return NULL)

2013-04-04 Thread Tom Lane
Brendan Jurd dire...@gmail.com writes:
 On 5 April 2013 13:04, Tom Lane t...@sss.pgh.pa.us wrote:
 (There's been a remarkable lack of attention to the question
 of spec compliance in this thread, btw.  Surely the standard has
 something to say on the matter of zero-length arrays?)

 The language specifically allows for zero elements, and does not
 contemplate multiple dimensions.  The specification for the array
 constructor syntax (6.36) and array element reference by subscript
 (6.23) also make it fairly clear that only 1-D arrays were being
 considered.

 I'd say we've already gone way off-menu by having multidims.

Yeah, we knew that.  I don't have a problem with seeing multidim arrays
as an extension to the standard though.  The point is that the behavior
for the 1-D, default-lower-bound case ought to match the standard.

 While I was in there I noticed CARDINALITY, which would be pretty easy
 to add and would at least provide a more productive way to get the
 real length of an array without disrupting existing functionality:

Yeah, that would at least fix the null-result-for-empty-array problem
for that particular functionality.  Still, this is ammunition for the
position that null results for empty arrays are just broken.

BTW ... if you check the archives you will find that we had
cardinality() for a short while, and removed it before 8.4 release,
because we couldn't agree on what it ought to return when given a
multi-dimensional array.  I'm afraid that issue is still unresolved.

regards, tom lane


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Why there is a PG_GETARG_UINT32 and PG_RETURN_UINT32?

2013-04-04 Thread Amit Kapila
On Thursday, April 04, 2013 8:30 PM Rodrigo Barboza wrote:

 Hi guys.
 I am wondering when I can use the PG_GETARG_UINT32 and PG_RETURN_UINT32.
 If postgres has no unsigned int type, what is the use of these macros?

They are mainly used for contrib module functionality or some built-in
functions which are not exposed.
For example, bt_page_items() receives relation name (text) and block number
(int), but internally the block number
is uint32 as the max block number can be 0xFFFFFFFE.

With Regards,
Amit Kapila.



-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Why there is a PG_GETARG_UINT32 and PG_RETURN_UINT32?

2013-04-04 Thread Rodrigo Barboza
I am creating my own uint32 type and faced this function.
But my args are always of type my_int and one of the signed int types as
postgres doesn't have unsigned.
Could I use those functions in operations between those types?
I can't see a place for this; I don't know if I am missing something


On Fri, Apr 5, 2013 at 1:12 AM, Amit Kapila amit.kap...@huawei.com wrote:

 On Thursday, April 04, 2013 8:30 PM Rodrigo Barboza wrote:

  Hi guys.
  I am wondering when I can use the PG_GETARG_UINT32 and PG_RETURN_UINT32.
  If postgres has no unsigned int type, what is the use of these macros?

 They are mainly used for contrib module functionality or some built-in
 functions which are not exposed.
 For example, bt_page_items() receives relation name (text) and block number
 (int), but internally the block number
 is uint32 as the max block number can be 0xFFFFFFFE.

 With Regards,
 Amit Kapila.




Re: [HACKERS] Multi-pass planner

2013-04-04 Thread Amit Kapila
On Friday, April 05, 2013 1:59 AM Robert Haas wrote:
 On Thu, Apr 4, 2013 at 2:53 PM, Dimitri Fontaine
 dimi...@2ndquadrant.fr wrote:
  Robert Haas robertmh...@gmail.com writes:
  for estimate_worstcase_fraction.  So, when computing the cost of a
  path, we'd compute our current expected-case estimate, and also a
  worst-case estimate, and then compute the final cost as:
 
  There also was the idea for the executor to be able to handle
 alternate
  plans and some heuristic to determine that the actual cost of running
 a
  plan is much higher than what's been estimated, so much so as to
 switch
  to starting from scratch with the other plan instead.
 
 Yeah.  The thing is, if the plan has any side effects, that's not
 really an option.  And even if it doesn't, it may throw away a lot of
 work.  

Why throw away all the work? It could as well try to repair the plan, or,
even if plan repair is not possible, it could keep multiple plans and
next time try to choose the best among the available ones.


With Regards,
Amit Kapila.



-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [PATCH] Exorcise zero-dimensional arrays (Was: Re: [HACKERS] Should array_length() Return NULL)

2013-04-04 Thread Brendan Jurd
On 5 April 2013 15:05, Tom Lane t...@sss.pgh.pa.us wrote:
 Brendan Jurd dire...@gmail.com writes:
 While I was in there I noticed CARDINALITY, which would be pretty easy
 to add and would at least provide a more productive way to get the
 real length of an array without disrupting existing functionality:

 Yeah, that would at least fix the null-result-for-empty-array problem
 for that particular functionality.  Still, this is ammunition for the
 position that null results for empty arrays are just broken.

 BTW ... if you check the archives you will find that we had
 cardinality() for a short while, and removed it before 8.4 release,
 because we couldn't agree on what it ought to return when given a
 multi-dimensional array.  I'm afraid that issue is still unresolved.

Well for what it's worth I would expect cardinality() to return the
total number of elements in the array (per ArrayGetNItems).  It's
consistent with the spec's identification of an array as a
collection.  You can chunk the elements into dimensions however you
want, but it's still a collection of elements, and the cardinality is
still the number of elements.
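
For what it's worth, a cardinality() with those semantics is nearly trivial
on top of the existing array code (sketch only, usual backend headers
assumed):

/*
 * Sketch: count all elements regardless of dimensionality, per the
 * "collection" interpretation above.
 */
Datum
array_cardinality_sketch(PG_FUNCTION_ARGS)
{
	ArrayType  *v = PG_GETARG_ARRAYTYPE_P(0);

	PG_RETURN_INT32(ArrayGetNItems(ARR_NDIM(v), ARR_DIMS(v)));
}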

The nesting interpretation doesn't accord with our internal
representation, nor with our requirement that multidim arrays be
regular, nor with the fact that we can't put an array of texts inside
an array of ints.  Our array input syntaxes for multidim arrays look
nest-ish but what they produce is not nested.

Cheers,
BJ


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Page replacement algorithm in buffer cache

2013-04-04 Thread Amit Kapila
On Thursday, April 04, 2013 6:12 PM Robert Haas wrote:
 On Wed, Apr 3, 2013 at 9:49 PM, Greg Smith g...@2ndquadrant.com
 wrote:
  On 4/2/13 11:54 AM, Robert Haas wrote:
  But, having said that, I still think the best idea is what Andres
  proposed, which pretty much matches my own thoughts: the bgwriter
  needs to populate the free list, so that buffer allocations don't
 have
  to wait for linear scans of the buffer array.
 
  I was hoping this one would make it to a full six years of being on the
  TODO list before it came up again, missed it by a few weeks.  The funniest
  part is that Amit even submitted a patch on this theme a few months ago
  without much feedback:
  http://www.postgresql.org/message-id/6C0B27F7206C9E4CA54AE035729E9C382852FF97@szxeml509-mbs
  That stalled where a few things have, on a) needing more regression test
  workloads, and b) wondering just what the deal with large shared_buffers
  setting degrading performance was.
 
 Those are impressive results.  I think we should seriously consider
 doing something like that for 9.4.  TBH, although more workloads to
 test is always better, I don't think this problem is so difficult that
 we can't have some confidence in a theoretical analysis.  If I read
 the original thread correctly (and I haven't looked at the patch
 itself), the proposed patch would actually invalidate buffers before
 putting them on the freelist.  That effectively amounts to reducing
 shared_buffers, so workloads that are just on the edge of what can fit
 in shared_buffers will be harmed, and those that benefit incrementally
 from increased shared_buffers will be as well.
 
 What I think we should do instead is collect the buffers that we think
 are evictable and stuff them onto the freelist without invalidating
 them.  When a backend allocates from the freelist, it can double-check
 that the buffer still has usage_count 0.  The odds should be pretty
 good.  But even if we sometimes notice that the buffer has been
 touched again after being put on the freelist, we haven't expended all
 that much extra effort, and that effort happened mostly in the
 background.  

If we just put it on the freelist, then the next time it gets allocated
directly from the buffer hash table, who will remove it from the freelist?
Or do you think that in BufferAlloc, if the buffer is found via the hash
table, it should check whether it's on the freelist and, if so, remove it?

With Regards,
Amit Kapila.



-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Why there is a PG_GETARG_UINT32 and PG_RETURN_UINT32?

2013-04-04 Thread Amit Kapila
On Friday, April 05, 2013 10:00 AM Rodrigo Barboza wrote:

 I am creating my own uint32 type and faced this function.
 But my args are always of type my_int and one of the signed int types as
 postgres doesn't have unsigned.
 Could I use those functions in operations between those types?

It should not be a problem if your signed type doesn't have any negative
value.
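
For example (names hypothetical, assuming a custom my_uint32 type stored as
uint32 and the usual backend headers), a cross-type comparison only needs to
reject negative signed input before casting:

/*
 * Hypothetical cross-type operator function: my_uint32 = int4.  A negative
 * int4 can never equal an unsigned value, so test that before casting.
 */
Datum
my_uint32_eq_int4(PG_FUNCTION_ARGS)
{
	uint32		a = PG_GETARG_UINT32(0);	/* the custom unsigned type */
	int32		b = PG_GETARG_INT32(1);

	if (b < 0)
		PG_RETURN_BOOL(false);

	PG_RETURN_BOOL(a == (uint32) b);
}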


 I can't see a place for this; I don't know if I am missing something

On Fri, Apr 5, 2013 at 1:12 AM, Amit Kapila amit.kap...@huawei.com wrote:
On Thursday, April 04, 2013 8:30 PM Rodrigo Barboza wrote:

 Hi guys.
 I am wondering when I can use the PG_GETARG_UINT32 and PG_RETURN_UINT32.
 If postgres has no unsigned int type, what is the use of these macros?
 They are mainly used for contrib module functionality or some built-in
 functions which are not exposed.
 For example, bt_page_items() receives relation name (text) and block number
 (int), but internally the block number
 is uint32 as the max block number can be 0xFFFFFFFE.

With Regards,
Amit Kapila.




-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers