Re: [HACKERS] PostgreSQL 8.0.6 crash

Rick Gigger Thu, 09 Feb 2006 16:28:46 -0800


On Feb 9, 2006, at 12:49 PM, Mark Woodward wrote:

On Thu, Feb 09, 2006 at 02:03:41PM -0500, Mark Woodward wrote:
"Mark Woodward" <[EMAIL PROTECTED]> writes:
Again, regardless of OS used, hashagg will exceed "workingmemory" as
defined in postgresql.conf.
So? If you've got OOM kill enabled, it can zap a processwhether it's
strictly adhered to work_mem or not.  The OOM killer is entirely
capable
of choosing a victim process whose memory footprint hasn't changed
materially since it started (eg, the postmaster).
Sorry, I must strongly disagree here. The postgresql.conf"working mem"
is
a VERY IMPORTANT setting, it is intended to limit the consumption of
memory by the postgresql process. Often times PostgreSQL willwork along
Actually, no, it's not designed for that at all.
I guess that's a matter of opinion.
side other application servers on the same system, infact, may be a
sub-part of application servers on the same system. (This is, infact,
how
it is used on one of my site servers.)
Clearly, if the server will use 1000 times this number (Set for1024K,
but
exceeds 1G) this is broken, and it may cause other systems tofail or
perform very poorly.

If it is not something that can be fixed, it should be clearly
documented.
work_mem (integer)

    Specifies the amount of memory to be used by internal sort
operations and hash tables before switching to temporary diskfiles.The value is specified in kilobytes, and defaults to 1024kilobytes
    (1 MB). Note that for a complex query, several sort or hash
operations might be running in parallel; each one will beallowed touse as much memory as this value specifies before it starts toputdata into temporary files. Also, several running sessionscould bedoing such operations concurrently. So the total memory usedcould
    be many times the value of work_mem; it is necessary to keep this
fact in mind when choosing the value. Sort operations are usedfor
    ORDER BY, DISTINCT, and merge joins. Hash tables are used in hash
    joins, hash-based aggregation, and hash-based processing of IN
    subqueries.
So it says right there that it's very easy to exceed work_mem by averylarge amount. Granted, this is a very painful problem to deal withandwill hopefully be changed at some point, but it's pretty clear asto how
this works.
Well, if you read that paragraph carefully, I'll admit that I was alittletoo literal in my statement apliying it to the "process" and notspecific
functions within the process, but in the documentation:
"each one will be allowed to use as much memory as this valuespecifies
before it starts to put data into temporary files."
According to the documentation the behavior of hashagg is broken.It didnot use up to this amount and then start to use temporary files, itused
1000 times this limit and was killed by the OS.

I think it should be documented as the behavior is unpredictable.

It seems to me that the solution for THIS INCIDENT is to run ananalyze. That should fix the problem at hand. I have nothing to sayabout the OOM issue except that hopefully the analyze will preventhim from running out of memory at all.

However if hashagg truly does not obey the limit that is supposed tobe imposed by work_mem then it really ought to be documented. Isthere a misunderstanding here and it really does obey it? Or ishashagg an exception but the other work_mem associated operationswork fine? Or is it possible for them all to go out of bounds?

Even if you've got 100 terabyts of swap space though if seems like ifyour system is very heavy on reads then you would really want thatsingle backend to start using up your disk space and leave yourmemory alone so that most of your data can stay cached and largelyunaffeted by the problem of one backend.

If your bottleneck is writing to the disk then it doesn't really seemto matter. You just need to make sure that huge out of controlhashagg never occurs. If your disks get saturated with writesbecause of the hashagg of one backend then all other processes thatneed to write a lot of info to disk are going to come to a grindinghalt and queries are not going to complete quickly and build up andyou will have a huge mess on your hands that will essentially preventpostgres from being able to do it's job even if it doesn't actuallydie. In this situation disk bandwidth is a scarce commodity andwhether you let the OS handle it all with virtual memory or you letpostgres swap everything out to disc for that one operation you arestill using disc to make up for a lack of RAM. At some point youyou've either got to stock up on enough RAM to run your queriesproperly or alter how your queries run to use less RAM. Having aprocess go out of control in resource usage is going to cause bigproblems one way or another.


---------------------------(end of broadcast)---------------------------
TIP 4: Have you searched our list archives?

              http://archives.postgresql.org

Re: [HACKERS] PostgreSQL 8.0.6 crash

Reply via email to