Re: [PERFORM] select on 22 GB table causes "An I/O error occured while sending to the backend." exception

Matthew Wakeling Thu, 28 Aug 2008 13:29:45 -0700

On Thu, 28 Aug 2008, Craig James wrote:

If your processes do use the memory, then your performance goes into the toilet, and you know it's time to buy more memory or a second server, but in the meantime your server processes at least keep running while you kill the rogue processes.

I'd argue against swap ALWAYS being better than overcommit. It's a choice between your performance going into the toilet or your processes dying.

On the one hand, if someone fork-bombs you, the OOM killer has a chance of solving the problem for you, rather than you having to log onto an unresponsive machine to kill the process yourself. On the other hand, the OOM killer may kill the wrong thing. Depending on what else you use your machine for, either of the choices may be the right one.

Another point is that from a business perspective, a database that has stopped responding is equally bad regardless of whether that is because the OOM killer has appeared or because the machine is thrashing. In both cases, there is a maximum throughput that the machine can handle, and if requests appear quicker than that, the system will collapse, especially if the requests start timing out and being retried.
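To make the collapse concrete, here is a toy model of a server with a fixed per-tick capacity and clients that give up after a timeout. Everything in it (the simulate() function, the tick-based clock, the FIFO queue, the numbers) is an assumption made up for illustration, not a model of any real database.

```python
from collections import deque


def simulate(arrivals_per_tick, capacity_per_tick, ticks,
             timeout_ticks, retry_on_timeout):
    """Toy FIFO server. Returns (useful_completions, final_backlog).

    Hypothetical model: a request that has waited timeout_ticks or more
    by the time it is served counts as wasted work, because the client
    has already given up (and may have queued a retry).
    """
    queue = deque()  # each entry is the tick at which the request was queued
    completed = 0
    for now in range(ticks):
        # New requests arrive at a constant rate.
        for _ in range(arrivals_per_tick):
            queue.append(now)
        # The server handles at most capacity_per_tick requests per tick.
        served = 0
        while queue and served < capacity_per_tick:
            queued_at = queue.popleft()
            served += 1
            if now - queued_at >= timeout_ticks:
                # Too late: the work is wasted, and a retry adds more load.
                if retry_on_timeout:
                    queue.append(now)
            else:
                completed += 1
    return completed, len(queue)


if __name__ == "__main__":
    print("under capacity:", simulate(5, 10, 100, 3, True))
    print("over capacity, no retries:", simulate(15, 10, 100, 3, False))
    print("over capacity, with retries:", simulate(15, 10, 100, 3, True))
```

Below capacity the backlog stays at zero. Above it, the queue grows every tick until most of what the server does is work for clients that have already timed out, and retries only lengthen the queue.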

This problem really is caused by the kernel not having enough information on how much memory a process is going to use. I would be much in favour of replacing fork() with some more informative system call. For example, forkandexec() instead of fork() then exec() - the kernel would know that the new process will never need any of that duplicated RAM. However, there is *far* too much legacy in the old fork() call to change that now.
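For comparison, here is a small sketch of the two styles side by side. The forkandexec() call above is hypothetical; the closest existing interface I know of is POSIX posix_spawn(), which Python exposes as os.posix_spawn() (Python 3.8+, Unix only). Whether it actually avoids the duplicated-memory accounting depends on the libc and kernel, so treat that part as an assumption. The helper names run_with_fork_exec() and run_with_spawn() are made up for this example.

```python
import os
import sys


def run_with_fork_exec(argv):
    """Classic two-step launch: duplicate this process, then replace the copy."""
    pid = os.fork()
    if pid == 0:
        # Child: between fork() and exec() the kernel cannot know that the
        # duplicated address space is about to be thrown away.
        try:
            os.execv(argv[0], argv)
        finally:
            # Only reached if execv() failed.
            os._exit(127)
    _, status = os.waitpid(pid, 0)
    return os.WEXITSTATUS(status)


def run_with_spawn(argv):
    """Single-call launch: the intent to run a new program is stated up front."""
    pid = os.posix_spawn(argv[0], argv, os.environ)
    _, status = os.waitpid(pid, 0)
    return os.WEXITSTATUS(status)


if __name__ == "__main__":
    cmd = [sys.executable, "-c", "raise SystemExit(3)"]
    print(run_with_fork_exec(cmd), run_with_spawn(cmd))
```

Both helpers return the child's exit status; the difference is only in how much the kernel is told before the new program starts.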

Likewise, I would be all for Postgres managing its memory better. It would be very nice to be able to set a maximum amount of work-memory, rather than a maximum amount per backend. Each backend could then make do with however much is left of the work-memory pool when it actually executes queries. As it is, the server admin has no idea how many multiples of work-mem are going to be actually used, even knowing the maximum number of backends.
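A minimal sketch of the pool idea, to show what "make do with however much is left" could mean. This is not how Postgres works today and not a proposal for its internals; the WorkMemPool class, its method names, and the sizes are all assumptions invented for illustration.

```python
import threading


class WorkMemPool:
    """Hypothetical server-wide work-memory budget shared by all backends."""

    def __init__(self, total_kb):
        self._free_kb = total_kb
        self._lock = threading.Lock()

    def acquire(self, wanted_kb):
        """Grant up to wanted_kb, limited by what is left (possibly 0)."""
        with self._lock:
            granted = min(wanted_kb, self._free_kb)
            self._free_kb -= granted
            return granted

    def release(self, granted_kb):
        """Return a previous grant to the pool."""
        with self._lock:
            self._free_kb += granted_kb


if __name__ == "__main__":
    pool = WorkMemPool(total_kb=1000)
    first = pool.acquire(600)   # gets everything it asked for
    second = pool.acquire(600)  # gets only what remains
    third = pool.acquire(100)   # pool is empty; would have to spill to disk
    print(first, second, third)
    pool.release(second)
```

The point of the design is that the admin sets one number, the pool total, and total work-memory use can never exceed it, whatever the number of backends or sorts per query.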


Matthew

--
Of course it's your fault. Everything here's your fault - it says so in your
contract.                                    - Quark

--
Sent via pgsql-performance mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-performance
