Re: [GENERAL] Postgres 9.01, Amazon EC2/EBS, XFS, JDBC and lost connections

2011-10-11 Thread Sean Laurent
On Mon, Oct 10, 2011 at 8:09 AM, Craig Ringer ring...@ringerc.id.au wrote: On 10/07/2011 01:21 AM, Sean Laurent wrote: Within a few seconds of the backup, our application servers start throwing exceptions that indicate the database connection was closed. Meanwhile, Postgres still shows the

Re: [GENERAL] Postgres 9.01, Amazon EC2/EBS, XFS, JDBC and lost connections

2011-10-11 Thread Sean Laurent
On Fri, Oct 7, 2011 at 12:36 AM, Tom Lane t...@sss.pgh.pa.us wrote: Sean Laurent s...@studyblue.com writes: We've been running into a particularly strange problem that I'm trying to better understand. The super short version is that our application servers lose their connection to the

Re: [GENERAL] Postgres 9.01, Amazon EC2/EBS, XFS, JDBC and lost connections

2011-10-11 Thread Sean Laurent
On Tue, Oct 11, 2011 at 12:04 AM, Craig Ringer ring...@ringerc.id.au wrote: On 11/10/11 12:48, John R Pierce wrote: On 10/10/11 7:44 PM, Craig Ringer wrote: If blocking writes causes a server failure that persists once writes have been unblocked, that's a bug IMO. You might have a bit of a

Re: [GENERAL] Postgres 9.01, Amazon EC2/EBS, XFS, JDBC and lost connections

2011-10-11 Thread Scott Marlowe
On Tue, Oct 11, 2011 at 5:00 PM, Sean Laurent s...@studyblue.com wrote: As much as I would like Postgres to withstand a 2 second outage, I don't honestly care. I'd just like to figure out whether I'm looking at something that's actually a problem or if I should be looking elsewhere for the

Re: [GENERAL] Postgres 9.01, Amazon EC2/EBS, XFS, JDBC and lost connections

2011-10-11 Thread Sean Laurent
On Tue, Oct 11, 2011 at 8:50 PM, Scott Marlowe scott.marl...@gmail.com wrote: On Tue, Oct 11, 2011 at 5:00 PM, Sean Laurent s...@studyblue.com wrote: As much as I would like Postgres to withstand a 2 second outage, I don't honestly care. I'd just like to figure out whether I'm looking at

Re: [GENERAL] Postgres 9.01, Amazon EC2/EBS, XFS, JDBC and lost connections

2011-10-10 Thread Craig Ringer
On 10/07/2011 01:21 AM, Sean Laurent wrote: Within a few seconds of the backup, our application servers start throwing exceptions that indicate the database connection was closed. Meanwhile, Postgres still shows the connections and we start seeing a really high number (for us) of locks in the

Re: [GENERAL] Postgres 9.01, Amazon EC2/EBS, XFS, JDBC and lost connections

2011-10-10 Thread John R Pierce
On 10/06/11 10:21 AM, Sean Laurent wrote: We've been running into a particularly strange problem that I'm trying to better understand. The super short version is that our application servers lose their connection to the database when I run a backup during periods of higher load and fail to

Re: [GENERAL] Postgres 9.01, Amazon EC2/EBS, XFS, JDBC and lost connections

2011-10-10 Thread Craig Ringer
On 10/10/11 23:29, John R Pierce wrote: While xfs_freeze is in effect, all writes are blocked. This is NOT what you want to do here. Postgres does NOT expect you to take an atomic snapshot of the database files; rather, by bracketing your backup with pg_start_backup and pg_stop_backup, it

Re: [GENERAL] Postgres 9.01, Amazon EC2/EBS, XFS, JDBC and lost connections

2011-10-10 Thread John R Pierce
On 10/10/11 7:44 PM, Craig Ringer wrote: If blocking writes causes a server failure that persists once writes have been unblocked, that's a bug IMO. You might have a bit of a backlog of writes to clear, but after that all should be well, and if it isn't then something needs fixing. the process

Re: [GENERAL] Postgres 9.01, Amazon EC2/EBS, XFS, JDBC and lost connections

2011-10-10 Thread Craig Ringer
On 11/10/11 12:48, John R Pierce wrote: On 10/10/11 7:44 PM, Craig Ringer wrote: If blocking writes causes a server failure that persists once writes have been unblocked, that's a bug IMO. You might have a bit of a backlog of writes to clear, but after that all should be well, and if it isn't

[GENERAL] Postgres 9.01, Amazon EC2/EBS, XFS, JDBC and lost connections

2011-10-06 Thread Sean Laurent
We've been running into a particularly strange problem that I'm trying to better understand. The super short version is that our application servers lose their connection to the database when I run a backup during periods of higher load and fail to reconnect. Here's an overview of the setup: -

Re: [GENERAL] Postgres 9.01, Amazon EC2/EBS, XFS, JDBC and lost connections

2011-10-06 Thread Tom Lane
Sean Laurent s...@studyblue.com writes: We've been running into a particularly strange problem that I'm trying to better understand. The super short version is that our application servers lose their connection to the database when I run a backup during periods of higher load and fail to