Re: [HACKERS] emergency outage requiring database restart

Jim Nasby Fri, 28 Oct 2016 13:16:47 -0700

On 10/28/16 8:23 AM, Merlin Moncure wrote:

On Thu, Oct 27, 2016 at 6:39 PM, Greg Stark <[email protected]> wrote:

On Thu, Oct 27, 2016 at 9:53 PM, Merlin Moncure <[email protected]> wrote:

I think we can rule out faulty storage


Nobody ever expects the faulty storage

LOL

Believe me, I know.  But the evidence points elsewhere in this case;
this is clearly application driven.

FWIW, just because it's triggered by specific application behaviordoesn't mean there isn't a storage bug. That's what makes datacorruption bugs such a joy to figure out.

BTW, if you haven't already, I would reset all your storage relatedoptions and GUCs to safe defaults... plain old FSYNC, no cute journal /FS / mount options, etc. Maybe this is related to the app, but the mosthelpful thing right now is to find some kind of safe config so you canstart bisecting.

I would also consider alternatives to plsh, just to rule it out ifnothing else. I'd certainly look at some way to get sqsh out of the loop(again, just to get something that doesn't crash). First idea that comesto mind is a stand-alone shell script that watches a named pipe for afilename; when it gets that file it runs it with sqsh and does somethingto signal completion.

--
Jim Nasby, Data Architect, Blue Treble Consulting, Austin TX
Experts in Analytics, Data Architecture and PostgreSQL
Data in Trouble? Get it in Treble! http://BlueTreble.com
855-TREBLE2 (855-873-2532)   mobile: 512-569-9461


--
Sent via pgsql-hackers mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] emergency outage requiring database restart

Reply via email to