On Mon, May 30, 2016 at 4:22 PM, Alex Ignatov <a.igna...@postgrespro.ru> wrote:
>
> _____________________________
> From: Scott Marlowe <scott.marl...@gmail.com>
> Sent: Monday, May 30, 2016 20:14
> Subject: Re: [GENERAL] Silent data loss in its pure form
> To: Alex Ignatov <a.igna...@postgrespro.ru>
> Cc: <pgsql-general@postgresql.org>
>
>
> On Mon, May 30, 2016 at 10:57 AM, Alex Ignatov <a.igna...@postgrespro.ru> wrote:
> > Following this bug report from Red Hat
> > https://bugzilla.redhat.com/show_bug.cgi?id=845233
> >
> > it raises a dangerous issue:
> >
> > If for any reason your data file is zeroed after a power loss (the
> > best-known case being XFS in the past), then when you do
> > select count(*) from your_table you get zero if your table was in one
> > 1GB (default) file, or some other number != count(*) from your_table
> > before the power loss.
> > No errors, nothing suspicious in the logs. No checksum errors. Nothing.
> >
> > Silent data loss in its pure form.
> >
> > And thank all the gods if you notice it before the backups that
> > contain good data are recycled.
> > Keep this in mind while checking your "backups" in any form (pg_dump or
> > the more dangerous and short-spoken PITR file backup).
> >
> > Your data is always in danger with the "zeroed data file is a normal
> > file" paradigm.
>
> That bug shows as having been fixed in 2012. Are there any modern,
> supported distros that would still have it? It sounds really bad btw.
>
> It is not about modern distros, it is about possible silent data loss in
> old distros. We have replication, we have some form of data checksumming,
> but we are powerless in front of this XFS bug just because "a zeroed file
> is your good friend in Postgres".
> With the "zero file is a good file" paradigm and this XFS bug, PG as it
> is now is a "colossus with feet of clay". It can do many things, but it
> can't even tell us that we have some trouble with our precious data.
> There is no need for prevention or any other AI magic when the zero
> doomsday has come.
> What we need now is an error report about a suspicious zeroed file, so
> that we know something went wrong and we have to do recovery.
> Today PG "power loss" recovery and this XFS bug poison our assurance
> that recovery went well. It went well even with a zeroed file. It is not
> healthy behavior. It is like walking on a minefield with eyes closed.
> I think it is a very dangerous view of data to have data files without
> any header in them and without any file checking, at least on PG start.
> With this known XFS bug it can lead to undetected and unavoidable loss
> of data.

For those not following -general, this is basically an extension of the
following thread:

"Deleting a table file does not raise an error when the table is touched
afterwards, why?"

https://www.postgresql.org/message-id/flat/184509399.5590018.1464622534207.javamail.zim...@dbi-services.com#184509399.5590018.1464622534207.javamail.zim...@dbi-services.com

David J.