On Thu, Jan 30, 2014 at 1:53 PM, Dan Fairs <[email protected]> wrote:

>
> Right, that's what I was afraid of. I'm currently restoring from a pg_dump
> just to check that we can recover. I suspect the next step will be to take
> another server, restore a pg_dump'd backup on it, and try a WAL-E setup on
> that one. If that works, then I expect we'll have to dump and reload our
> production server. Frustrating, as this all worked smoothly in our test
> environments! That's life, I guess...
>
>
> Yeah.  Testing backups is still a struggle -- even superficially
> starting up the cluster is not enough.  Some extra checking or
> monitoring integration will probably be seen in WAL-E over time,
> particularly with regard to Postgres checksums and figuring out how to
> deal with file system failures for those using checksummed file
> systems, but that is a ways off.
>
>
> Are there tickets for this? If there's any low-hanging fruit, we might be
> able to take a crack at some of them.
>

No, I don't think so.  No design I've heard so far is a slam dunk (so your
return on time is far from guaranteed), and so I think breaking them into
issues gives an illusion of control over how the problem would be
approached or in what pieces.  Finally, bugs that are from logic errors
within Postgres will necessarily defeat checksums on the pages.

If you wrote a software package to automagically a WAL-E backup and ran
pg_dumpall on it to /dev/null and report on the result somehow that'd go
some ways.  It wouldn't verify indexes though.  There are also some
interesting tools like https://github.com/tvondra/pg_check that could be
installed on those recovered databases.

Another line of inquiry might be see how corrupt btrfs/zfs file systems
present corruption.

Corruption detection -- both from storage media and rechecks to locate
postgres logic errors -- is basically pretty raw in Postgres.

Yet, I think writing software to automate recovery and dumping of a
database would be a great first step.

-- 
You received this message because you are subscribed to the Google Groups 
"wal-e" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/groups/opt_out.

Reply via email to