Re: [HACKERS] Add checksums without --initdb

Heikki Linnakangas Thu, 02 Jul 2015 12:54:43 -0700

On 07/02/2015 10:39 PM, David Christensen wrote:

Possible concerns here are whether checksums are included in WAL
full_page_writes or if they are independently calculated; if the
latter I think we’d be fine.  If checksums are all handled at the
layer below WAL then any streamed/processed changes should be fine to
get us to the point where we could come up as a master.

It's not full_page_writes that's the problem, but the server would not
WAL-log hint bit updates, unless you also have wal_log_hints enabled.
But that would be simple to just check - wal_log_hints can be enabled
with a server restart so that's not too onerous.

Andres suggested a separate tool that would basically rewrite the
existing data directory heap files in place, which I can also see a
use case for, but I also think there’s some benefit to be found in
having it happen while the replica is being streamed/built.

Ideas/thoughts/reasons this wouldn’t work?

You probably could make this work, but it seems like a pretty
complicated way to enable checksums. There are also interesting
corner cases with replication: is it possible to connect a streaming
replica that's been restored from the checksums-enabled backup to a
checksums-disabled master? The enable-in-place approach seems a lot more
straightforward to me. In a nutshell:

Add an "enabling-checksums" mode to the server where it calculates
checksums for anything it writes, but doesn't check or complain about
incorrect checksums on reads. Put the server into that mode, and then
have a background process that reads through all data in the cluster,
calculates the checksum for every page, and writes all the data back.
Once that's completed, checksums can be fully enabled.
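
To make the three states concrete, here is a rough toy model of that
idea in Python. It is only a sketch and not PostgreSQL code: the names
ToyCluster, ChecksumMode, enable_checksums and ChecksumError are made
up for illustration, pages are plain byte strings held in memory, and
CRC-32 merely stands in for the real page checksum algorithm.

```python
import zlib
from enum import Enum


class ChecksumMode(Enum):
    OFF = "off"            # no checksums written, none verified
    ENABLING = "enabling"  # hypothetical transitional mode: write, don't verify
    ON = "on"              # checksums written and verified


class ChecksumError(Exception):
    """Raised when a page fails verification in ON mode."""


def compute_checksum(data: bytes) -> int:
    # Assumption: CRC-32 is a stand-in, not the server's actual algorithm.
    return zlib.crc32(data)


class Page:
    def __init__(self, data: bytes):
        self.data = data
        self.checksum = None  # None models a page written without a checksum


class ToyCluster:
    def __init__(self, pages):
        self.mode = ChecksumMode.OFF
        self.pages = [Page(p) for p in pages]

    def write_page(self, idx: int, data: bytes) -> None:
        page = self.pages[idx]
        page.data = data
        # In ENABLING and ON modes every write carries a fresh checksum.
        if self.mode is ChecksumMode.OFF:
            page.checksum = None
        else:
            page.checksum = compute_checksum(data)

    def read_page(self, idx: int) -> bytes:
        page = self.pages[idx]
        # Only ON mode checks; ENABLING tolerates missing or stale checksums.
        if self.mode is ChecksumMode.ON:
            if page.checksum != compute_checksum(page.data):
                raise ChecksumError(f"bad checksum on page {idx}")
        return page.data


def enable_checksums(cluster: ToyCluster) -> None:
    """Model of the background pass: rewrite every page, then switch to ON."""
    cluster.mode = ChecksumMode.ENABLING
    for idx in range(len(cluster.pages)):
        cluster.write_page(idx, cluster.read_page(idx))
    cluster.mode = ChecksumMode.ON
```

The point of the intermediate mode in this model is that reads keep
working while old pages still lack checksums, and any page written
during the pass is already checksummed, so nothing needs a second
visit before verification is switched on.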


- Heikki



--
Sent via pgsql-hackers mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers
