Re: [zfs-discuss] zfs streams & data corruption

Greg Palmer Wed, 25 Feb 2009 15:19:49 -0800

Miles Nordin wrote:

    gp> Performing a checkpoint will perform such tasks as making sure
    gp> that all transactions recorded in the log but not yet written
    gp> to the database are written out and that the system is not in
    gp> the middle of a write when you grab the data.

great copying of buzzwords out of a glossary,

Wasn't copied from a glossary, I just tried to simplify it enough foryou to understand. I apologize if I didn't accomplish that goal.

but does it change my

claim or not? My claim is:

  that SQLite2 should be equally as tolerant of snapshot backups as it
  is of cord-yanking.

You're missing the point here Miles. The folks weren't asking for amethod to confirm their database was able to perform proper errorrecovery and confirm it would survive having the cord yanked out of thewall. They were asking for a reliable way to backup their data. The bestway to do that is not by snapshotting alone. The process of performingdatabase backups is well understood and supported throughout the industry.

Relying on the equivalent of crashing the database to perform backupsisn't how professionals get the job done. There is a reason thatdatabase vendor do not suggest you backup their databases by pulling theplug out of the wall or killing the running process. The best way tobackup a database is by using a checkpoint. Your comment aboutcheckpoints being for systems where snapshots are not available is notaccurate. That is the normal method of backing up databases underSolaris among others. Checkpoints are useful for all systems since theyguarantee that the database files are consistent and do not requirerecovery which doesn't always work no matter what the glossy brochurestell you. Typically they are used in concert with snapshots. Force thecheckpoint, trigger the snapshot and you're golden.

Let's take a simple case of a transaction which consists of threedatabase updates within a transaction. One of those writes succeeds, youtake a snapshot and then the two other writes succeed. Everyoneconcerned with the transaction believes it succeeded but your snapshotdoes not show that. When the database starts up again, the data it willhave in your snapshot indicates the transaction never succeeded andtherefore it will roll out the database transaction and you will losethat transaction. Well, it will assuming that all code involved in thatrecovery works flawlessly. Issuing a checkpoint on the other hand causesthe database to complete the transaction including ensuring consistencyof the database files before you take your snapshot. NOTE: If you issuea checkpoint and then perform a snapshot you will get consistent datawhich does not require the database perform recovery. Matter of fact,that's the best way to do it.

Your dismissal of write activity taking place is inaccurate. Snapshotstake a picture of the file system at a point in time. They have noknowledge of whether or not one of three writes required for thedatabase to be consistent have completed. (Refer to above example) Datadoes not hit the disk instantly, it takes some finite amount of time inbetween when the write command is issued for it to arrive at the disk.Plainly, terminating the writes between when they are issued and beforeit has completed is possible and a matter of timing. The database on theother hand does understand when the transaction has completed and allowsoutside processes to take advantage of this knowledge via checkpointing.

All real database systems have flaws in the recovery process and so farevery database system I've seen has had issues at one time or another.If we were in a perfect world it SHOULD work every time but we aren't ina perfect world. ZFS promises on disk consistency but as we saw in therecent thread about "Unreliable for professional usage" it is possibleto have issues. Likewise with database systems.


Regards,
 Greg
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

Re: [zfs-discuss] zfs streams & data corruption

Reply via email to