Re: [zfs-discuss] Sun Flash Accelerator F20 numbers

Bob Friesenhahn Thu, 01 Apr 2010 08:50:09 -0700

On Thu, 1 Apr 2010, Edward Ned Harvey wrote:


> Dude, don't be so arrogant.  Acting like you know what I'm talking about
> better than I do.  Face it that you have something to learn here.


Geez!

> Yes, all the transactions in a transaction group are either committed
> entirely to disk, or not at all.  But they're not necessarily committed to
> disk in the same order that the user level applications requested.  Meaning:
> If I have an application that writes to disk in "sync" mode intentionally
> ... perhaps because my internal file format consistency would be corrupt if
> I wrote out-of-order ... If the sysadmin has disabled ZIL, my "sync" write
> will not block, and I will happily issue more write operations.  As long as
> the OS remains operational, no problem.  The OS keeps the filesystem
> consistent in RAM, and correctly manages all the open file handles.  But if
> the OS dies for some reason, some of my later writes may have been committed
> to disk while some of my earlier writes could be lost, which were still
> being buffered in system RAM for a later transaction group.
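
For readers following along, the kind of application described above
might look roughly like this minimal Python sketch.  The file names
(data.log, commit.log) and the record format are hypothetical, chosen
only for illustration; os.fsync() is the standard call an application
uses to ask that its data reach stable storage before it continues.

```python
import os
import tempfile


def append_durably(path, record):
    """Append a record, then block until the OS reports it is on stable storage."""
    with open(path, "ab") as f:
        f.write(record)
        f.flush()              # push Python's userspace buffer to the OS
        os.fsync(f.fileno())   # ask the OS to commit the data before returning


def update(directory):
    """Write a payload, then a commit marker; the order matters to this format."""
    data = os.path.join(directory, "data.log")      # hypothetical file name
    commit = os.path.join(directory, "commit.log")  # hypothetical file name
    append_durably(data, b"payload-1\n")
    # The marker is only meaningful if the payload above is already durable.
    append_durably(commit, b"committed payload-1\n")
    return data, commit


with tempfile.TemporaryDirectory() as demo_dir:
    data_path, commit_path = update(demo_dir)
    with open(data_path, "rb") as f:
        data_bytes = f.read()
    with open(commit_path, "rb") as f:
        commit_bytes = f.read()
```

The application assumes the marker never becomes durable before the
payload.  That assumption holds only if the sync request really blocks
until the data is safe.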

The purpose of the ZIL is to act like a fast "log" for synchronous
writes. It allows the system to quickly confirm a synchronous write
request with the minimum amount of work. As you say, "OS keeps the
filesystem consistent in RAM". There is no 1:1 ordering between
application write requests and zfs writes and in fact, if the same
portion of file is updated many times, or the file is created/deleted
many times, zfs only writes the updated data which is current when the
next TXG is written. For a synchronous write, zfs advances its index
in the slog once the corresponding data has been committed in a TXG.
In other words, the "sync" and "async" write paths are the same when
it comes to writing final data to disk.

There is however the recovery case where synchronous writes were
affirmed which were not yet written in a TXG and the system
spontaneously reboots. In this case the synchronous writes will occur
based on the slog, and uncommitted async writes will have been lost.
Perhaps this is the case you are worried about.
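
A toy model may make the two cases concrete.  This is only a sketch
of the behavior described above, not how zfs is implemented: the
ToyPool class and its method names are invented for illustration, a
"file" is reduced to a dictionary key, and a TXG commit is treated as
a single atomic step.

```python
class ToyPool:
    """Toy model of TXG commit plus an intent log.  Not real zfs code."""

    def __init__(self):
        self.disk = {}     # state written by completed TXGs
        self.pending = {}  # in-RAM state for the open TXG (latest value per key)
        self.slog = []     # log records for acknowledged synchronous writes

    def write(self, key, value, sync=False):
        # Repeated updates to the same key overwrite each other in RAM,
        # so only the value current at commit time is ever written.
        self.pending[key] = value
        if sync:
            # A sync write is logged before it is acknowledged.
            self.slog.append((key, value))

    def commit_txg(self):
        # Sync and async data take the same path to the final on-disk state.
        self.disk.update(self.pending)
        self.pending.clear()
        # Log records covered by this TXG are no longer needed.
        self.slog.clear()

    def crash_and_recover(self):
        # RAM contents are lost, including uncommitted async writes.
        self.pending.clear()
        # Acknowledged sync writes are replayed from the log.
        for key, value in self.slog:
            self.disk[key] = value
        self.slog.clear()


pool = ToyPool()

# Normal case: three updates to one key collapse into a single write.
pool.write("a", 1)
pool.write("a", 2)
pool.write("a", 3, sync=True)
pool.commit_txg()
after_commit = dict(pool.disk)    # {'a': 3}

# Recovery case: a sync and an async write, then a crash before the next TXG.
pool.write("b", "sync", sync=True)
pool.write("c", "async")
pool.crash_and_recover()
after_recovery = dict(pool.disk)  # {'a': 3, 'b': 'sync'}
```

In this model the sync write to "b" survives the crash because it was
in the log, while the later async write to "c" is gone.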

It does seem like rollback to a snapshot does help here (to assure
that sync & async data is consistent), but it certainly does not help
any NFS clients. Only a broken application uses sync writes
sometimes, and async writes at other times.


Bob
--
Bob Friesenhahn
bfrie...@simple.dallas.tx.us, http://www.simplesystems.org/users/bfriesen/
GraphicsMagick Maintainer,    http://www.GraphicsMagick.org/
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
