> From: cas...@holland.sun.com [mailto:cas...@holland.sun.com] On Behalf
> Of casper....@sun.com
> 
> >On Mon, 19 Apr 2010, Edward Ned Harvey wrote:
> >> Improbability assessment aside, suppose you use something like the DDRDrive
> >> X1 ... Which might be more like 4G instead of 32G ... Is it even physically
> >> possible to write 4G to any device in less than 10 seconds?  Remember, to
> >> achieve worst case, highest demand on ZIL log device, these would all have
> >> to be <32kbyte writes (default configuration), because larger writes will go
> >> directly to primary storage, with only the intent landing on the ZIL.
> >
> >Note that ZFS always writes data in order so I believe that the
> >statement "larger writes will go directly to primary storage" really
> >should be "larger writes will go directly to the ZIL implemented in
> >primary storage (which always exists)".  Otherwise, ZFS would need to
> >write a new TXG whenever a new "large" block of data appeared (which
> >may be puny as far as the underlying store is concerned) in order to
> >assure proper ordering.  This would result in a very high TXG issue
> >rate.  Pool fragmentation would be increased.
> >
> >I am sure that someone will correct me if this is wrong.
> 
> There's a difference between "written" and "the data is referenced by the
> uberblock".  There is no need to start a new TXG when a large datablock
> is written.  (If the system resets, the data will be on disk but not
> referenced and is lost unless the TXG it belongs to is committed)
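
For scale, here is a back-of-envelope sketch of the "4G in less than 10
seconds" question quoted above.  It assumes "4G" means 4 GiB and "32kbyte"
means 32 KiB; the variable names are only for illustration.

```python
# Rough arithmetic only: what "4G in 10 seconds" implies for a log device.
GIB = 1024 ** 3
MIB = 1024 ** 2
KIB = 1024

total_bytes = 4 * GIB   # "4G" from the quoted question (assumed to be GiB)
window_s = 10           # "less than 10 seconds"
max_write = 32 * KIB    # writes must stay under 32 KiB (assumed unit)

# Sustained rate the device would have to absorb.
throughput_mib_s = total_bytes / window_s / MIB

# Lower bound on the number of writes, since each one is at most 32 KiB.
min_writes = total_bytes // max_write
min_writes_per_s = min_writes / window_s

print(f"sustained rate: {throughput_mib_s:.1f} MiB/s")
print(f"at least {min_writes} writes, about {min_writes_per_s:.0f} per second")
```

So the worst case needs roughly 400 MiB/s sustained, made up of more than
13,000 small sync writes per second.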

*Also*, it turns out that what I said was not strictly correct either.  I
think I'm too sleepy to get this exactly right just now, but ...

My (hopefully corrected) understanding is now:

By default, all sync writes go to the ZIL in their entirety, regardless of
size.  Only if you change the logbias property to throughput does a large
sync write send the bulk of its data to primary storage, with just a tiny
intent record written to the SSD.

I think I misunderstood the default.  I previously thought throughput was
the default, not latency.
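
To make that concrete, here is a toy model of my understanding as described
above.  It is not ZFS code: the function name, the return values and the
32 KiB "large" threshold (borrowed from my earlier message) are assumptions
for illustration, and the handling of small writes under
logbias=throughput is a guess, since I haven't checked it.

```python
# Toy model of the behaviour described in this message; NOT ZFS source code.
# All names here are hypothetical.

LARGE_WRITE_BYTES = 32 * 1024  # assumed threshold for a "large" write


def sync_write_destination(size_bytes, logbias="latency"):
    """Return (where the data goes, what the log device receives)."""
    if logbias not in ("latency", "throughput"):
        raise ValueError(f"unknown logbias value: {logbias!r}")

    if logbias == "latency":
        # Default: the whole write goes to the ZIL, whatever its size.
        return ("log device", "full data")

    if size_bytes >= LARGE_WRITE_BYTES:
        # throughput: bulk data to primary storage, tiny intent to the log.
        return ("primary storage", "intent record only")

    # Small writes under throughput are not covered above; assumed here
    # to be logged in full.
    return ("log device", "full data")


for bias in ("latency", "throughput"):
    print(bias, sync_write_destination(1024 * 1024, logbias=bias))
```

On a real system the property is changed with something like
"zfs set logbias=throughput pool/fs", where pool/fs is a placeholder
dataset name.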

_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
