On Apr 25, 2012, at 11:00 PM, Nico Williams wrote:

> On Thu, Apr 26, 2012 at 12:10 AM, Richard Elling
> <richard.ell...@gmail.com> wrote:
>> On Apr 25, 2012, at 8:30 PM, Carson Gaspar wrote:
>> Reboot requirement is a lame client implementation.
> 
> And lame protocol design.  You could possibly migrate read-write NFSv3
> on the fly by preserving FHs and somehow updating the clients to go to
> the new server (with a hiccup in between, no doubt), but only entire
> shares at a time -- you could not migrate only part of a volume with
> NFSv3.

Requirements, requirements, requirements... boil the ocean while we're at it? 
:-)

> Of course, having migration support in the protocol does not equate to
> getting it in the implementation, but it's certainly a good step in
> that direction.

NFSv4 has support for migrating volumes and managing the movement
of file handles. The technique includes filehandle expiry, similar to methods
used in other distributed FSs.
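
For reference, the v4 spec (RFC 3530) exposes this through the per-filesystem
fh_expire_type attribute. A minimal sketch of the bit values, with the constants
taken straight from the spec and a made-up main() purely for illustration:

#include <stdint.h>
#include <stdio.h>

/* fh_expire_type bits from RFC 3530, section 4.2.3 */
#define FH4_PERSISTENT          0x00000000  /* filehandles never expire */
#define FH4_NOEXPIRE_WITH_OPEN  0x00000001  /* can't expire while the file is open */
#define FH4_VOLATILE_ANY        0x00000002  /* may expire at any time */
#define FH4_VOL_MIGRATION       0x00000004  /* may expire when the filesystem migrates */
#define FH4_VOL_RENAME          0x00000008  /* may expire when the object is renamed */

int main(void)
{
    /* A server that only invalidates filehandles across a migration. */
    uint32_t fh_expire_type = FH4_VOL_MIGRATION;

    if (fh_expire_type & (FH4_VOL_MIGRATION | FH4_VOLATILE_ANY))
        printf("plan on re-looking-up filehandles after migration\n");
    return 0;
}

A client that honors these bits has no reason to reboot just to follow a moved
filesystem.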

>> You are correct, a ZFS send/receive will result in different file handles on
>> the receiver, just like
>> rsync, tar, ufsdump+ufsrestore, etc.
> 
> That's understandable for NFSv2 and v3, but for v4 there's no reason
> that an NFSv4 server stack and ZFS could not arrange to preserve FHs
> (if, perhaps, at the price of making the v4 FHs rather large).

This is already in the v4 spec.

> Although even for v3 it should be possible for servers in a cluster to
> arrange to preserve devids...

We've been doing that for many years.

> 
> Bottom line: live migration needs to be built right into the protocol.

Agreed, and volume migration support is already in the NFSv4 spec.
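
To make that concrete, here is a rough sketch of the client-side flow RFC 3530
describes: an operation comes back NFS4ERR_MOVED, the client reads the
fs_locations attribute for that filesystem, and it retries against one of the
servers listed there. The nfs4_* helpers below are hypothetical stand-ins, not
any real client library:

#include <stdio.h>

#define NFS4ERR_MOVED 10019   /* nfsstat4 value from RFC 3530 */

/* Hypothetical stand-ins for real client plumbing. */
static int nfs4_getattr(const char *path)
{
    (void)path;
    return NFS4ERR_MOVED;             /* pretend the filesystem has moved */
}

static void nfs4_fs_locations(const char *path, char *server, size_t len)
{
    (void)path;
    snprintf(server, len, "%s", "newserver.example.com");
}

int main(void)
{
    char server[256] = "oldserver.example.com";
    const char *path = "/export/home";

    if (nfs4_getattr(path) == NFS4ERR_MOVED) {
        /* Ask where the filesystem went, then retry against the new server. */
        nfs4_fs_locations(path, server, sizeof(server));
        printf("%s migrated; retrying against %s\n", path, server);
    }
    return 0;
}

Whether a given client actually does this is, of course, the lame-implementation
point above.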

> For me one of the exciting things about Lustre was/is the idea that
> you could just have a single volume where all new data (and metadata)
> is distributed evenly as you go.  Need more storage?  Plug it in,
> either to an existing head or via a new head, then flip a switch and
> there it is.  No need to manage allocation.  Migration may still be
> needed, both within a cluster and between clusters, but that's much
> more manageable when you have a protocol where data locations can be
> all over the place in a completely transparent manner.

Many distributed file systems do this, at the cost of being not quite POSIX-ish.
In the brave new world of Storage vMotion, NoSQL, and distributed object stores,
it is not clear to me that coding to a POSIX file system is a strong requirement.

Perhaps people are so tainted by their experiences with v2 and v3 that we can
explain the lack of migration to v4 as poor marketing? For all its leadership of
NFS, Sun's marketing was unimpressive.
 -- richard

--
ZFS Performance and Training
richard.ell...@richardelling.com
+1-760-896-4422
