jim owens wrote:
Avi Kivity wrote:
jim owens wrote:

Remember that the device bandwidth is the limiter so even
when each host has a dedicated path to the device (as in
dual port SAS or FC), that 2nd host cuts the throughput by
more than 1/2 with uncoordinated seeks and transfers.

That's only a problem if there is a single shared device. Since btrfs supports multiple devices, each host could own a device set and access from other hosts would be through the owner. You would need RDMA to get reasonable performance and some kind of dual-porting to get high availability. Each host could control the allocation tree for its devices.
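To make the ownership idea concrete, here is a minimal sketch (all names invented, not btrfs code): each host controls the allocation tree for its own devices, and an allocation request for another host's device is forwarded to that owner rather than touching the tree directly.

```python
# Hypothetical sketch: per-host device ownership. Each host owns a set
# of devices and is the only one that updates their allocation state;
# requests for blocks on another host's devices are routed to the owner.

class Host:
    def __init__(self, name, devices):
        self.name = name
        self.devices = set(devices)          # devices this host owns
        self.next_block = {d: 0 for d in devices}

    def allocate(self, device):
        # Only the owning host touches the allocation tree for its devices.
        assert device in self.devices, "forward to the owner instead"
        block = self.next_block[device]
        self.next_block[device] += 1
        return block


def allocate_anywhere(hosts, device):
    # Route the request to whichever host owns the device.
    owner = next(h for h in hosts if device in h.devices)
    return owner.name, owner.allocate(device)


hosts = [Host("host1", ["sda", "sdb"]), Host("host2", ["sdc"])]
print(allocate_anywhere(hosts, "sdc"))   # served by host2, the owner
```

In a real design the forwarding would go over RDMA, and dual-ported devices would let a surviving host take over a failed host's ownership.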

No.  Every device, including a monster $$$ array, has the problem.


As I said before, unless the application is partitioned
there is always data host2 needs from host1's disk and that
slows down host1.

The CPU load should not be significant if you have RDMA. Or are you talking about the seek load? Since host1's load should be distributed over all devices in the system, overall seek capacity increases as you add more nodes.


If host2 seldom needs any host1 data, then you are describing
a configuration that can be done easily by each host having a
separate filesystem for the device it owns by default.  Each
host nfs mounts the other host's data and if host1 fails, host2
can direct mount host1-fs from the shared array.


Separate namespaces are uninteresting to me. That's just pushing back the problem to the user.

Even with multiple disks under the same filesystem as separate
allocated storage there is still the problem of shared namespace
metadata that slows down both hosts.  If you don't need shared
namespaces then you absolutely don't want a cluster fs.


If you separate the allocation metadata to the storage owning node, and the file metadata to the actively using node, the slowdown should be low in most cases. Problems begin when all nodes access the same file, but that's relatively rare. Even then, when the file size does not change and when the data is preallocated, it's possible to achieve acceptable overhead.
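A toy sketch of that split (names invented for illustration): allocation metadata stays pinned to the storage-owning node, while a file's namespace metadata follows whichever node is actively using it, so the common single-writer case needs no cross-node traffic.

```python
# Hedged sketch of the metadata split described above. A file's
# metadata is held under a lease that migrates to the node actively
# using the file; while one node holds the lease, its metadata
# updates are purely local.

class FileLease:
    """Tracks which node currently holds a file's metadata."""

    def __init__(self):
        self.holder = None

    def acquire(self, node):
        # Single-writer case: migrate the lease to the requesting node.
        # Contention only appears when several nodes want the same file.
        self.holder = node
        return self.holder


lease = FileLease()
lease.acquire("host1")   # host1 now updates the file metadata locally
lease.acquire("host2")   # lease migrates only when host2 actually needs it
```

The expensive case is exactly the one described above: many nodes hammering the same file, where the lease would bounce; preallocated, fixed-size files sidestep most of that because the metadata rarely changes.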

A cluster fs is useful, but the cost can be high so using
it for a single-host fs is not a good idea.

Development costs, yes. But I don't see why the runtime overhead can't disappear when running on a single host. Sort of like running an SMP kernel on a uniprocessor (I agree the fs problem is much bigger).
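The SMP-on-uniprocessor analogy can be sketched like this (illustrative only, not kernel code): cluster locking degenerates to a no-op when there is nothing to coordinate with, much as spinlocks compile away on a UP kernel.

```python
# Sketch: cluster coordination overhead collapsing on a single host.
# With one node, the "cluster lock" is a no-op; with more nodes, it
# falls back to a real lock. The class name is invented.

import threading

class ClusterLock:
    def __init__(self, cluster_size):
        # Nothing to coordinate with on a single node, so skip locking
        # entirely; otherwise take a real lock.
        self._lock = threading.Lock() if cluster_size > 1 else None

    def __enter__(self):
        if self._lock:
            self._lock.acquire()
        return self

    def __exit__(self, *exc):
        if self._lock:
            self._lock.release()
        return False


with ClusterLock(cluster_size=1):
    pass   # zero locking cost on a single host
```

The harder part, as conceded above, is that a filesystem has far more shared state than a kernel's locks, so "disappearing" overhead takes real design work rather than a single conditional.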

--
error compiling committee.c: too many arguments to function
