Your reply makes perfect sense to me. I remember that self-heal happens at
file read time; does that mean opening a file for read is also a global
operation? Are you saying there's no way to copy 30 million files to our
66-node GlusterFS cluster for parallel processing other than waiting for
half a month? Can I somehow disable self-heal and get a speedup?
Things are turning out pretty badly for me.
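Concretely, I'm guessing the knobs would be the self-heal options of the
replicate (AFR) translator in our volfile, something like the sketch below
(option names taken from the cluster/replicate translator; the volume and
brick names here are placeholders, not our actual config):

```
# Hypothetical replicate section with self-heal switched off.
# Risk: replicas that go out of sync will no longer be repaired on access.
volume replicate-0
  type cluster/replicate
  option data-self-heal off      # don't heal file contents on open/read
  option metadata-self-heal off  # don't heal ownership/permissions/times
  option entry-self-heal off     # don't heal directory entries
  subvolumes brick-a brick-b
end-volume
```

If that does speed up the copy, I assume we'd have to turn the options back
on (or trigger a manual heal) once the data is loaded.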
- Wei
Mark Mielke wrote:
On 09/28/2009 10:35 AM, Wei Dong wrote:
Hi All,
I noticed a very weird phenomenon when copying data (200KB image files)
to our GlusterFS storage. When I run only one client, it copies roughly
20 files per second; as soon as I start a second client on another
machine, the copy rate of the first client immediately degrades to 5
files per second. When I stop the second client, the first client
immediately speeds back up to the original 20 files per second. When I
run 15 clients, the aggregate throughput is about 8 files per second,
much worse than running only one client. Neither CPU nor network is
saturated. My volume file is attached.
The servers run on a 66-node cluster and the clients on a 15-node
cluster.
We have 33x2 servers and at most 15 client machines, so each server
serves fewer than 0.5 clients on average. I cannot think of a reason for
a distributed system to behave like this; there must be some kind of
central access point.
Although there is probably room for the GlusterFS folk to optimize...
You should assume directory write operations involve the whole cluster,
and creating a file is a directory write operation. Think of what it may
have to do across the cluster: self-heal the directory, make sure the
name is right and not already in use on every replica, and so on. Once
you get to reads and writes of a particular file's data, the load should
be distributed.
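If creates are the cluster-wide part, one workaround sketch (hypothetical
paths, file names, and variables — not something I've tested on your
cluster) is to give each client machine its own target subdirectory and a
disjoint slice of the file list, so concurrent creates aren't all fighting
over the same directory:

```shell
#!/bin/sh
# Run one copy of this per client machine, with CLIENT_ID set to 0..14.
# Assumes filelist.txt holds one source path per line and /mnt/gluster
# is the GlusterFS mount point (both placeholders).
N_CLIENTS=15
CLIENT_ID=${CLIENT_ID:-0}

dest=/mnt/gluster/incoming/client-$CLIENT_ID
mkdir -p "$dest"

# Each client takes every N-th line of the list, so the slices are disjoint.
awk -v n="$N_CLIENTS" -v id="$CLIENT_ID" 'NR % n == id' filelist.txt |
while read -r f; do
  cp "$f" "$dest/"
done
```

Whether this actually helps depends on how much of the contention is
per-directory versus truly global, so treat it as an experiment.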
Cheers,
mark
_______________________________________________
Gluster-users mailing list
Gluster-users@gluster.org
http://gluster.org/cgi-bin/mailman/listinfo/gluster-users