Re: [Gluster-users] is glusterfs DHT really distributed?

Mark Mielke Mon, 28 Sep 2009 07:40:31 -0700

On 09/28/2009 10:35 AM, Wei Dong wrote:

Hi All,
I noticed a very weird phenomenon when I'm copying data (200KB imagefiles) to our glusterfs storage. When I run only run client, itcopies roughly 20 files per second and as soon as I start a secondclient on another machine, the copy rate of the first clientimmediately degrade to 5 files per second. When I stop the secondclient, the first client will immediately speed up to the original 20files per second. When I run 15 clients, the aggregate throughput isabout 8 files per second, much worse than running only one client.Neither CPU nor network is saturated. My volume file is attached.The servers are running on a 66 node cluster and the clients are a15-node cluster.
We have 33x2 servers and at most 15 separate machines, with eachserver serving < 0.5 clients on average. I cannot think of a reasonfor a distributed system to behave like this. There must be some kindof central access point.


Although there is probably room for the GlusterFS folk to optimize...

You should consider directory write operations to involve the wholecluster. Creating a file is a directory write operation. Think of how itmight have to do self-heal across the cluster, make sure the name isright and not already in use across the cluster, and such things.

Once you get to reads and writes for a particular file, it should bedistributed.


Cheers,
mark

--
Mark Mielke<m...@mielke.cc>

_______________________________________________
Gluster-users mailing list
Gluster-users@gluster.org
http://gluster.org/cgi-bin/mailman/listinfo/gluster-users

Re: [Gluster-users] is glusterfs DHT really distributed?

Reply via email to