By using booster, I can avoid being root on the client side. It
would be perfect if the servers could also be run by regular users,
even if that means some features have to be disabled. Can someone
explain a bit why the server side must be run by root?
I know that I should not ask for too much when the robustness of the
current codebase is the most important issue at the moment. I just
want to hear the reasoning behind it, and maybe hack on the code myself.
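For what it's worth, the server template below already listens on an
unprivileged port (6999), so it can at least be started as a regular
user; what I expect to break is anything that needs root on the brick,
like chown()ing files to arbitrary owners. A minimal sketch of what I
have in mind (the log path is illustrative, and whether this actually
works is exactly my question):

glusterfsd -f server.vol -l /tmp/glusterfsd-nonroot.log
# the brick directory must be writable by this user:
# option directory /state/partition1/wdong/gluster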
- Wei
Wei Dong wrote:
I think it is FUSE that causes the slowness. I ran all the experiments
with booster enabled and here's the new figure:
http://www.cs.princeton.edu/~wdong/gluster/summary-booster.gif . The
numbers are MUCH better than NFS in most cases except for the local
setting, which is not practically interesting. The interesting thing
is that all of a sudden the deletion rate dropped by a factor of 4-10 --
though I don't really care about file deletion.
I must say that I'm totally satisfied with the results.
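For anyone reproducing this: booster is the LD_PRELOAD library that
intercepts file I/O in the application and talks to the servers
directly, bypassing the FUSE mount. Roughly how it gets enabled on my
machines (the library path and the fstab location come from my
installation, so treat them as illustrative):

export GLUSTERFS_BOOSTER_FSTAB=/etc/glusterfs/booster.fstab
LD_PRELOAD=/usr/lib/glusterfs/glusterfs-booster.so \
    bonnie++ -d /mnt/gluster -s 0 -n 10:200000:200000:100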
- Wei
Wei Dong wrote:
Hi All,
I complained about the low file creation rate of glusterfs on my
cluster a few weeks ago, and Avati suggested I start with a small
number of nodes. I finally got some time to seriously benchmark
glusterfs with Bonnie++ today, and the results confirm that glusterfs
is indeed slow at file creation. My application stores a large number
of ~200KB image files. I use the following bonnie++ command for
evaluation (create 10K files of 200KB each, scattered across 100
directories; the fields are decoded below):
bonnie++ -d . -s 0 -n 10:200000:200000:100
Since sequential I/O is not that interesting to me, I only keep the
random I/O results.
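For reference, the -n fields decode as follows (standard bonnie++
syntax):

# -n num:max-size:min-size:num-directories
#   10     -> 10 x 1024 = 10240 files
#   200000 -> maximum file size in bytes
#   200000 -> minimum file size in bytes (equal, so every file is ~200KB)
#   100    -> files scattered across 100 directories
# -s 0 skips the sequential large-file tests
bonnie++ -d . -s 0 -n 10:200000:200000:100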
My hardware configuration is 2x quad-core Xeon E5430 2.66GHz, 16GB
memory, and 4x Seagate 1.5TB 7200RPM hard drives. The machines are
connected with Gigabit Ethernet.
I ran several GlusterFS configurations, each named N-R-T, where N is
the number of replicated volumes aggregated, R is the replication
count, and T is the number of server-side I/O threads. I use one
machine to serve one volume, so there are NxR servers plus one
separate client machine for each experiment. On the client side, the
server volumes are first replicated and then aggregated -- even in the
1-1-2 configuration, the single volume is wrapped in a replicate and a
distribute translator. To show the overhead of those translators, I
also ran a "simple" configuration, which is 1-1-2 without the extra
replicate & distribute translators, and a "local" configuration, which
is "simple" with client & server running on the same machine. These
are compared against "nfs" and "nfs-local", the latter being NFS with
server and client on the same machine. The GlusterFS volume file
templates are attached at the end of this email.
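To make the naming concrete, here is what the client-side translator
stack would look like for a hypothetical 2-2-2 run, following the
templates attached below (brick names are illustrative):

volume rep-0
type cluster/replicate
subvolumes brick-0-0 brick-0-1
end-volume
volume rep-1
type cluster/replicate
subvolumes brick-1-0 brick-1-1
end-volume
volume union
type cluster/distribute
subvolumes rep-0 rep-1
end-volume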
The results are at
http://www.cs.princeton.edu/~wdong/gluster/summary.gif . The
bars/numbers shown are operations per second, so larger is better.
The figure shows the following:
1. GlusterFS does an exceptionally good job at deleting files, but
creates and reads files much more slowly than both NFS setups.
2. At least for the one-node server configuration, the network doesn't
affect the file creation rate but does affect the file read rate.
3. The extra dummy replicate & distribute translators lower the file
creation rate by almost half.
4. Replication doesn't hurt performance much.
5. I'm running only a single-threaded benchmark, so it's hard to say
much about scalability, but adding more servers does help a little
even in the single-threaded setting.
Note that my results are not really that different from
http://gluster.com/community/documentation/index.php/GlusterFS_2.0_I/O_Benchmark_Results,
where the single-node file create rate is about 30/second.
I see no reason why GlusterFS has to be that much slower than NFS at
file creation in a single-node configuration. I'm wondering if someone
here can help me figure out what's wrong in my configuration, or
what's wrong in the GlusterFS implementation.
- Wei
Server volume:
# on-disk storage for the brick
volume posix
type storage/posix
option directory /state/partition1/wdong/gluster
end-volume
# POSIX locking on top of the posix store
volume lock
type features/locks
subvolumes posix
end-volume
# server-side I/O threads (the T in N-R-T)
volume brick
type performance/io-threads
option thread-count 2
subvolumes lock
end-volume
# export the brick over TCP on an unprivileged port
volume server
type protocol/server
option transport-type tcp
option auth.addr.brick.allow 192.168.99.*
option transport.socket.listen-port 6999
subvolumes brick
end-volume
Client volume:
# one protocol/client volume per server brick
volume brick-0-0
type protocol/client
option transport-type tcp
option remote-host c8-0-0
option remote-port 6999
option remote-subvolume brick
end-volume
volume brick-0-1 ...
# mirror the bricks within each replica set (the R in N-R-T)
volume rep-0
type cluster/replicate
subvolumes brick-0-0 brick-0-1 ...
...
# aggregate the N replica sets into one namespace
volume union
type cluster/distribute
subvolumes rep-0 rep-1 rep-2 rep-3 rep-4 rep-5 rep-6 rep-7
end-volume
# buffer and batch writes on the client side
volume client
type performance/write-behind
option cache-size 32MB
option flush-behind on
subvolumes union
end-volume
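For completeness, this is roughly how each experiment is started from
these templates (file names and the mount point are illustrative):

glusterfsd -f server.vol              # on each of the NxR server machines
glusterfs -f client.vol /mnt/gluster  # FUSE mount on the client machine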
For those interested in the real configuration files, I have uploaded
all of them, together with the server/client logs, to
http://www.cs.princeton.edu/~wdong/gluster/run.tar.gz .