Re: [Gluster-users] small files and cluster/stripe

Jeff Anderson-Lee Thu, 13 May 2010 17:28:00 -0700

On 5/13/2010 5:05 PM, Craig Carl wrote:

Jeff -
   Two comments/ideas.
1. If you are limited to four pieces of hardware, the minimum for stripe, and you want to stripe some of the data and just distribute other files, there is a way to do that. Ideally you would use your hardware RAID controllers to create two LUNs on each host, one for distribute, the other for stripe. If you don't have hardware RAID you could use LVM2 or ZFS to achieve the same thing (or you could use folders).
   1a. Once you have two file systems created, use glusterfs-volgen to create the vol files for the distribute export just like you normally would.
   1b. Move the files you just created to the storage servers and clients.
   1c. Re-run glusterfs-volgen, this time for the stripe, adding the -p option and specifying a port (something above 1024, not 6996).
   1d. Move the files you just created to the storage servers and clients.
   1e. Start Gluster twice on all the servers, specifying the different vol files.
   1f. You now have two GlusterFS exports, one distribute, the other stripe.
   1g. You can mount one inside the other on the client if that makes management easier.
   There are advantages to this model: having two separate Gluster instances significantly improves parallelism on the storage servers. You can manage the two instances as if they are on different iron.
2. The use case for stripe is vanishingly small. If you have very large files (at least 2X the amount of memory in your storage servers and a minimum of 50GB) with very limited writes and simultaneous access from hundreds of clients, then stripe might be appropriate. Stripe was designed for a specific type of HPC problem solving, not general file serving. Our video streaming users don't use stripe, even though that is an obvious use; there are better ways to configure Gluster for that. If you could share the type of content, access methods, and IOPS, we could make some specific suggestions.
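
The size part of the rule of thumb above can be written down as a small check. This is only a sketch: the function name and parameters are hypothetical, it is not part of any Gluster tooling, and it deliberately ignores the other two conditions (very limited writes, simultaneous access from hundreds of clients).

```python
GB = 1024 ** 3  # bytes per gigabyte (binary); an assumption for illustration


def meets_stripe_size_rule(file_size_bytes, server_ram_bytes,
                           min_size_bytes=50 * GB):
    """Return True if a file is large enough to be a stripe candidate.

    Encodes only the size guidance quoted above: the file should be at
    least twice the storage server's memory AND at least 50GB.
    """
    return file_size_bytes >= max(2 * server_ram_bytes, min_size_bytes)


if __name__ == "__main__":
    # Hypothetical storage server with 16GB of RAM.
    for size_gb in (1, 40, 60):
        ok = meets_stripe_size_rule(size_gb * GB, 16 * GB)
        print(f"{size_gb}GB file: stripe candidate by size = {ok}")
```

A file that passes this check is still only a candidate; the write pattern and client count matter just as much.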

We *are* a quasi-HPC environment. We have 100+ batch compute servers with 500+ cores, all with GbT interfaces, pounding on an old NAS storage server. We are trying to replace the old shared staging area with new hardware. We've been looking at an Isilon solution, which performs well for the task but costs 4x to 5x what a Gluster solution would price out at for similar-sized hardware/space.

Some of our users have millions of small files, some have thousands of large files, some have one or two humongous files. If all the data were just one size or another, all would be well. All files are currently stored in the same shared staging area. Our users are not HPC programmers and tend to program in HLLs such as MATLAB, so we try to be as accommodating as possible, rather than force them to manage the data distribution.

We'd love a solution that would (a) spread small files over multiple volumes as well as (b) spread large files over multiple volumes. Cluster/distribute would work for the former and cluster/stripe for the latter. A marriage of the two would be great.

Right now I'm trying to patch together a temporary testbed using a bunch of old machines with two 143GB drives each. The problem is that many files are multi-GB, and unless they are striped they could easily fill up a volume if the hash distribution is poor. Likewise, many small files could swamp the low-end disks in a stripe volume.
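
To make the concern concrete, here is a rough simulation of how a handful of multi-GB files land on small volumes under whole-file hashing versus striping. It is a sketch under loud assumptions: MD5 of the file name stands in for Gluster's real distribute hash, striping is modeled as a perfectly even split, and the brick count, file names and file sizes are made up.

```python
import hashlib

BRICKS = 4               # assumed number of volumes in the testbed
BRICK_CAPACITY_GB = 143  # one 143GB drive per volume


def distribute(files, bricks=BRICKS):
    """Place each whole file on one brick picked by a hash of its name."""
    used = [0.0] * bricks
    for name, size_gb in files:
        digest = hashlib.md5(name.encode()).digest()  # stand-in hash only
        used[int.from_bytes(digest[:4], "big") % bricks] += size_gb
    return used


def stripe(files, bricks=BRICKS):
    """Split every file evenly across all bricks (idealized striping)."""
    used = [0.0] * bricks
    for _, size_gb in files:
        for i in range(bricks):
            used[i] += size_gb / bricks
    return used


# Hypothetical workload: 40 files of 10GB each (400GB total).
files = [(f"run{i:03d}.dat", 10.0) for i in range(40)]

if __name__ == "__main__":
    for label, used in (("distribute", distribute(files)),
                        ("stripe", stripe(files))):
        fullest = max(used)
        status = "OVER CAPACITY" if fullest > BRICK_CAPACITY_GB else "fits"
        print(f"{label:10s} per-brick GB: {used}  fullest: {fullest} ({status})")
```

Striping always leaves every volume at the average fill, while whole-file hashing can only match that or do worse, and with few large files the imbalance can be a large fraction of a 143GB drive.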

I suppose we could create two pools and tell the predominantly small file users to use one and the predominantly large file users to use the other, but somehow I would not hold my breath on it working out.


Jeff

_______________________________________________
Gluster-users mailing list
[email protected]
http://gluster.org/cgi-bin/mailman/listinfo/gluster-users
