Re: [Gluster-users] cluster.min-free-disk separate for each, brick

Deyan Chepishev - SuperHosting.BG Wed, 17 Aug 2011 06:28:54 -0700

Hello,

This is really bad news, because I already migrated my data and I just realizedthat I am screwed because Gluster just does not care about the brick sizes.

It is impossible to move to uniform brick sizes.

Currently we use 2TB HDDs, but the disks are growing and soon we will probablyuse 3TB hdds or whatever other larges sizes appear on the market. So if wechoose to use raid5 and some level of redundancy (for example 6hdds in raid5, nomatter what their size is) this sooner or later will lead us to non uniformbricks which is a problem and it is not correct to expect that we always can orwant to provide uniform size bricks.

With this way of thinking if we currently have 10T from 6x2T in hdd5, at somepoint when there is a 10T on a single disk we will have to use no raid justbecause gluster can not handle non uniform bricks.


Regards,
Deyan






Dan Bretherton wrote:

On 15/08/11 20:00, gluster-users-requ...@gluster.org wrote:
Message: 1
Date: Sun, 14 Aug 2011 23:24:46 +0300
From: "Deyan Chepishev - SuperHosting.BG"<dchepis...@superhosting.bg>
Subject: [Gluster-users] cluster.min-free-disk  separate for each
    brick
To: gluster-users@gluster.org
Message-ID:<4e482f0e.3030...@superhosting.bg>
Content-Type: text/plain; charset=UTF-8; format=flowed

Hello,

I have a gluster set up with very different brick sizes.

brick1: 9T
brick2: 9T
brick3: 37T

with this configuration if I set the parameter cluster.min-free-disk to 10% it
applies to all bricks which is quite uncomfortable with these brick sizes,
because 10% for the small bricks are ~ 1T but for the big brick it is ~3.7T and
what happens at the end is that if all brick go to 90% usage and I continue
writing, the small ones eventually fill up to 100% while the big one has enough
free space.

My question is, is there a way to set cluster.min-free-disk per brick instead
setting it for the entire volume or any other way to work around this problem ?

Thank you in advance

Regards,
Deyan
Hello Deyan,
I have exactly the same problem and I have asked about it before - see linksbelow.
http://community.gluster.org/q/in-version-3-1-4-how-can-i-set-the-minimum-amount-of-free-disk-space-on-the-bricks/
http://gluster.org/pipermail/gluster-users/2011-May/007788.html
My understanding is that the patch referred to in Amar's reply in the Maythread prevents a "migrate-data" rebalance operation failing by running out ofspace on smaller bricks, but that doesn't solve the problem we are having.Being able to set min-free-disk for each brick separately would be useful, aswould being able to set this value as a number of bytes rather than apercentage. However, even if these features were present we would still havea problem when the amount of free space becomes less than min-free-disk,because this just results in a warning message in the logs and doesn'tactually prevent more files from being written. In other words, min-free-diskis a soft limit rather than a hard limit. When a volume is more than 90% fullthere may still be hundreds of gigabytes of free space spread over the largebricks, but the small bricks may each only have a few gigabytes left of evenless. Users do "df" and see lots of free space in the volume so they continuewriting files. However, when GlusterFS chooses to write a file to a smallbrick, the write fails with "device full" errors if the file grows too large,which is often the case here with files typically several gigabytes in sizefor some applications.
I would really like to know if there is a way to make min-free-disk a hardlimit. Ideally, GlusterFS would chose a brick on which to write a file basedon how much free space it has left rather than choosing a brick at random (orhowever it is done now). That would solve the problem of non-uniform bricksizes without the need for a hard min-free-disk limit.
Amar's comment in the May thread about QA testing being done only on volumeswith uniform brick sizes prompted me to start standardising on a uniform bricksize for each volume in my cluster. My impression is that implementing thefeatures needed for users with non-uniform brick sizes is not a priority forGluster, and that users are all expected to use uniform brick sizes. I reallythink this fact should be stated clearly in the GlusterFS documentation, inthe sections on creating volumes in the Administration Guide for example.That would stop other users from going down the path that I did initially,which has given me a real headache because I am now having to move tens ofterabytes of data off bricks that are larger than the new standard size.
Regards
Dan.

_______________________________________________
Gluster-users mailing list
Gluster-users@gluster.org
http://gluster.org/cgi-bin/mailman/listinfo/gluster-users

Re: [Gluster-users] cluster.min-free-disk separate for each, brick

Reply via email to