Re: [ovirt-users] Gluster storage question

Bartosiak-Jentys, Chris Sat, 11 Feb 2017 11:35:19 -0800

Thanks for the links, I will add them to my reading list. Absolutely would read the docs before deploying ovirt in production and definitely would not use this storage configuration, this is purely to keep from wasting electricity.


Chris.


On 2017-02-11 19:18, Doug Ingham wrote:

On 11 February 2017 at 15:39, Bartosiak-Jentys, Chris <chris.bartosiak-jen...@certico.co.uk> wrote:
Thank you for your reply Doug,
I didn't use localhost as I was preparing to follow instructions (blog post: http://community.redhat.com/blog/2014/11/up-and-running-with-ovirt-3-5-part-two/) for setting up CTDB and had already created hostnames for the floating IP when I decided to ditch that and go with the hosts file hack. I already had the volumes mounted on those hostnames but you are absolutely right, simply using localhost would be the best option.
oVirt 3.5? 2014? That's oooold. Both oVirt & Gluster have moved on a lot since then. I would strongly recommend studying Gluster's documentation before implementing it in production. It's not complicated, but you have to have a good understanding of what you're doing & why if you want to protect the integrity of your data & avoid waking up one day to find everything in meltdown.
https://gluster.readthedocs.io/en/latest/
Red Hat's portal is also very good & full of detailed tips for tuning your setup, however their "stable" versions (which they have to support) are of course much older than the project's own latest stable, so keep this in mind when considering their advice.
https://access.redhat.com/documentation/en/red-hat-storage/
Likewise with their oVirt documentation, although their supported oVirt versions are much closer to the current stable release. It also features a lot of very good advice for configuring & tuning an oVirt (RHEV) & GlusterFS (RHGS) hyperconverged setup.
https://access.redhat.com/documentation/en/red-hat-virtualization/
For any other Gluster specific questions, you can usually get good & timely responses on their mailing list & IRC channel.
Thank you for your suggested outline of how to power up/down the cluster, I hadn't considered the fact that turning on two out of date nodes would clobber data on the new node. This is something I will need to be very careful to avoid. The setup is mostly for lab work so not really mission critical but I do run a few VM's (freeIPA, GitLab and pfSense) that I'd like to keep up 24/7. I make regular backups (outside of ovirt) of those just in case.
Thanks, I will do some reading on how gluster handles quorum and heal operations but your procedure sounds like a sensible way to operate this cluster.
Regards,

Chris.

On 2017-02-11 18:08, Doug Ingham wrote:
On 11 February 2017 at 13:32, Bartosiak-Jentys, Chris <chris.bartosiak-jen...@certico.co.uk> wrote:
Hello list,
Just wanted to get your opinion on my ovirt home lab setup. While this is not a production setup I would like it to run relatively reliably so please tell me if the following storage configuration is likely to result in corruption or just bat s**t insane.
I have a 3 node hosted engine setup, VM data store and engine data store are both replica 3 gluster volumes (one brick on each host).
I do not want to run all 3 hosts 24/7 due to electricity costs, I only power up the larger hosts (2 Dell R710's) when I need additional resources for VM's.
I read about using CTDB and floating/virtual IP's to allow the storage mount point to transition between available hosts but after some thought decided to go about this another, simpler, way:
I created a common hostname for the storage mount points: gfs-data and gfs-engine
On each host I edited the /etc/hosts file to have these hostnames resolve to that host's own IP, i.e. on host1 gfs-data & gfs-engine --> host1 IP
on host2 gfs-data & gfs-engine --> host2 IP
etc.
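For illustration, the per-host mapping described above could be sketched roughly as follows. The host names and IP addresses are hypothetical (the thread never gives real ones); only the aliases gfs-data and gfs-engine come from the message.

```python
# Hypothetical host names and documentation-range addresses;
# the thread does not state the real ones.
HOST_IPS = {
    "host1": "192.0.2.11",
    "host2": "192.0.2.12",
    "host3": "192.0.2.13",
}

# Storage aliases named in the message.
STORAGE_ALIASES = ("gfs-data", "gfs-engine")


def hosts_line(host: str) -> str:
    """Return the /etc/hosts line that points the storage aliases at `host` itself."""
    return f"{HOST_IPS[host]} {' '.join(STORAGE_ALIASES)}"


if __name__ == "__main__":
    for host in HOST_IPS:
        print(f"# on {host}")
        print(hosts_line(host))
```

Each host ends up with a different line, so the same alias resolves to whichever machine is doing the lookup.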
In ovirt engine each storage domain is mounted as gfs-data:/data and gfs-engine:/engine
My thinking is that this way no matter which host is up and acting as SPM it will be able to mount the storage as it's only dependent on that host being up.
I changed gluster options for server-quorum-ratio so that the volumes remain up even if quorum is not met, I know this is risky but it's just a lab setup after all.
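To see why quorum matters in a 3-node setup that often runs on a single node, here is a minimal sketch assuming a plain strict-majority rule. This is a simplification, not Gluster's actual check, which depends on settings such as cluster.server-quorum-ratio and cluster.quorum-type.

```python
def has_majority(active_nodes: int, total_nodes: int) -> bool:
    """True when strictly more than half of the nodes are up (simplified model)."""
    return 2 * active_nodes > total_nodes


if __name__ == "__main__":
    for up in range(1, 4):
        print(f"{up}/3 nodes up -> majority: {has_majority(up, 3)}")
```

Under this model one node out of three never has a majority on its own, while any two nodes do, including two nodes that both hold stale data.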
So, any thoughts on the /etc/hosts method to ensure the storage mount point is always available? Is data corruption more or less inevitable with this setup? Am I insane ;) ?
Why not just use localhost? And no need for CTDB with a floating IP, oVirt uses libgfapi for Gluster which deals with that all natively.
As for the quorum issue, I would most definitely *not* run with quorum disabled when you're running more than one node. As you say you specifically plan for when the other 2 nodes of the replica 3 set will be active or not, I'd do something along the lines of the following...
Going from 3 nodes to 1 node:
- Put nodes 2 & 3 in maintenance to offload their virtual load;
- Once the 2 nodes are free of load, disable quorum on the Gluster volumes;
- Power down the 2 nodes.

Going from 1 node to 3 nodes:
- Power on *only* 1 of the pair of nodes (if you power on both & self-heal is enabled, Gluster will "heal" the files on the main node with the older files on the 2 nodes which were powered down);
- Allow Gluster some time to detect that the files are in split-brain;
- Tell Gluster to heal the files in split-brain based on modification time;
- Once the 2 nodes are in sync, re-enable quorum & power on the last node, which will be resynchronised automatically;
- Take the 2 hosts out of maintenance mode.
If you want to power on the 2nd two nodes at the same time, make absolutely sure self-heal is disabled first! If you don't, Gluster will see the 2nd two nodes as in quorum & heal the data on your 1st node with the out-of-date data.
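The Gluster side of the procedure above could be sketched roughly as below. This is an assumption-laden outline, not a tested runbook. The thread does not say which options "disable quorum" refers to, so the sketch assumes cluster.quorum-type and cluster.server-quorum-type. The volume names data and engine are inferred from the mount paths, and the file path in the example is hypothetical. The sketch only builds the command lines; it does not run them, and the oVirt maintenance and power steps are left out.

```python
from typing import Dict, List

Command = List[str]


def set_option(volume: str, option: str, value: str) -> Command:
    """Build a `gluster volume set` command line."""
    return ["gluster", "volume", "set", volume, option, value]


def scale_down(volumes: List[str]) -> List[Command]:
    """Commands for 3 -> 1 nodes, run after nodes 2 & 3 are in maintenance.

    Assumption: "disable quorum" means turning off both client and server quorum.
    """
    cmds: List[Command] = []
    for vol in volumes:
        cmds.append(set_option(vol, "cluster.quorum-type", "none"))
        cmds.append(set_option(vol, "cluster.server-quorum-type", "none"))
    return cmds


def scale_up(split_brain_files: Dict[str, List[str]]) -> List[Command]:
    """Commands for 1 -> 3 nodes, run after powering on only ONE stale node.

    `split_brain_files` maps a volume name to the files reported as split-brain.
    """
    cmds: List[Command] = []
    for vol, files in split_brain_files.items():
        # List the files Gluster has flagged as split-brain.
        cmds.append(["gluster", "volume", "heal", vol, "info", "split-brain"])
        # Resolve each one in favour of the copy with the latest modification time.
        for path in files:
            cmds.append(["gluster", "volume", "heal", vol,
                         "split-brain", "latest-mtime", path])
        # Re-enable quorum once the two nodes are in sync.
        cmds.append(set_option(vol, "cluster.quorum-type", "auto"))
        cmds.append(set_option(vol, "cluster.server-quorum-type", "server"))
    return cmds


if __name__ == "__main__":
    for cmd in scale_down(["data", "engine"]):
        print(" ".join(cmd))
    # Hypothetical file path, for illustration only.
    for cmd in scale_up({"data": ["/images/example.img"]}):
        print(" ".join(cmd))
```

Keeping the commands as plain lists makes it easy to review them before handing them to something like subprocess.run, which matters here because running them in the wrong order is exactly what the warning above is about.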
--
Doug


--

Chris Bartosiak-Jentys
Certico
Tel: 03333 444 884
Mob: 077 0246 8132
e-mail: ch...@certico.co.uk
www.certico.co.uk



--
Doug




_______________________________________________
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users
