On 3/14/24 00:23, Strahil Nikolov wrote:
Hi All,

Do we have a thread about the HA iSCSI article ?
I saw a few things that can be optimized/updated:

Hello Strahil,

Thank you for reaching out and starting this thread! I just want to say up top that we don't often receive feedback on our tech guides, but we absolutely value and appreciate it.



1. Point 3.7.1 Shouldn't be done as pcs has a native way to auth, assemble and start the cluster and corosync tinkering is no longer needed. Something like this:

echo 'somepass' | passwd --stdin hacluster
pcs host auth node1 addr=node1.example.com node2 addr=node2.example.com

pcs cluster setup CLUSTERNAME node1 node2 totem token=10000 --enable --start


Great point. While I was aware of this, I'm very used to the older manual process of configuring the cluster communication layer. This comment applies to a lot of our newer guides that use pcs, and we'll work on addressing this throughout them all.


2. Point 3.8 goes against any high availability best practice and against Red Hat's support policy - it should have a red label stating that this is done only for the demo!

Absolutely agree. There is a big red "WARNING" at the bottom of section 3.8 that touches on this point, but we can move it to the top for better visibility.

It is my understanding that Red Hat doesn't support a few Pacemaker configurations that LINBIT does, including clusters without node-level fencing - although we strongly suggest fencing whenever possible.


3. In point 3.10 I highly recommend setting scsi_sn for the ocf:heartbeat:iSCSILogicalUnit resource - by default the software picks the same SN for the first LUN, and when one client attaches two LUNs (from two separate clusters), multipath will treat the two sources as one and aggregate the paths - it becomes a real mess.

Good catch. The automation we use internally for testing HA iSCSI deployments sets the scsi_sn, so I'm surprised that I missed it in this guide. I also recall that some older VMware products required the scsi_sn to match for smooth failover. Either way, we will update the guide.
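For anyone following along, a setting like this is what Strahil is describing. This is an illustrative sketch only - the resource name, IQN, LUN number, DRBD device path, and serial number below are placeholders, not values from the guide:

```shell
# Create an iSCSI LU resource with an explicitly set, unique scsi_sn
# so multipath on the initiator never confuses LUNs from different
# clusters that happen to share a default serial number.
# All names/values here are hypothetical examples.
pcs resource create iscsi_lun0 ocf:heartbeat:iSCSILogicalUnit \
    target_iqn=iqn.2024-03.com.example:target0 \
    lun=0 \
    path=/dev/drbd1000 \
    scsi_sn=f9a2b7c1 \
    op monitor interval=15s
```

The key point is that scsi_sn must be unique per LUN across every cluster a given initiator connects to.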


4. Consider an LVM filter for the DRBD device - a client might use the LUN as a PV in a volume group, and then the situation will get messy - the cluster won't be able to demote the node and Pacemaker will fence it.

Excellent suggestion for a common use case. Will include.
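For reference, the idea is to keep LVM on the cluster nodes from scanning and activating a PV signature that an initiator wrote inside the exported LUN. A minimal lvm.conf sketch, assuming the OS disk is /dev/sda (adjust the accept pattern for your hardware; device names here are placeholders):

```shell
# /etc/lvm/lvm.conf excerpt (illustrative): accept only the local
# system disk and reject everything else, including /dev/drbd* and
# the DRBD backing device, so a client-created PV inside the LUN is
# never activated on the iSCSI target nodes.
devices {
    filter = [ "a|^/dev/sda.*|", "r|.*|" ]
}
```

After changing the filter, regenerating the initramfs is usually needed as well so early-boot LVM scanning honors it.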


5. Consider using fencing delay when using 2-node clusters - in case of split brain scenario the node with more resources will survive:

pcs resource defaults update priority=1
pcs property set priority-fencing-delay=10
<snip>

We definitely practice this for 2-node clusters we deploy with fencing configured.

I don't think it makes sense to include this in a guide where we do not configure fencing, but it should be included in a more general reference on how to properly configure fencing. I will make sure that we have this information somewhere for public consumption, and possibly link to it within all our tech guides.

Best Regards,
Matt Kereczman

P.S. We are happy to receive feedback and suggestions through this mailing list. Thanks again for what you have provided. You can also make suggestions about user's guides through opening issues in our GitHub repository (https://github.com/LINBIT/linbit-documentation). For technical "how-to" guides or other documentation content that isn't a user's guide, such as a knowledge base article or blog article, another option is to reach out to [email protected].
