Re: [ClusterLabs] 答复: the PAF switchover does not happen if the VIP resource is stopped

2018-04-26 Thread emmanuel segura
But I think using ifdown isn't the correct way to test the cluster; this topic has been discussed many times. 2018-04-26 9:53 GMT+02:00 范国腾 : > 1. There is no failure in initial status. sds1 is master > > > > 2. ifdown the sds1 VIP network card. > > 3. ifup the sds1 VIP network

Re: [ClusterLabs] the PAF switchover does not happen if the VIP resource is stopped

2018-04-25 Thread emmanuel segura
https://oss.clusterlabs.org/pipermail/pacemaker/2013-July/019224.html 2018-04-25 10:58 GMT+02:00 范国腾 : > Hi, > > > > Our lab has two resource: (1) PAF (master/slave)(2) VIP (bind to the > master PAF node). The configuration is in the attachment. > > Each node has two

Re: [ClusterLabs] HALVM monitor action fail on slave node. Possible bug?

2018-04-13 Thread emmanuel segura
The first thing you need to configure is stonith, because you have this constraint: "constraint order promote DrbdResClone then start HALVM". To recover and promote drbd to master when a node crashes, configure the drbd fencing handler. Pacemaker executes monitor on both nodes, so this

Re: [ClusterLabs] custom resource agent FAILED (blocked)

2018-04-12 Thread emmanuel segura
The start function needs to start the resource when monitor doesn't return success. 2018-04-12 23:38 GMT+02:00 Bishoy Mikhael : > Hi All, > > I'm trying to create a resource agent to promote a standby HDFS namenode > to active when the virtual IP failover to another node. >
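
A minimal sketch of that start/monitor relationship, assuming an OCF-style shell agent. The names demo_monitor, demo_start, demo_stop and STATE_FILE are hypothetical, and the "service" is faked with a state file; a real agent would call its own start command instead of touch.

```shell
#!/bin/sh
# Sketch of an OCF-style agent whose start action relies on monitor.
# The "service" is faked with a state file (hypothetical placeholder).
: "${OCF_SUCCESS:=0}" "${OCF_ERR_GENERIC:=1}" "${OCF_NOT_RUNNING:=7}"
STATE_FILE="${STATE_FILE:-${TMPDIR:-/tmp}/demo_agent.state}"

demo_monitor() {
    # Running if the state file exists, otherwise "not running".
    [ -f "$STATE_FILE" ] && return "$OCF_SUCCESS"
    return "$OCF_NOT_RUNNING"
}

demo_start() {
    # Already running: report success without starting again.
    demo_monitor && return "$OCF_SUCCESS"
    # Monitor did not return success, so actually start the resource.
    touch "$STATE_FILE" || return "$OCF_ERR_GENERIC"
    # Report success only if monitor now agrees.
    demo_monitor
}

demo_stop() {
    # Stopping an already stopped resource is also a success.
    rm -f "$STATE_FILE"
    return "$OCF_SUCCESS"
}
```

With this shape, calling demo_start twice in a row succeeds both times, which is what the cluster expects from a start action.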

Re: [ClusterLabs] pacemaker pingd with ms drbd = double masters short time when disconnected networks.

2017-12-19 Thread emmanuel segura
You need to configure stonith and the drbd stonith handler. 2017-12-19 8:19 GMT+01:00 Прокопов Павел : > Hello! > > pacemaker pingd with ms drbd = double masters short time when disconnected > networks. > > My crm config: > > node 168885811: pp-pacemaker1.heliosoft.ru > node

Re: [ClusterLabs] set node in maintenance - stop corosync - node is fenced - is that correct ?

2017-10-16 Thread emmanuel segura
"I put a node in maintenance mode"? Do you mean you put the cluster in maintenance mode? 2017-10-16 19:24 GMT+02:00 Lentes, Bernd : > Hi, > > i have the following behavior: I put a node in maintenance mode, > afterwards stop corosync on that node with

Re: [ClusterLabs] IP clone issue

2017-09-05 Thread emmanuel segura
he IP, so > that if any node goes down, another node can take over the failed node’s > "request bucket". Otherwise, requests intended for the failed node would be > discarded." > > To have this functionality do I must have a static IP set on the > interfaces ? > > > &

Re: [ClusterLabs] IP clone issue

2017-09-05 Thread emmanuel segura
I have never tried to set a virtual IP on an interface without an IP, because the VIP is a secondary IP that switches between nodes, not a primary IP. 2017-09-05 15:41 GMT+02:00 Octavian Ciobanu : > Hello all, > > I've encountered an issue with IP cloning. > > Based the "Pacemaker

Re: [ClusterLabs] DRBD or SAN ?

2017-07-18 Thread emmanuel segura
Yes. If you are using drbd in master/slave, first promote the resource to master and then start the VM on that node; if you use drbd in multimaster, only start the VM once drbd is started. Use a SAN, with multipath. 2017-07-18 16:34 GMT+02:00 Lentes, Bernd : > > >

Re: [ClusterLabs] DRBD split brain after Cluster node recovery

2017-07-12 Thread emmanuel segura
You need to configure cluster fencing and the drbd fencing handler; this way, the cluster can recover without manual intervention. 2017-07-12 11:33 GMT+02:00 ArekW : > Hi, > Can it be fixed that the drbd is entering split brain after cluster > node recovery? After few tests I

Re: [ClusterLabs] Oracle 12c with Pacemaker and GFS2

2017-07-07 Thread emmanuel segura
I think it is a good idea to first show your config and cluster logs, because I have never seen any limitation on running active/active in pacemaker. 2017-07-06 21:52 GMT+02:00 Jesse P. Johnson : > ALL, > > > I have setup an active/passive cluster using Pacemaker, CLVM, and

Re: [ClusterLabs] PCSD Certificate

2017-07-06 Thread emmanuel segura
I don't know what happens if the SSL certificate has expired, but looking in /usr/lib/pcsd/ssl.rb I found this function: def generate_cert_key_pair(server_name) name = "/C=US/ST=MN/L=Minneapolis/O=pcsd/OU=pcsd/CN=#{server_name}" ca = OpenSSL::X509::Name.parse(name) key = OpenSSL::PKey::RSA.new(2048)

Re: [ClusterLabs] Installing on SLES 12 -- Where's the Repos?

2017-06-16 Thread emmanuel segura
You can go ahead without updates; anyway, if you don't want to pay for support, use CentOS or another distro. 2017-06-16 10:14 GMT+02:00 Eric Robinson : > > > - You could test it for free, you just need to register > > - to https://scc.suse.com/login > > - After that, you

Re: [ClusterLabs] Need to replace EMC shared disk with an EMC disk from a different EMC Storage

2017-06-15 Thread emmanuel segura
Please give more information and, if you are using lvm, share your lvm cluster information and the cluster config too. 2017-06-15 9:22 GMT+02:00 : > Hi. We need to clear an old EMC storage and the only thing that's left > there is the shared disk of our Pacemaker cluster.

Re: [ClusterLabs] Antw: Re: Antw: Re: Antw: Re: Q: cluster-dlm[4494]: setup_cpg_daemon: daemon cpg_join error retrying

2017-03-06 Thread emmanuel segura
that you tell the cluster not to perform any action, because you are doing an intervention. 2017-03-06 9:14 GMT+01:00 Ulrich Windl <ulrich.wi...@rz.uni-regensburg.de>: >>>> emmanuel segura <emi2f...@gmail.com> wrote on 03.03.2017

Re: [ClusterLabs] Antw: Re: Antw: Re: Q: cluster-dlm[4494]: setup_cpg_daemon: daemon cpg_join error retrying

2017-03-03 Thread emmanuel segura
Use something like standby? 2017-03-03 16:02 GMT+01:00 Ulrich Windl <ulrich.wi...@rz.uni-regensburg.de>: >>>> emmanuel segura <emi2f...@gmail.com> wrote on 03.03.2017 at 15:35 in > message >

Re: [ClusterLabs] Antw: Re: Q: cluster-dlm[4494]: setup_cpg_daemon: daemon cpg_join error retrying

2017-03-03 Thread emmanuel segura
I think it is a good idea to put your cluster in maintenance mode when you do an update. 2017-03-03 15:11 GMT+01:00 Ulrich Windl <ulrich.wi...@rz.uni-regensburg.de>: >>>> emmanuel segura <emi2f...@gmail.com> wrote on 03.03.2017 at 14:22 in > message >

Re: [ClusterLabs] Q: cluster-dlm[4494]: setup_cpg_daemon: daemon cpg_join error retrying

2017-03-03 Thread emmanuel segura
Was your cluster in maintenance state? 2017-03-03 13:59 GMT+01:00 Ulrich Windl : > Hello! > > After Update and reboot of 2nd of three nodes (SLES11 SP4) I see a > "cluster-dlm[4494]: setup_cpg_daemon: daemon cpg_join error retrying" message > when I expected

Re: [ClusterLabs] Oralsnr/Oracle resources agents

2017-02-23 Thread emmanuel segura
I think not. In /usr/lib/ocf/resource.d/heartbeat/oralsnr, the start function oralsnr_start runs "output=`echo lsnrctl start $listener | runasdba`" and the stop function oralsnr_stop runs "output=`echo lsnrctl stop $listener | runasdba`", where the listener variable is the resource agent parameter given by pacemaker :

Re: [ClusterLabs] Antw: Oracle Stopping

2017-02-22 Thread emmanuel segura
The first place you need to look is the Oracle log. 2017-02-22 8:43 GMT+01:00 Ulrich Windl : Chad Cravens wrote on 22.02.2017 at 02:44 in > message >

Re: [ClusterLabs] SBD with shared block storage (and watchdog?)

2017-02-13 Thread emmanuel segura
I missed that: using the same device for the partition and sbd :( is a really bad idea. 2017-02-13 19:04 GMT+01:00 Klaus Wenninger : > On 02/13/2017 06:34 PM, dur...@mgtsciences.com wrote: >> I am working to get an active/active cluster running. >> I have Windows 10 running 2 Fedora 25

Re: [ClusterLabs] Can't create a resource for ocf:heartbeat:oracle and oraclsnr.

2017-01-30 Thread emmanuel segura
Resource Agent > OCF:heartbeat:oracle and ocf:heartbeat:orclsnr. > > Thanks > > Jihed M’SELMI > Mobile: +21658433664 > http://about.me/jihed.mselmi > > On Tue, Jan 31, 2017 at 12:16 AM, emmanuel segura <emi2f...@gmail.com> > wrote: >> >>

Re: [ClusterLabs] Can't create a resource for ocf:heartbeat:oracle and oraclsnr.

2017-01-30 Thread emmanuel segura
Please, if you need help, the first thing to do is show your cluster configuration. 2017-01-30 23:15 GMT+01:00 Jihed M'selmi : > I tried to install two resources: a resource for oracle database and oracle > listener: but the pcmk can't install the resource (red hat 7.3) usint

Re: [ClusterLabs] VirtualDomain started in two hosts

2017-01-17 Thread emmanuel segura
thinks resource vm-vdicdb01 is only > started on node vdicnode01-priv. > > Thanks a lot. > > > > 2017-01-17 10:03 GMT+01:00 emmanuel segura <emi2f...@gmail.com>: >> >> sorry, >> >> But do you mean, when you say, you migrated the vm outside of the >>

Re: [ClusterLabs] VirtualDomain started in two hosts

2017-01-17 Thread emmanuel segura
Sorry, but what do you mean when you say you migrated the VM outside of the cluster? To a server outside of your cluster? 2017-01-17 9:27 GMT+01:00 Oscar Segarra : > Hi, > > I have configured a two node cluster whewe run 4 kvm guests on. > > The hosts are: > vdicnode01 >

Re: [ClusterLabs] setting up SBD_WATCHDOG_TIMEOUT, stonith-timeout and stonith-watchdog-timeout

2016-12-08 Thread emmanuel segura
the only thing that I can say is: sbd is a realtime process 2016-12-08 11:47 GMT+01:00 Jehan-Guillaume de Rorthais : > Hello, > > While setting this various parameters, I couldn't find documentation and > details about them. Bellow some questions. > > Considering the watchdog

Re: [ClusterLabs] OS Patching Process

2016-11-22 Thread emmanuel segura
I have been using this setup: iscsi_disks -> lvm volume -> drbd_on_top_of_lvm -> filesystem. To resize: add_one_iscsi_device_to_every_cluster_node_first -> now_add_device_the_volume_group_on_every_cluster_node -> now_resize_the_volume_on_every_cluster_node : now you have every cluster with the same logical
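
The resize order described here could be sketched roughly as follows. This is only a dry-run helper that prints the commands instead of running them, and every name in it (device, volume group, logical volume, DRBD resource, size) is a hypothetical placeholder. The last two steps (drbdadm resize and resize2fs) are an assumption about how such a stack is usually finished, since the message is cut off before that point; resize2fs also assumes an ext3/ext4 filesystem.

```shell
#!/bin/sh
# Print (do not run) the resize steps for a stack of
# iSCSI disk -> LVM volume -> DRBD -> filesystem.
# All names below are hypothetical placeholders.
NEW_DISK=/dev/sdX      # newly attached iSCSI device
VG=vg_data             # volume group backing DRBD
LV=lv_drbd             # logical volume used as the DRBD backing device
GROW=+10G              # amount to add to the logical volume
DRBD_RES=r0            # DRBD resource name
DRBD_DEV=/dev/drbd0    # DRBD device carrying the filesystem

resize_plan() {
    echo "# on every cluster node:"
    echo "pvcreate $NEW_DISK"
    echo "vgextend $VG $NEW_DISK"
    echo "lvextend -L $GROW /dev/$VG/$LV"
    echo "# then once, on the DRBD primary (assumed follow-up steps):"
    echo "drbdadm resize $DRBD_RES"
    echo "resize2fs $DRBD_DEV"
}
```

Calling resize_plan prints the plan for review; the point of the ordering is that the backing volume is grown identically on every node before DRBD and the filesystem are told about the new size.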

Re: [ClusterLabs] Antw: Re: OCFS2 on cLVM with node waiting for fencing timeout

2016-10-13 Thread emmanuel segura
If you want to reduce the multipath switching time when one controller goes down, see https://www.redhat.com/archives/dm-devel/2009-April/msg00266.html 2016-10-13 10:27 GMT+02:00 Ulrich Windl : Eric Ren wrote on 13.10.2016 at 09:31 in

Re: [ClusterLabs] Pacemaker and Oracle ASM

2016-10-10 Thread emmanuel segura
Why don't you use Oracle RAC with ASM? 2016-10-07 18:46 GMT+02:00 Chad Cravens : > Hello: > > I'm working on a project where the client is using Oracle ASM (volume > manager) for database storage. I have implemented a cluster before using > LVM with ext4 and understand

Re: [ClusterLabs] ocf:heartbeat:LVM or /etc/lvm/lvm.conf settings question

2016-08-10 Thread emmanuel segura
> cdrom drives: > filter = [ "r|/dev/.*/by-path/.*|", "r|/dev/.*/by-id/.*|", > "r|/dev/fd.*|", "r|/dev/cdrom|", "a/.*/" ] > > -Original Message- > From: emmanuel segura [mailto:emi2f...@gmail.com] > Sent: Wednesday, Aug

Re: [ClusterLabs] ocf:heartbeat:LVM or /etc/lvm/lvm.conf settings question

2016-08-10 Thread emmanuel segura
Does your lvm filter include the drbd devices /dev/drbdX? 2016-08-10 21:38 GMT+02:00 Darren Kinley : > Hi, > > I have an LVM logical volume and used DRBD to replicate it to another > server. > The /dev/drbd0 has PV/VG/LVs which are mostly working. > I have colocation and

Re: [ClusterLabs] Bloody Newbie needs help for OCFS2 on pacemaker+corosync+pcs

2016-08-02 Thread emmanuel segura
er, I am just becoming familiar with all that stuff. > > Thanks for any help, > > Thomas Hluchnik > > > On Tuesday 02 August 2016 15:28:17 emmanuel segura wrote: >> why you don't use the resource agent for using o2cb? This script for >> begin used with ocfs legacy mo

Re: [ClusterLabs] Bloody Newbie needs help for OCFS2 on pacemaker+corosync+pcs

2016-08-02 Thread emmanuel segura
Why don't you use the resource agent for o2cb? This script is for use with ocfs legacy mode. 2016-08-02 12:39 GMT+02:00 Kyle O'Donnell : > er forgot > > primitive p_o2cb lsb:o2cb \ > op monitor interval="10" timeout="30" \ > op start interval="0"

Re: [ClusterLabs] Antw: Two node Active/Active Asterisk+GFS2+DLM+fence_xvm Cluster

2016-07-15 Thread emmanuel segura
Maybe you need interleave=true in your clones. 2016-07-15 8:32 GMT+02:00 Ulrich Windl : TEG AMJG wrote on 14.07.2016 at 23:47 in message > : >> Dear list >> >> I am

Re: [ClusterLabs] Clusvcadm -Z substitute in Pacemaker

2016-07-13 Thread emmanuel segura
Using pcs resource unmanage leaves the resource monitoring active; I usually set the monitor interval=0 :) 2016-07-11 10:43 GMT+02:00 Tomas Jelinek : > On 9.7.2016 at 06:39 jaspal singla wrote: >> >> Hello Everyone, >> >> I need little help, if anyone can give some

Re: [ClusterLabs] RES: Pacemaker and OCFS2 on stand alone mode

2016-07-07 Thread emmanuel segura
dlm_tool dump ? 2016-07-07 18:57 GMT+02:00 Carlos Xavier : > Thank you for the fast reply > >> >> have you configured the stonith and drbd stonith handler? >> > > Yes. they were configured. > The cluster was running fine for more than 4 years, until we lost one host >

Re: [ClusterLabs] Pacemaker and OCFS2 on stand alone mode

2016-07-07 Thread emmanuel segura
Have you configured stonith and the drbd stonith handler? 2016-07-07 16:43 GMT+02:00 Carlos Xavier : > Hi. > We had a Pacemaker cluster running OCFS2 filesystem over a DRBD device and we > completely lost one of the hosts. > Now I need some help to recover the data on

Re: [ClusterLabs] Antw: RES: Performance of a mirrored LV (cLVM) with OCFS: Attempt to monitor it

2016-05-27 Thread emmanuel segura
Hi, but doesn't the latest lvm version take care of the alignment? 2016-05-27 18:37 GMT+02:00 Ken Gaillot : > On 05/27/2016 12:58 AM, Ulrich Windl wrote: >> Hi! >> >> Thanks for this info. We actually run the "noop" scheduler for the SAN >> storage (as per manufacturer's

Re: [ClusterLabs] Antw: ringid interface FAULTY no resource move

2016-05-04 Thread emmanuel segura
Use fencing and the drbd fencing handler. 2016-05-04 14:46 GMT+02:00 Rafał Sanocki : > Resources should move to second node when any interface is down. > > > > > On 2016-05-04 at 14:41, Ulrich Windl wrote: > > Rafal Sanocki wrote on

Re: [ClusterLabs] Set "start-failure-is-fatal=false" on only one resource?

2016-03-25 Thread emmanuel segura
Try using on-fail for the single resource. 2016-03-25 0:22 GMT+01:00 Adam Spiers : > Sam Gardner wrote: >> I'm having some trouble on a few of my clusters in which the DRBD Slave >> resource does not want to come up after a reboot until I manually run >>

Re: [ClusterLabs] Issue with Stonith Resource parameters

2016-03-08 Thread emmanuel segura
I think you should give the parameters to the stonith agent; anyway, show your config. 2016-03-09 5:29 GMT+01:00 vija ar : > I have configured SLEHA cluster on cisco ucs boxes with ipmi configured, i > have tested IPMI using ipmitool, however ipmitool to function neatly i have >

Re: [ClusterLabs] Pacemaker issue lsb service

2016-03-05 Thread emmanuel segura
If you need help, the first thing that you need to do is show your cluster logs. 2016-03-05 15:17 GMT+01:00 Thorsten Stremetzne : > Hello all, > > I have built an HA setup for a OpenVPN server. > In my setup there are two hosts, running Ubuntu Linux, pacemaker & >

Re: [ClusterLabs] Pacemaker issue when ethernet interface is pulled down

2016-02-14 Thread emmanuel segura
Use fencing, and after you have configured it, use iptables to test your cluster; with iptables you can block ports 5404 and 5405. 2016-02-14 14:09 GMT+01:00 Debabrata Pani : > Hi, > We ran into some problems when we pull down the ethernet interface
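
A minimal sketch of such a test, assuming corosync is using its default UDP ports 5404 and 5405. The helper name corosync_block and the RUN switch are hypothetical: by default the function only prints the iptables commands, and it executes them (as root) only when RUN=1 is set.

```shell
#!/bin/sh
# Simulate a cluster network failure by dropping corosync traffic.
# Assumption: corosync uses its default UDP ports 5404 and 5405.
# RUN is a hypothetical switch: unset = print the commands only,
# RUN=1 = execute them (requires root).

corosync_block() {
    action=$1   # -I inserts the DROP rules, -D deletes them again
    for chain in INPUT OUTPUT; do
        cmd="iptables $action $chain -p udp --dport 5404:5405 -j DROP"
        if [ "${RUN:-}" = "1" ]; then
            $cmd
        else
            echo "$cmd"
        fi
    done
}
```

Running corosync_block -I on one node starts the simulated outage and corosync_block -D ends it. Unlike pulling the interface down, the node keeps its addresses, so the cluster sees a pure communication loss and fencing can be observed.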

Re: [ClusterLabs] HA configuration

2016-02-04 Thread emmanuel segura
You need to be sure that your redis resource has master/slave support, and I think this colocation needs to be inverted, from "colocation resource_location1 inf: redis_clone:Master kamailio" to "colocation resource_location1 inf: kamailio redis_clone:Master". You need an order too: order resource_order1

Re: [ClusterLabs] DLM not working on my GFS2/pacemaker cluster

2016-01-19 Thread emmanuel segura
Please share your cluster config and say whether your fencing is working. 2016-01-19 3:47 GMT+01:00 : > One of my clusters is having a problem. It's no longer able to set up its > GFS2 mounts. I've narrowed the problem down a bit. Here's the output when I > try to start the DLM

Re: [ClusterLabs] DLM not working on my GFS2/pacemaker cluster

2016-01-19 Thread emmanuel segura
://pastebin.com/eAiq2yJ9 > > Another cluster is running fine with an identical configuration. > > On 2016-01-19 03:49, emmanuel segura wrote: >> >> please share your cluster config and say if your fencing is working. >> >> 2016-01-19 3:47 GMT+01:00 <dan...@be

Re: [ClusterLabs] Cluster resources -restart automatically

2016-01-11 Thread emmanuel segura
You can use on-fail in the stop operation, and for your other questions you can use colocation + order or, better, a group. For example: "group mygroup resource1 resource2". When the resource1 monitor fails, resource2 restarts. 2016-01-11 17:09 GMT+01:00 John Gogu : >

Re: [ClusterLabs] SBD Latency Warnings

2015-12-30 Thread emmanuel segura
I'm not an sbd expert, but I will try to describe one of these warnings. sbd: WARN: Pacemaker state outdated (age: 4) in sbd source code "./src/sbd-md.c"

Re: [ClusterLabs] start service after filesystemressource

2015-11-20 Thread emmanuel segura
Using a group is simpler. Example: group mygroup resource1 resource2 resource3 order o_drbd_before_services inf: ms_drbd_export:promote mygroup:start 2015-11-20 15:45 GMT+01:00 Andrei Borzenkov : > 20.11.2015 16:38, haseni...@gmx.de wrote: > >> Hi, >> I want to start

Re: [ClusterLabs] split brain cluster

2015-11-16 Thread emmanuel segura
: > Hi Emmanuel, > > I'm not sure, how can i check it? > > Greetings Richard > > On Mon 16 Nov 2015 at 14:58 emmanuel segura <emi2f...@gmail.com> wrote: >> >> you configured the stonith? >> >> 2015-11-16 14:43 GMT+01:00 Richard Korsten <rich...@r

Re: [ClusterLabs] split brain cluster

2015-11-16 Thread emmanuel segura
ngs. > > On Mon 16 Nov 2015 at 15:09 emmanuel segura <emi2f...@gmail.com> wrote: >> >> >> http://clusterlabs.org/doc/en-US/Pacemaker/1.1/html/Clusters_from_Scratch/ch08.html >> and >> https://github.com/ClusterLabs/pacemaker/blob/master/doc/pcs-crmsh