Re: [ClusterLabs] Re: Fencing errors

2019-05-24 Thread Lopez, Francisco Javier [Global IT]
Hello guys.

Please forget about this issue; I set up a process that asks for the status 
every 10 secs and I realized
the process takes around 25 secs when it fails. If this helps any other, this 
is what I did in a loop:

# time fence_vmware_soap --ip  --username "x" -p "x" --ssl 
--ssl-insecure --action status --plug 
ao-pg02-p.axadmin.net,ao-pg01-p.axadmin.net
Status: ON

real0m21.999s  <<<---
user0m15.190s
sys  0m0.294s

The normal execution takes around 14 secs, hence it does not fail.
Since I updated the pcmk_monitor_timeout to 30 the process is running as 
expected.

Now it's my turn to review why of that difference at vmware.

Thx.
Javier

Francisco Javier​   Lopez
IT System Engineer   |  Global IT
O: +34 619 728 249|  M: +34 619 728 
249|
franciscojavier.lo...@solera.com<mailto:franciscojavier.lo...@solera.com>   
 |  Solera.com<https://www.solera.com/>
Audatex Datos, S.A.  |  Avda. de Bruselas, 36, Salida 16, A‑1 
(Diversia),   Alcobendas  ,   Madrid  ,   28108   , 
  Spain
[cid:image630630.png@5A05821E.C9C08C85]

On 5/23/2019 8:29 PM, Lopez, Francisco Javier [Global IT] wrote:
Hello again Ken et all.

I realized about many things investigating this issue but I feel I need a bit 
more help from you guys.

It's clear the monitoring process is reporting a timeout. Although I've 
increased this timeout to 30c using pcmk_monitoring_timeout,
and during this last 2 hours the process did not fail, I'd like to understand 
more in detail how this process works and if I'm
getting a timeout after 20 secs, it looks to me something else could be 
happening in my systems.

I tried enabling debug again and, as before, the 'debug' option creates the 
file but does not update anything unless I enable 'verbose'.
Funny thing because when I enable it, I hit a bug and the fencing does not 
start:

https://bugzilla.redhat.com/show_bug.cgi?id=1549366<https://nam01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fbugzilla.redhat.com%2Fshow_bug.cgi%3Fid%3D1549366=01%7C01%7C%7C052e99fae77e4771667008d6dfaca27f%7Cc45b48f313bb448b9356ba7b863c2189%7C1=R7BIdUDQuFGRCknMramt0zN3E%2ByjrUctVhf7bxpoBpw%3D=0>

I enabled debug at corosync layer and I got some more information that was nice 
to better understand this issue but still, not enough
information to narrow down where the issue comes from.

Said this, I'd like to know, if there is a way to review more in detail what 
the monitoring process is doing like ping, status, etc
and it that time is dedicated to the same action all those secs.

Any idea will be more than welcome.

As always, appreciate your help.

Regards
Javier



Francisco Javier​   Lopez
IT System Engineer   |  Global IT
O: +34 619 728 249|  M: +34 619 728 
249|
franciscojavier.lo...@solera.com<mailto:franciscojavier.lo...@solera.com>   
 |  
Solera.com<https://nam01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.solera.com%2F=01%7C01%7C%7C052e99fae77e4771667008d6dfaca27f%7Cc45b48f313bb448b9356ba7b863c2189%7C1=vaYYebi86RJFfIDUlI5UiL2M7UGfv3kgbp%2FE9K8A7UE%3D=0>
Audatex Datos, S.A.  |  Avda. de Bruselas, 36, Salida 16, A‑1 
(Diversia),   Alcobendas  ,   Madrid  ,   28108   , 
  Spain
[cid:part6.A6B7221B.10233C2B@solera.com]

On 5/21/2019 6:19 PM, Ken Gaillot wrote:

On Tue, 2019-05-21 at 11:10 +, Lopez, Francisco Javier [Global IT]
wrote:


Hello guys !

Need your help to try to understand and debug what I'm facing in one
of my clusters.

I set up fencing with this detail:

# pcs -f stonith_cfg stonith create fence_ao_pg01 fence_vmware_soap
ipaddr= ssl_insecure=1 login="" passwd=""
pcmk_reboot_action=reboot pcmk_host_list="ao-pg01-p.axadmin.net"
power_wait=3 op monitor interval=60s
# pcs -f stonith_cfg stonith create fence_ao_pg02 fence_vmware_soap
ipaddr= ssl_insecure=1 login="" passwd=""
pcmk_reboot_action=reboot pcmk_host_list="ao-pg02-p.axadmin.net"
power_wait=3 op monitor interval=60s

# pcs -f stonith_cfg constraint location fence_ao_pg01 avoids ao-
pg01-p.axadmin.net=INFINITY
# pcs -f stonith_cfg constraint location fence_ao_pg02 avoids ao-
pg02-p.axadmin.net=INFINITY

# pcs cluster cib-push stonith_cfg

The pcs status shows all ok during some time and then it turns to:

[root@ao-pg01-p ~]# pcs status --full
Cluster name: ao_cl_p_01
Stack: corosync
Current DC: ao-pg01-p.axadmin.net (1) (version 1.1.19-8.el7_6.4-
c3c624ea3d) - partition with quorum
Last updated: Tue May 21 12:18:46 2019
Last change: Fri May 17 18:54:32 2019 by hacluster via crmd on ao-
pg01-p.axadmin.net

2 nodes configured
3 resources configured

Online: [ ao-pg01-p.axadmin.net (1) ao-pg02-p.axadmin.net (2) ]

Full list of resources:

 ao-cl-p-01-vip01(ocf::heartbeat:IPaddr2):Started ao-pg01-
p.axadmin.net
 fence_ao_pg01(st

Re: [ClusterLabs] Re: Fencing errors

2019-05-23 Thread Lopez, Francisco Javier [Global IT]
Hello again Ken et all.

I realized about many things investigating this issue but I feel I need a bit 
more help from you guys.

It's clear the monitoring process is reporting a timeout. Although I've 
increased this timeout to 30c using pcmk_monitoring_timeout,
and during this last 2 hours the process did not fail, I'd like to understand 
more in detail how this process works and if I'm
getting a timeout after 20 secs, it looks to me something else could be 
happening in my systems.

I tried enabling debug again and, as before, the 'debug' option creates the 
file but does not update anything unless I enable 'verbose'.
Funny thing because when I enable it, I hit a bug and the fencing does not 
start:

https://bugzilla.redhat.com/show_bug.cgi?id=1549366

I enabled debug at corosync layer and I got some more information that was nice 
to better understand this issue but still, not enough
information to narrow down where the issue comes from.

Said this, I'd like to know, if there is a way to review more in detail what 
the monitoring process is doing like ping, status, etc
and it that time is dedicated to the same action all those secs.

Any idea will be more than welcome.

As always, appreciate your help.

Regards
Javier



Francisco Javier​   Lopez
IT System Engineer   |  Global IT
O: +34 619 728 249|  M: +34 619 728 
249|
franciscojavier.lo...@solera.com<mailto:franciscojavier.lo...@solera.com>   
 |  Solera.com<https://www.solera.com/>
Audatex Datos, S.A.  |  Avda. de Bruselas, 36, Salida 16, A‑1 
(Diversia),   Alcobendas  ,   Madrid  ,   28108   , 
  Spain
[cid:image790996.png@A70D2A26.F4AADDCB]

On 5/21/2019 6:19 PM, Ken Gaillot wrote:

On Tue, 2019-05-21 at 11:10 +0000, Lopez, Francisco Javier [Global IT]
wrote:


Hello guys !

Need your help to try to understand and debug what I'm facing in one
of my clusters.

I set up fencing with this detail:

# pcs -f stonith_cfg stonith create fence_ao_pg01 fence_vmware_soap
ipaddr= ssl_insecure=1 login="" passwd=""
pcmk_reboot_action=reboot pcmk_host_list="ao-pg01-p.axadmin.net"
power_wait=3 op monitor interval=60s
# pcs -f stonith_cfg stonith create fence_ao_pg02 fence_vmware_soap
ipaddr= ssl_insecure=1 login="" passwd=""
pcmk_reboot_action=reboot pcmk_host_list="ao-pg02-p.axadmin.net"
power_wait=3 op monitor interval=60s

# pcs -f stonith_cfg constraint location fence_ao_pg01 avoids ao-
pg01-p.axadmin.net=INFINITY
# pcs -f stonith_cfg constraint location fence_ao_pg02 avoids ao-
pg02-p.axadmin.net=INFINITY

# pcs cluster cib-push stonith_cfg

The pcs status shows all ok during some time and then it turns to:

[root@ao-pg01-p ~]# pcs status --full
Cluster name: ao_cl_p_01
Stack: corosync
Current DC: ao-pg01-p.axadmin.net (1) (version 1.1.19-8.el7_6.4-
c3c624ea3d) - partition with quorum
Last updated: Tue May 21 12:18:46 2019
Last change: Fri May 17 18:54:32 2019 by hacluster via crmd on ao-
pg01-p.axadmin.net

2 nodes configured
3 resources configured

Online: [ ao-pg01-p.axadmin.net (1) ao-pg02-p.axadmin.net (2) ]

Full list of resources:

 ao-cl-p-01-vip01(ocf::heartbeat:IPaddr2):Started ao-pg01-
p.axadmin.net
 fence_ao_pg01(stonith:fence_vmware_soap):Stopped
 fence_ao_pg02(stonith:fence_vmware_soap):Stopped

Node Attributes:
* Node ao-pg01-p.axadmin.net (1):
* Node ao-pg02-p.axadmin.net (2):

Migration Summary:
* Node ao-pg02-p.axadmin.net (2):
   fence_ao_pg01: migration-threshold=100 fail-count=100
last-failure='Sat May 18 00:22:22 2019'
* Node ao-pg01-p.axadmin.net (1):
   fence_ao_pg02: migration-threshold=100 fail-count=100
last-failure='Fri May 17 20:52:53 2019'

Failed Actions:
* fence_ao_pg01_start_0 on ao-pg02-p.axadmin.net 'unknown error' (1):
call=22, status=Timed Out, exitreason='',
last-rc-change='Sat May 18 00:19:49 2019', queued=0ms,
exec=20022ms
* fence_ao_pg02_start_0 on ao-pg01-p.axadmin.net 'unknown error' (1):
call=84, status=Timed Out, exitreason='',
last-rc-change='Fri May 17 20:52:33 2019', queued=0ms,
exec=20032ms

PCSD Status:
  ao-pg02-p.axadmin.net: Online
  ao-pg01-p.axadmin.net: Online

Daemon Status:
  corosync: active/disabled
  pacemaker: active/disabled
  pcsd: active/enabled


>From the output I see there seems to be a 'Timed Out' but I'd like to
understand if this is a configuration issue
or something else I'm not aware of.



When pacemaker starts a fence device, it issues a monitor command to
the fence agent. That command is what's timing out here.

The first thing I'd try is running the monitor command manually using
the parameters in the device configuration. The fence agent likely has
a debug option you could turn on to get more details.




I'm attaching part of the log that shows the problem related to 17-
May.

Regards
Francisco JavierLopez IT System Engineer |
Global IT

[ClusterLabs] Fencing errors

2019-05-21 Thread Lopez, Francisco Javier [Global IT]
Hello guys !

Need your help to try to understand and debug what I'm facing in one of my 
clusters.

I set up fencing with this detail:

# pcs -f stonith_cfg stonith create fence_ao_pg01 fence_vmware_soap ipaddr= 
ssl_insecure=1 login="" passwd="" pcmk_reboot_action=reboot 
pcmk_host_list="ao-pg01-p.axadmin.net" power_wait=3 op monitor interval=60s
# pcs -f stonith_cfg stonith create fence_ao_pg02 fence_vmware_soap ipaddr= 
ssl_insecure=1 login="" passwd="" pcmk_reboot_action=reboot 
pcmk_host_list="ao-pg02-p.axadmin.net" power_wait=3 op monitor interval=60s

# pcs -f stonith_cfg constraint location fence_ao_pg01 avoids 
ao-pg01-p.axadmin.net=INFINITY
# pcs -f stonith_cfg constraint location fence_ao_pg02 avoids 
ao-pg02-p.axadmin.net=INFINITY

# pcs cluster cib-push stonith_cfg

The pcs status shows all ok during some time and then it turns to:

[root@ao-pg01-p ~]# pcs status --full
Cluster name: ao_cl_p_01
Stack: corosync
Current DC: ao-pg01-p.axadmin.net (1) (version 1.1.19-8.el7_6.4-c3c624ea3d) - 
partition with quorum
Last updated: Tue May 21 12:18:46 2019
Last change: Fri May 17 18:54:32 2019 by hacluster via crmd on 
ao-pg01-p.axadmin.net

2 nodes configured
3 resources configured

Online: [ ao-pg01-p.axadmin.net (1) ao-pg02-p.axadmin.net (2) ]

Full list of resources:

 ao-cl-p-01-vip01(ocf::heartbeat:IPaddr2):Started ao-pg01-p.axadmin.net
 fence_ao_pg01(stonith:fence_vmware_soap):Stopped
 fence_ao_pg02(stonith:fence_vmware_soap):Stopped

Node Attributes:
* Node ao-pg01-p.axadmin.net (1):
* Node ao-pg02-p.axadmin.net (2):

Migration Summary:
* Node ao-pg02-p.axadmin.net (2):
   fence_ao_pg01: migration-threshold=100 fail-count=100 
last-failure='Sat May 18 00:22:22 2019'
* Node ao-pg01-p.axadmin.net (1):
   fence_ao_pg02: migration-threshold=100 fail-count=100 
last-failure='Fri May 17 20:52:53 2019'

Failed Actions:
* fence_ao_pg01_start_0 on ao-pg02-p.axadmin.net 'unknown error' (1): call=22, 
status=Timed Out, exitreason='',
last-rc-change='Sat May 18 00:19:49 2019', queued=0ms, exec=20022ms
* fence_ao_pg02_start_0 on ao-pg01-p.axadmin.net 'unknown error' (1): call=84, 
status=Timed Out, exitreason='',
last-rc-change='Fri May 17 20:52:33 2019', queued=0ms, exec=20032ms

PCSD Status:
  ao-pg02-p.axadmin.net: Online
  ao-pg01-p.axadmin.net: Online

Daemon Status:
  corosync: active/disabled
  pacemaker: active/disabled
  pcsd: active/enabled


>From the output I see there seems to be a 'Timed Out' but I'd like to 
>understand if this is a configuration issue
or something else I'm not aware of.

I'm attaching part of the log that shows the problem related to 17-May.

Regards
Francisco Javier​   Lopez
IT System Engineer   |  Global IT
O: +34 619 728 249|  M: +34 619 728 
249|
franciscojavier.lo...@solera.com   
 |  Solera.com
Audatex Datos, S.A.  |  Avda. de Bruselas, 36, Salida 16, A‑1 
(Diversia),   Alcobendas  ,   Madrid  ,   28108   , 
  Spain
[cid:image613011.png@EE748CA2.38730175]




" Este e-mail y sus archivos adjuntos son confidenciales y están dirigidos 
exclusivamente a la(s) persona(s) destinataria prevista. Si ha recibido este 
mensaje por error, por favor, notifique inmediatamente al remitente y elimine 
este mensaje. La empresa no firma contratos por e-mail y todas las 
negociaciones están sujetas a la firma de un contrato por escrito.

This e-mail and any attached files are confidential and intended for the named 
addressee(s) only. If you have received this message in error, please notify 
the sender and delete the email immediately. The company does not conclude 
contracts by email and all negotiations are subject to written contract. "

May 17 20:38:40 [127218] ao-pg01-p.axadmin.netpengine: info: 
unpack_node_loop:  Node 1 is already processed
May 17 20:38:40 [127218] ao-pg01-p.axadmin.netpengine: info: 
common_print:  ao-cl-p-01-vip01(ocf::heartbeat:IPaddr2):   
Started ao-pg01-p.axadmin.net
May 17 20:38:40 [127218] ao-pg01-p.axadmin.netpengine: info: 
common_print:  fence_ao_pg01   (stonith:fence_vmware_soap):Started 
ao-pg02-p.axadmin.net
May 17 20:38:40 [127218] ao-pg01-p.axadmin.netpengine: info: 
common_print:  fence_ao_pg02   (stonith:fence_vmware_soap):Started 
ao-pg01-p.axadmin.net
May 17 20:38:40 [127218] ao-pg01-p.axadmin.netpengine: info: 
pe_get_failcount:  fence_ao_pg02 has failed 12 times on ao-pg01-p.axadmin.net
May 17 20:38:40 [127218] ao-pg01-p.axadmin.netpengine: info: 
check_migration_threshold: fence_ao_pg02 can fail 88 more times on 
ao-pg01-p.axadmin.net before being forced off
...
...

May 17 20:52:33 [127215] ao-pg01-p.axadmin.net stonith-ng: info: 
st_child_term: Child 48496 timed out, sending SIGTERM
May 17 20:52:33 [127215] 

Re: [ClusterLabs] Re: Fencing questions

2019-05-09 Thread Lopez, Francisco Javier [Global IT]
Thx Klaus, appreciate your feedback.

Regards
Javier

Francisco Javier​   Lopez

IT System Engineer   |  Global IT

O: +34 619 728 249|  M: +34 619 728 
249|

franciscojavier.lo...@solera.com<mailto:franciscojavier.lo...@solera.com>   
 |  Solera.com<https://www.solera.com/>

Audatex Datos, S.A.  |  Avda. de Bruselas, 36, Salida 16, A‑1 
(Diversia),   Alcobendas  ,   Madrid  ,   28108   , 
  Spain


[cid:image339311.png@D1525B14.71452121]



On 5/9/2019 1:14 PM, Klaus Wenninger wrote:

On 5/9/19 1:03 PM, Lopez, Francisco Javier [Global IT] wrote:


Good day guys !

I'm implementing fencing in my two node cluster with this detail:

- fence_vmware_soap
- PostgreSql release 10.X
- CentOS 7.X

As far as I know, to create the resources, I can use two different ways:

- Create only one resource for both nodes, following this way:

  # pcs -f stonith_cfg stonith create fence_nodes \
  fence_vmware_soap \
  ipaddr= \
  ssl_insecure=1 \
  login="" \
  passwd="" \
  action=reboot \
  pcmk_host_list="node_01,node_02" \
  power_wait=3 op monitor interval=60s

- Create two different resources, one for each node, following this way:

  # pcs -f stonith_cfg stonith create fence_node_01 \
  fence_vmware_soap  \
  ipaddr= \
  ssl_insecure=1 \
  login="" \
  passwd="" \
  action=reboot \
  pcmk_host_list="node_01" \
  power_wait=3 op monitor interval=60s

  # pcs -f stonith_cfg stonith create fence_node_02 \
  fence_vmware_soap  \
  ipaddr= \
  ssl_insecure=1 \
  login="" \
  passwd="" \
  action=reboot \
  pcmk_host_list="node_02" \
  power_wait=3 op monitor interval=60s

  With this method I understand I will have to add some constraints, etc.

Said this, my questions are:

- Which are the differences among them ? (if any).
- Is there any recommended option and why ?


For your 2-node-cluster there might be a definite reason why you'd
like to have 2 fence-resources as to prevent fence-races when both
are still alive but just don't see each other. If you make one
wait a little longer to fence you can determine which one would
rather win.
If the aim is just to prevent that they fence each other at the same
time introduction of a random delay should be fine as well.
(That is pcmk_delay_max and if you want to further tailor the delay
with astatic delay component you can use pcmk_delay_base on top.)

Klaus



Regards
Javier
Francisco Javier​   Lopez

IT System Engineer   |  Global IT

O: *+34 619 728 249*  
 |
M: *+34 619 728 249*  
 |
*franciscojavier.lo...@solera.com*
<mailto:franciscojavier.lo...@solera.com><mailto:franciscojavier.lo...@solera.com>
   |  *Solera.com*
<https://nam01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.solera.com%2Fdata=01%7C01%7C%7C201d67c0bdaf4291190008d6d46f8b8e%7Cc45b48f313bb448b9356ba7b863c2189%7C1sdata=ccIbmo%2F%2FE7ifL7TAsFDqnJHEAl3pKEpbZDMwfuZ4CCo%3Dreserved=0><https://nam01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.solera.com%2Fdata=01%7C01%7C%7C201d67c0bdaf4291190008d6d46f8b8e%7Cc45b48f313bb448b9356ba7b863c2189%7C1sdata=ccIbmo%2F%2FE7ifL7TAsFDqnJHEAl3pKEpbZDMwfuZ4CCo%3Dreserved=0>

Audatex Datos, S.A.  |
Avda. de Bruselas, 36, Salida 16, A‑1 (Diversia),   Alcobendas  
,
Madrid  ,   28108   ,   Spain






" Este e-mail y sus archivos adjuntos son confidenciales y están
dirigidos exclusivamente a la(s) persona(s) destinataria prevista. Si
ha recibido este mensaje por error, por favor, notifique
inmediatamente al remitente y elimine este mensaje. La empresa no
firma contratos por e-mail y todas las negociaciones están sujetas a
la firma de un contrato por escrito.

This e-mail and any attached files are confidential and intended for
the named addressee(s) only. If you have received this message in
error, please notify the sender and delete the email immediately. The
company does not conclude contracts by email and all negotiations are
subject to written contract. "

___
Manage your subscription:
https://nam01.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.clusterlabs.org%2Fmailman%2Flistinfo%2Fusersdata=01%7C01%7C%7C201d67c0bdaf4291190008d6d46f8b8e%7Cc45b48f313bb448b9356ba7b863c2189%7C1sdata=Hgno43Tl4vCx5AEIwoZkT6Xq2AhzpKoaiQ9n%2FWcjGcQ%3Dreserved=0

ClusterLabs home: 
https://nam01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.clusterlabs.org%2Fdata=01%7C01%7C%7C201d67c0bdaf4291190008d6d46f8b8e%7Cc45b48f313bb448b9356ba7b863c2189%7C1sdata=cCoTTaMC8MnieDaur7KPQSKHlGKBaeWMM6kRu6SOzE4%3Dreserved=0



___
Manage your subscription:
https://nam01.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.cl

[ClusterLabs] Fencing questions

2019-05-09 Thread Lopez, Francisco Javier [Global IT]
Good day guys !

I'm implementing fencing in my two node cluster with this detail:

- fence_vmware_soap
- PostgreSql release 10.X
- CentOS 7.X

As far as I know, to create the resources, I can use two different ways:

- Create only one resource for both nodes, following this way:

  # pcs -f stonith_cfg stonith create fence_nodes \
  fence_vmware_soap \
  ipaddr= \
  ssl_insecure=1 \
  login="" \
  passwd="" \
  action=reboot \
  pcmk_host_list="node_01,node_02" \
  power_wait=3 op monitor interval=60s

- Create two different resources, one for each node, following this way:

  # pcs -f stonith_cfg stonith create fence_node_01 \
  fence_vmware_soap  \
  ipaddr= \
  ssl_insecure=1 \
  login="" \
  passwd="" \
  action=reboot \
  pcmk_host_list="node_01" \
  power_wait=3 op monitor interval=60s

  # pcs -f stonith_cfg stonith create fence_node_02 \
  fence_vmware_soap  \
  ipaddr= \
  ssl_insecure=1 \
  login="" \
  passwd="" \
  action=reboot \
  pcmk_host_list="node_02" \
  power_wait=3 op monitor interval=60s

  With this method I understand I will have to add some constraints, etc.

Said this, my questions are:

- Which are the differences among them ? (if any).
- Is there any recommended option and why ?

Regards
Javier
Francisco Javier​   Lopez

IT System Engineer   |  Global IT

O: +34 619 728 249|  M: +34 619 728 
249|

franciscojavier.lo...@solera.com   
 |  Solera.com

Audatex Datos, S.A.  |  Avda. de Bruselas, 36, Salida 16, A‑1 
(Diversia),   Alcobendas  ,   Madrid  ,   28108   , 
  Spain


[cid:image660500.png@443BEB68.0955AF5D]






" Este e-mail y sus archivos adjuntos son confidenciales y están dirigidos 
exclusivamente a la(s) persona(s) destinataria prevista. Si ha recibido este 
mensaje por error, por favor, notifique inmediatamente al remitente y elimine 
este mensaje. La empresa no firma contratos por e-mail y todas las 
negociaciones están sujetas a la firma de un contrato por escrito.

This e-mail and any attached files are confidential and intended for the named 
addressee(s) only. If you have received this message in error, please notify 
the sender and delete the email immediately. The company does not conclude 
contracts by email and all negotiations are subject to written contract. "
___
Manage your subscription:
https://lists.clusterlabs.org/mailman/listinfo/users

ClusterLabs home: https://www.clusterlabs.org/

[ClusterLabs] Fencing | Dummy questions

2019-03-13 Thread Lopez, Francisco Javier [Global IT]
Hello guys !

I'm dealing since some time ago with this configuration:

- Two node cluster.
- Vmware boxes.
- PostgreSql release 10.X: Master/Slave.

On top of this I've set up Pacemaker/Corosync and RA/PAF.

Now it's time to play with different fencing scenarios.
I'd like to know from more experienced people about this. Based on the 
configuration I have (or similar)
perhaps some of you can tell me, based on your experience, what kind of fencing 
sounds more appropriate,
either node or resource level and based on what.

Any quick reference would help me a lot.

I reviewed the threads and did not find any specific to this. I'd like to have 
this information to better
understand what agents to install and manage.

Thank you
Javier



Francisco Javier​   Lopez

IT System Engineer   |  Global IT

O: +34 619 728 249|  M: +34 619 728 
249|

franciscojavier.lo...@solera.com   
 |  Solera.com

Audatex Datos, S.A.  |  Avda. de Bruselas, 36, Salida 16, A‑1 
(Diversia),   Alcobendas  ,   Madrid  ,   28108   , 
  Spain


[cid:image309044.png@85A46B17.DD7465F2]






" Este e-mail y sus archivos adjuntos son confidenciales y están dirigidos 
exclusivamente a la(s) persona(s) destinataria prevista. Si ha recibido este 
mensaje por error, por favor, notifique inmediatamente al remitente y elimine 
este mensaje. La empresa no firma contratos por e-mail y todas las 
negociaciones están sujetas a la firma de un contrato por escrito.

This e-mail and any attached files are confidential and intended for the named 
addressee(s) only. If you have received this message in error, please notify 
the sender and delete the email immediately. The company does not conclude 
contracts by email and all negotiations are subject to written contract. "
___
Users mailing list: Users@clusterlabs.org
https://lists.clusterlabs.org/mailman/listinfo/users

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://bugs.clusterlabs.org


Re: [ClusterLabs] Re: How to generate RPMs for Pacemaker release 2.x on Centos

2018-10-17 Thread Lopez, Francisco Javier [Global IT]
Hello guys.

I finally created the RPMs for Pacemaker and Resource Agents. I will paste
in this thread the way to do that so it can help any other like me :-)

I need a final update, hopefully, from you guys about this issue ...

I need to know if there is any compatibility or certification matrix somewhere.
I'm asking this because as I'm creating the packages from source, I need to be
sure that Pacemaker, PCS, Corosync, the Agents, ... releases match. This would
guarantee that if we find an issue using the product, the problem is not a
compatibility among them.

These are the releases I have now:

- Pacemaker: 2.0.0, created by me.
- Resource Agents: 4.1.1, created by me.
- Corosync: corosynclib-2.4.3-2.el7_5.1.x86_64
corosync-2.4.3-2.el7_5.1.x86_64
  Installed from repos.
- PCS: pcs-0.9.162-5.el7.centos.1.x86_64
  Installed from repos.

As before, I appreciate all your help

Best regards


Francisco Javier​   Lopez

IT System Engineer   |  Global IT

O: +34 619 728 249|  M: +34 619 728 
249|

franciscojavier.lo...@solera.com<mailto:franciscojavier.lo...@solera.com>   
 |  Solera.com

Audatex Datos, S.A.  |  Avda. de Bruselas, 36, Salida 16, A‑1 
(Diversia),   Alcobendas  ,   Madrid  ,   28108   , 
  Spain


[cid:image336898.png@3855FCA3.73CA15EA]



On 15/10/18 16:42, Ken Gaillot wrote:

On Mon, 2018-10-15 at 14:37 +, Lopez, Francisco Javier [Global IT]
wrote:


Klaus/Ken.

Thx. for you reply.

The issue is ...

- I downloaded the source from GIT.
- Downloaded the OS required packages.
- Unzipped the source.
- ./autogen.sh + ./configure ---> OK
- Then, indeed, I tried: make rpm
  But I got thousands of errors:



Ah, I forgot it uses information from the repository. Rather than
download the source, you'd have to git clone the repository, and run
from there. By default you'll be in the latest master branch; if you
prefer to run a released version, you can check it out like "git
checkout Pacemaker-2.0.0".



$ make rpm
fatal: Not a git repository (or any of the parent directories): .git
fatal: Not a git repository (or any of the parent directories): .git
fatal: Not a git repository (or any of the parent directories): .git
fatal: Not a git repository (or any of the parent directories): .git
fatal: Not a git repository (or any of the parent directories): .git
fatal: Not a git repository (or any of the parent directories): .git
/bin/sh: -c: line 0: syntax error near unexpected token `Pacemaker-*'
/bin/sh: -c: line 0: `case  in Pacemaker-*) echo '' | cut -c11-;; *)
git log --pretty=format:%h -n 1 '';; esac'
...
...

What made me think, that downloading the source to the box I'm
testing could not be the
best, so I decided to ask the experts.

Best Regards

Francisco JavierLopez IT System Eng
ineer|  Global IT O: +34 619 728 249
 |  M: +34 619 728 249   |
franciscojavier.lo...@solera.com<mailto:franciscojavier.lo...@solera.com>   
 |  Solera.com
  Audatex Datos, S.A.|  Avda. de Bruselas, 36, Salida
 16, A‑1 (Diversia) ,   Alcobendas  ,   Madr
id  ,   28108   ,   Spain On
15/10/18 16:27, Ken Gaillot wrote:


On Mon, 2018-10-15 at 14:39 +0200, Klaus Wenninger wrote:


On 10/15/2018 01:52 PM, Lopez, Francisco Javier [Global IT]
wrote:


Hello guys !

We are planning to use Pacemaker as a base HA Software in our
Company.

Our requirements will be:

- Centos-7
- PostgreSql-10

We did several tests with Pacemaker release 1.1.8 and fixed the
problems found with
the RA. We finally created new RPMs from source (4.x).

Now we want to test Pacemaker release 2.x but, as we will have
to
create some clusters,
we want to create new RPMs for this release instead of doing
manual
installation on
each new box. As per what I see the RPMs for our Centos have
not
been created yet.

We could run 'autogen' + 'configure' but I do not find the way
to
generate the RPMs.
Anyone could share with me the correct paths to do this please
?




The spec-file found in the pacemaker-github-repo should work
straight
forward
using mock to build against the repos of your Centos-Version.
Just check that you are on current corosync, libqb, knet, ...
Pacemaker 2 seems to build well against the packages coming with
Centos 7.5.
Maybe others can comment on how advisable it is running that
combo
though.

Klaus



Also, there is a convenient target for building RPMs from the spec
file, you can just run "make rpm" (after autogen.sh + configure).



Perhaps there are some steps written somewhere and I did not
find
them out ...

Appreciate your help.

Regards
Javier
Francisco JavierLopez IT Syst
em E
ngineer  |  Global IT O: +34 619 728 249

 |  M: +34 619 728 249   |
franciscojavier.lo...@solera.com<mailto:franciscojavier.lo...@solera.com>   
 |  Solera.com

Re: [ClusterLabs] Re: How to generate RPMs for Pacemaker release 2.x on Centos

2018-10-15 Thread Lopez, Francisco Javier [Global IT]
He he.

Sounds like that will do the trick then.

I could not do that way as this box does not have access to Internet.
Will see how to deal with this.

Guys I really appreciate your replies and help.

Cheers

Francisco Javier​   Lopez

IT System Engineer   |  Global IT

O: +34 619 728 249|  M: +34 619 728 
249|

franciscojavier.lo...@solera.com<mailto:franciscojavier.lo...@solera.com>   
 |  Solera.com

Audatex Datos, S.A.  |  Avda. de Bruselas, 36, Salida 16, A‑1 
(Diversia),   Alcobendas  ,   Madrid  ,   28108   , 
  Spain


[cid:image090871.png@C4C98636.2963693A]



On 15/10/18 16:42, Ken Gaillot wrote:

On Mon, 2018-10-15 at 14:37 +, Lopez, Francisco Javier [Global IT]
wrote:


Klaus/Ken.

Thx. for you reply.

The issue is ...

- I downloaded the source from GIT.
- Downloaded the OS required packages.
- Unzipped the source.
- ./autogen.sh + ./configure ---> OK
- Then, indeed, I tried: make rpm
  But I got thousands of errors:



Ah, I forgot it uses information from the repository. Rather than
download the source, you'd have to git clone the repository, and run
from there. By default you'll be in the latest master branch; if you
prefer to run a released version, you can check it out like "git
checkout Pacemaker-2.0.0".



$ make rpm
fatal: Not a git repository (or any of the parent directories): .git
fatal: Not a git repository (or any of the parent directories): .git
fatal: Not a git repository (or any of the parent directories): .git
fatal: Not a git repository (or any of the parent directories): .git
fatal: Not a git repository (or any of the parent directories): .git
fatal: Not a git repository (or any of the parent directories): .git
/bin/sh: -c: line 0: syntax error near unexpected token `Pacemaker-*'
/bin/sh: -c: line 0: `case  in Pacemaker-*) echo '' | cut -c11-;; *)
git log --pretty=format:%h -n 1 '';; esac'
...
...

What made me think, that downloading the source to the box I'm
testing could not be the
best, so I decided to ask the experts.

Best Regards

Francisco JavierLopez IT System Eng
ineer|  Global IT O: +34 619 728 249
 |  M: +34 619 728 249   |
franciscojavier.lo...@solera.com<mailto:franciscojavier.lo...@solera.com>   
 |  Solera.com
  Audatex Datos, S.A.|  Avda. de Bruselas, 36, Salida
 16, A‑1 (Diversia) ,   Alcobendas  ,   Madr
id  ,   28108   ,   Spain On
15/10/18 16:27, Ken Gaillot wrote:


On Mon, 2018-10-15 at 14:39 +0200, Klaus Wenninger wrote:


On 10/15/2018 01:52 PM, Lopez, Francisco Javier [Global IT]
wrote:


Hello guys !

We are planning to use Pacemaker as a base HA Software in our
Company.

Our requirements will be:

- Centos-7
- PostgreSql-10

We did several tests with Pacemaker release 1.1.8 and fixed the
problems found with
the RA. We finally created new RPMs from source (4.x).

Now we want to test Pacemaker release 2.x but, as we will have
to
create some clusters,
we want to create new RPMs for this release instead of doing
manual
installation on
each new box. As per what I see the RPMs for our Centos have
not
been created yet.

We could run 'autogen' + 'configure' but I do not find the way
to
generate the RPMs.
Anyone could share with me the correct paths to do this please
?




The spec-file found in the pacemaker-github-repo should work
straight
forward
using mock to build against the repos of your Centos-Version.
Just check that you are on current corosync, libqb, knet, ...
Pacemaker 2 seems to build well against the packages coming with
Centos 7.5.
Maybe others can comment on how advisable it is running that
combo
though.

Klaus



Also, there is a convenient target for building RPMs from the spec
file, you can just run "make rpm" (after autogen.sh + configure).



Perhaps there are some steps written somewhere and I did not
find
them out ...

Appreciate your help.

Regards
Javier
Francisco JavierLopez IT Syst
em E
ngineer  |  Global IT O: +34 619 728 249

 |  M: +34 619 728 249   |
franciscojavier.lo...@solera.com<mailto:franciscojavier.lo...@solera.com>   
 |  Solera.com
  Audatex Datos, S.A.|  Avda. de Bruselas, 36,
Sali
da 16, A‑1 (Diversia)   ,   Alcobendas  ,
Madrid  ,   28108   ,   Spain


"This e-mail, any associated files and the information
contained in
them are confidential and is intended for the addressee(s)
only. If
you have received this message in error please notify the
originator and delete the email immediately. The unauthorised
use,
disclosure, copying or alteration of this message is strictly
forbidden. E-mails to and from the company are monitored for
operational reasons and in accordance with lawful business
practices. Any opinions expressed are those of the individual
and
do not necessarily represent the views

Re: [ClusterLabs] Re: How to generate RPMs for Pacemaker release 2.x on Centos

2018-10-15 Thread Lopez, Francisco Javier [Global IT]
Klaus/Ken.

Thx. for you reply.

The issue is ...

- I downloaded the source from GIT.
- Downloaded the OS required packages.
- Unzipped the source.
- ./autogen.sh + ./configure ---> OK
- Then, indeed, I tried: make rpm
  But I got thousands of errors:

$ make rpm
fatal: Not a git repository (or any of the parent directories): .git
fatal: Not a git repository (or any of the parent directories): .git
fatal: Not a git repository (or any of the parent directories): .git
fatal: Not a git repository (or any of the parent directories): .git
fatal: Not a git repository (or any of the parent directories): .git
fatal: Not a git repository (or any of the parent directories): .git
/bin/sh: -c: line 0: syntax error near unexpected token `Pacemaker-*'
/bin/sh: -c: line 0: `case  in Pacemaker-*) echo '' | cut -c11-;; *) git log 
--pretty=format:%h -n 1 '';; esac'
...
...

What made me think, that downloading the source to the box I'm testing could 
not be the
best, so I decided to ask the experts.

Best Regards

Francisco Javier​   Lopez

IT System Engineer   |  Global IT

O: +34 619 728 249|  M: +34 619 728 
249|

franciscojavier.lo...@solera.com<mailto:franciscojavier.lo...@solera.com>   
 |  Solera.com

Audatex Datos, S.A.  |  Avda. de Bruselas, 36, Salida 16, A‑1 
(Diversia),   Alcobendas  ,   Madrid  ,   28108   , 
  Spain


[cid:image824429.png@9010732B.855BF16B]



On 15/10/18 16:27, Ken Gaillot wrote:

On Mon, 2018-10-15 at 14:39 +0200, Klaus Wenninger wrote:


On 10/15/2018 01:52 PM, Lopez, Francisco Javier [Global IT] wrote:


Hello guys !

We are planning to use Pacemaker as a base HA Software in our
Company.

Our requirements will be:

- Centos-7
- PostgreSql-10

We did several tests with Pacemaker release 1.1.8 and fixed the
problems found with
the RA. We finally created new RPMs from source (4.x).

Now we want to test Pacemaker release 2.x but, as we will have to
create some clusters,
we want to create new RPMs for this release instead of doing manual
installation on
each new box. As per what I see the RPMs for our Centos have not
been created yet.

We could run 'autogen' + 'configure' but I do not find the way to
generate the RPMs.
Anyone could share with me the correct paths to do this please ?



The spec-file found in the pacemaker-github-repo should work straight
forward
using mock to build against the repos of your Centos-Version.
Just check that you are on current corosync, libqb, knet, ...
Pacemaker 2 seems to build well against the packages coming with
Centos 7.5.
Maybe others can comment on how advisable it is running that combo
though.

Klaus



Also, there is a convenient target for building RPMs from the spec
file, you can just run "make rpm" (after autogen.sh + configure).






Perhaps there are some steps written somewhere and I did not find
them out ...

Appreciate your help.

Regards
Javier
Francisco JavierLopez IT System E
ngineer  |  Global IT O: +34 619 728 249
 |  M: +34 619 728 249   |
franciscojavier.lo...@solera.com<mailto:franciscojavier.lo...@solera.com>   
 |  Solera.com
  Audatex Datos, S.A.|  Avda. de Bruselas, 36, Sali
da 16, A‑1 (Diversia)   ,   Alcobendas  ,
Madrid  ,   28108   ,   Spain


"This e-mail, any associated files and the information contained in
them are confidential and is intended for the addressee(s) only. If
you have received this message in error please notify the
originator and delete the email immediately. The unauthorised use,
disclosure, copying or alteration of this message is strictly
forbidden. E-mails to and from the company are monitored for
operational reasons and in accordance with lawful business
practices. Any opinions expressed are those of the individual and
do not necessarily represent the views of the company. The company
does not conclude contracts by email and all negotiations are
subject to contract. We make every effort to maintain our network
free from computer viruses but accept no responsibility for any
viruses which might be transferred by this e-mail."


___
Users mailing list: Users@clusterlabs.org<mailto:Users@clusterlabs.org>
https://na01.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.clusterlabs.org%2Fmailman%2Flistinfo%2Fusersdata=01%7C01%7C%7C83b5c8abd9ff48080f2008d632aa4a43%7Cc45b48f313bb448b9356ba7b863c2189%7C1sdata=W%2FtkThOis%2FEkM5maP7dVWggwYg8cOR3%2FAJjAn1Jxaf0%3Dreserved=0

Project Home: 
https://na01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.clusterlabs.orgdata=01%7C01%7C%7C83b5c8abd9ff48080f2008d632aa4a43%7Cc45b48f313bb448b9356ba7b863c2189%7C1sdata=X6bGa81egPcfwyAZvo6Ce4cH2PfQXvvQlNo6KPWXxvg%3Dreserved=0
Getting started: 
https://na01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.clusterlabs.org%2Fdoc%2FCluster_from_Scratcdata=01%7

[ClusterLabs] How to generate RPMs for Pacemaker release 2.x on Centos

2018-10-15 Thread Lopez, Francisco Javier [Global IT]
Hello guys !

We are planning to use Pacemaker as a base HA Software in our Company.

Our requirements will be:

- Centos-7
- PostgreSql-10

We did several tests with Pacemaker release 1.1.8 and fixed the problems found 
with
the RA. We finally created new RPMs from source (4.x).

Now we want to test Pacemaker release 2.x but, as we will have to create some 
clusters,
we want to create new RPMs for this release instead of doing manual 
installation on
each new box. As per what I see the RPMs for our Centos have not been created 
yet.

We could run 'autogen' + 'configure' but I do not find the way to generate the 
RPMs.
Anyone could share with me the correct paths to do this please ?
Perhaps there are some steps written somewhere and I did not find them out ...

Appreciate your help.

Regards
Javier
Francisco Javier​   Lopez

IT System Engineer   |  Global IT

O: +34 619 728 249|  M: +34 619 728 
249|

franciscojavier.lo...@solera.com   
 |  Solera.com

Audatex Datos, S.A.  |  Avda. de Bruselas, 36, Salida 16, A‑1 
(Diversia),   Alcobendas  ,   Madrid  ,   28108   , 
  Spain


[cid:image413103.png@0EB14876.AE1A8FF9]






"This e-mail, any associated files and the information contained in them are 
confidential and is intended for the addressee(s) only. If you have received 
this message in error please notify the originator and delete the email 
immediately. The unauthorised use, disclosure, copying or alteration of this 
message is strictly forbidden. E-mails to and from the company are monitored 
for operational reasons and in accordance with lawful business practices. Any 
opinions expressed are those of the individual and do not necessarily represent 
the views of the company. The company does not conclude contracts by email and 
all negotiations are subject to contract. We make every effort to maintain our 
network free from computer viruses but accept no responsibility for any viruses 
which might be transferred by this e-mail."
___
Users mailing list: Users@clusterlabs.org
https://lists.clusterlabs.org/mailman/listinfo/users

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://bugs.clusterlabs.org