Re: [ClusterLabs] resource-agents v4.2.0

2018-10-24 Thread Kristoffer Grönlund
On Wed, 2018-10-24 at 10:21 +0200, Oyvind Albrigtsen wrote:
> ClusterLabs is happy to announce resource-agents v4.2.0.
> Source code is available at:
> https://github.com/ClusterLabs/resource-agents/releases/tag/v4.2.0
> 

[snip]

>   - ocf.py: new Python library and dev guide
> 

I just wanted to highlight the Python library since I think it can make
agent development a lot easier in the future, especially as we expand
the library with more utilities that are commonly needed when writing
agents.

Any agents written in Python should (for now at least) be compatible
both with Python 2.7+ and Python 3.3+. We still need to expand the CI
to actually verify that agents do support these versions, so anyone who
would like to help out improving the test setup is more than welcome to
do so :)

The biggest example of an agent using it that we have now is the azure-
events agent [1], so I would recommend anyone interested in working on
new agents to take a look at that. For a more compact example, I wrote
a version of the Dummy resource agent using the ocf.py library and put
it in a gist [2], and then there is a small example in the document
describing the library and how to use it [3].

[1]: https://github.com/ClusterLabs/resource-agents/blob/master/heartbe
at/azure-events.in
[2]: https://gist.github.com/krig/6676d0ae065fd852fac8b445410e1c95
[3]: https://github.com/ClusterLabs/resource-agents/blob/master/doc/dev
-guides/writing-python-agents.md

Cheers,
Kristoffer

___
Users mailing list: Users@clusterlabs.org
https://lists.clusterlabs.org/mailman/listinfo/users

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://bugs.clusterlabs.org


[ClusterLabs] resource-agents v4.2.0

2018-10-24 Thread Oyvind Albrigtsen

ClusterLabs is happy to announce resource-agents v4.2.0.
Source code is available at:
https://github.com/ClusterLabs/resource-agents/releases/tag/v4.2.0

The most significant enhancements in this release are:
- new resource agents:
 - azure-events
 - aliyun-vpc-move-ip
 - gcp-pd-move
 - gcp-vpc-move-ip
 - gcp-vpc-move-route (improved Python version of gcp-vpc-move-ip)
 - gcp-vpc-move-vip
 - openstack-cinder-volume
 - openstack-floating-ip
 - openstack-info
 - podman
 - sybaseASE

- bugfixes and enhancements:
 - apache: fix return value for silent_status (introduced in the
   retry PID check mentioned below)
 - rabbitmq-cluster: fail monitor when node is in minority partition
 - ZFS: check status without locks when possible
 - pgsql: fix to avoid failing on masters re-promote
 - CI: fixes for bash path, strncpy in GCC 8 and missing docbook-style-xsl
 - CTDB: fix "ctdb_recovery_lock" validation
 - CTDB: fix incorrect DB corruption reports (ensure health check is run)
 - Filesystem: support symlink as mountpoint directory
 - IPaddr2: return OCF_ERR_GENERIC when failing due to IPv4 address collision
 - LVM-activate: fix for dashes in volume group and logical volume names
 - LVM-activate: read parameters for stop-action
 - LVM-activate: return OCF_ERR_CONFIGURED for incorrect vg_access_mode
 - LVM: added missing dash for activation parameters
 - SAPDatabase: add info to meta-data
 - SAPInstance: add monitored services for ENSA2 (bsc#1092384)
 - SAPInstance: implement reload action to avoid resource restarts after a
   non-unique parameter has been changed
 - SAPInstance: improve SAP instance profile detection
 - SAPInstance: improve stop-action logging
 - Squid: use ss if netstat is not available
 - VirtualDomain: add stateless support
 - VirtualDomain: correctly create logfile and set permissions
 - Xen: add utilization support for cpu and hv_memory
 - apache: retry PID check
 - aws-vpc-move-ip: check routing table during monitor probe action
 - aws-vpc-move-ip: fix backward-compatibility
 - aws-vpc-move-ip: use ip utility to check address
 - awseip: fix allocation_id not found error
 - awseip: update required IAM role permissions
 - awsvip: get network-id from metadata
 - awsvip: improve secondary-private-ip query
 - configure: add Python path detection
 - exportfs: fix square bracket stripping in clientspec
 - findif: improve IPv6 NIC detection
 - findif: only match lines containing netmasks (for newer iputils)
 - garbd: support netstat and ss
 - iSCSITarget: support CHAP authentication for lio-t
 - ipsec: add tunnel fallback option
 - ldirectord: add manpage to systemd unit file
 - lvmlockd: add cmirrord support
 - mysql: remove obsolete DEBUG_LOG functionality (bsc#1021689)
 - nfsserver: fix rpcpipefs_dir and nfs_shared_infodir issues
 - ocf-binaries: use SSH-path detected by configure
 - ocf.py: new Python library and dev guide
 - oracle: improve dbopen error
 - pgsql: create replication slots after promoting master
 - pgsql: dont change ownership of /dev/null
 - pgsql: support PostgreSQL 11 or later
 - portblock: support ss and netstat (partial)
 - ra-dev-guide: update instructions for GitHub
 - rabbitmq-cluster: get cluster status from mnesia during monitor
 - rabbitmq-cluster: retry start when cluster join fails
 - redis: do not use absolute path in pidof calls
 - sg_persist: correctly pickup old keys
 - syslog-ng: add Premium Edition 6 and 7 support
 - systemd-tmpfiles: configure path with --with-rsctmpdir

The full list of changes for resource-agents is available at:
https://github.com/ClusterLabs/resource-agents/blob/v4.2.0/ChangeLog

Everyone is encouraged to download and test the new release candidate.
We do many regression tests and simulations, but we can't cover all
possible use cases, so your feedback is important and appreciated.

Many thanks to all the contributors to this release.


Best,
The resource-agents maintainers
___
Users mailing list: Users@clusterlabs.org
https://lists.clusterlabs.org/mailman/listinfo/users

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://bugs.clusterlabs.org


[ClusterLabs] Antw: About the Pacemaker

2018-10-24 Thread Ulrich Windl
>>> "T. Ladd Omar"  schrieb am 23.10.2018 um 15:06 in 
>>> Nachricht
:
> Hi all, I send this message to get some answers for my questions about
> Pacemaker.
> 1. In order to cleanup start-failed resources automatically, I add
> failure-timeout attribute for resources, however, the common way to trigger
> the recovery is by cluster-recheck whose interval is 15min by default. I
> wonder how lower value could I set for the cluster-recheck-interval. I had
> to let the failed resources recover somewhat quickly while little impact
> taken by the more frequent cluster-recheck.

I think if your agents fail periodically, and you need to do a periodic cleanup 
of failed actions, your configuration is not stable enough for production. 
Also, if you always cleanup failed actions, resources may not move to a good 
node of one node has a problem.

> Or, is there another way to automatically cleanup start-failed resources ?
> 2. Is Pacemaker suitable for the Master-Slave model HA ? I had some
> productive problems when I use Pacemaker. If only one resource stopped on
> one node, should I failover all this node for the whole cluster? If not,
> the transactions from the ports on this node may fail for this failure. If
> yes, it seems to be big action for just one resource failure.




___
Users mailing list: Users@clusterlabs.org
https://lists.clusterlabs.org/mailman/listinfo/users

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://bugs.clusterlabs.org


Re: [ClusterLabs] VirtualBox : an improved OCF agent

2018-10-24 Thread Ioannis Ioannou

Hello again,


The original script had been posted on VB's forum and it is at Howtos there.

I subscribed, tried to reach the original author but got no response so far.

For me GPL (eg v2) is ok.

Cheers

John


On 5/10/2018 15:00, users-requ...@clusterlabs.org wrote:

Date: Thu, 04 Oct 2018 09:19:55 -0500 From: Ken Gaillot 
 To: Cluster Labs - All topics related to 
open-source clustering welcomed  Subject: Re: 
[ClusterLabs] VirtualBox : an improved OCF agent Message-ID: 
<1538662795.4624.6.ca...@redhat.com> Content-Type: text/plain; 
charset="UTF-8" On Thu, 2018-10-04 at 13:35 +0300, Ioannis Ioannou wrote:

Hello to all,

This is the first time I use HA on Linux. I'm still learning, and I
like?
it a lot.

I was interested to a active/passive system that can support virtual?
machines with live migration. For various reason I prefer VirtualBox
as?
the hypervisor.

I used an existing agent that I found here:?
https://forums.virtualbox.org/viewtopic.php?f=7&t=35372, improved it
and?
I would like to share it.

It is now more robust and ocf compliant and supports live migration
(aka?
teleportation in VBox's terminology). I'm testing it one week now on?
Debian Stretch and seems stable enough for production.


The assumptions in order to be used:

1) Both nodes should mount the same filesystem, the same time (ie
iscsi,?
cluster, nfs, and so on). I used glusterfs and works like a charm. I
put?
VM settings and images on it.

2) Both nodes should have reasonable similar hardware - close enough
so?
that teleportation can work - check VirtualBox's documentation
regarding?
teleportation for this.

3) Both VMs should have the same settings- if you put the settings
on?
the common filesystem as I did, then it is easy: create the VM on
one?
node, just add it on the other.

4) SSH keys should be exchanged between nodes. You see, migrate_to?
should be able to handle both nodes in order live migration?
(teleporting) to work.

5) Teleportation works by putting the receiving node listening on a
port?
- by default 6000. Ensure that the port is correct and accessible
from?
the other node.


Any comments are more than welcomed


Cheers

John (aka Ioannis)

That's awesome. If you can find the original author and get agreement
to put it under an open source license, it would be a great addition to
the resource-agents package.
-- Ken Gaillot 



___
Users mailing list: Users@clusterlabs.org
https://lists.clusterlabs.org/mailman/listinfo/users

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://bugs.clusterlabs.org