Re: [ClusterLabs] Adding and removing a node dyamically

2015-10-02 Thread Ken Gaillot
On 10/02/2015 05:36 AM, Vijay Partha wrote:
> could someone help me out with this please? i am making use of cman and
> pacemaker. pcs cluster node add/remove  is not working as it throws
> pcsd service is not running on .

pcs relies on pcsd running on all nodes.

Make sure pcs is installed on all nodes, and pcsd is enabled to start at
boot (via service or systemctl depending on which you are using). Then
set a password (same on all nodes) for the hacluster user. Finally, run
"pcs cluster auth " on the machine you want to run pcs from,
and give it the hacluster user/pass.

You may want to review the Clusters From Scratch documentation to see
other common configuration that needs to be done (firewall, SELinux,
hostnames, etc.):

http://clusterlabs.org/doc/en-US/Pacemaker/1.1-pcs/html-single/Clusters_from_Scratch/index.html


> On Fri, Oct 2, 2015 at 1:17 PM, Vijay Partha 
> wrote:
> 
>> Hi,
>>
>> I would like to add and remove a node dynamically in pacemaker. What
>> commands are to be given for this to be done.
>>
>> Thanking you
>>
>> --
>> With Regards
>> P.Vijay
>>
> 
> 
> 
> 
> 
> ___
> Users mailing list: Users@clusterlabs.org
> http://clusterlabs.org/mailman/listinfo/users
> 
> Project Home: http://www.clusterlabs.org
> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> Bugs: http://bugs.clusterlabs.org
> 


___
Users mailing list: Users@clusterlabs.org
http://clusterlabs.org/mailman/listinfo/users

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://bugs.clusterlabs.org


Re: [ClusterLabs] Adding and removing a node dyamically

2015-10-02 Thread Sven Moeller
Hi,

what do mean with add or remove? Do you want to remove a node from a cluster 
completely, not being a cluster member any more? Or do you want to remove it 
just for maintenance temporarely?

Regards,
sven

Am 02.10.2015 09:47 schrieb Vijay Partha :
>
> Hi,
>  
> I would like to add and remove a node dynamically in pacemaker. What commands 
> are to be given for this to be done.
>  
> Thanking you
>
> -- 
> With Regards
> P.Vijay
___
Users mailing list: Users@clusterlabs.org
http://clusterlabs.org/mailman/listinfo/users

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://bugs.clusterlabs.org


[ClusterLabs] Current DC becomes None suddenly

2015-10-02 Thread Pritam Kharat
Hi,

I have set up a ACTIVE/PASSIVE HA

*Issue 1) *

*corosync.conf*  file is

# Please read the openais.conf.5 manual page

totem {

version: 2

# How long before declaring a token lost (ms)
token: 1

# How many token retransmits before forming a new configuration
token_retransmits_before_loss_const: 20

# How long to wait for join messages in the membership protocol (ms)
join: 1

# How long to wait for consensus to be achieved before starting a
new round of membership configuration (ms)
consensus: 12000

# Turn off the virtual synchrony filter
vsftype: none

# Number of messages that may be sent by one processor on receipt
of the token
max_messages: 20

# Limit generated nodeids to 31-bits (positive signed integers)
clear_node_high_bit: yes

# Disable encryption
secauth: off

# How many threads to use for encryption/decryption
threads: 0

# Optionally assign a fixed node id (integer)
# nodeid: 1234

# This specifies the mode of redundant ring, which may be none,
active, or passive.
rrp_mode: none
interface {
# The following values need to be set based on your
environment
ringnumber: 0
bindnetaddr: 192.168.101.0
mcastport: 5405
}

transport: udpu
}

amf {
mode: disabled
}

quorum {
# Quorum for the Pacemaker Cluster Resource Manager
provider: corosync_votequorum
expected_votes: 1
}


nodelist {

node {
ring0_addr: 192.168.101.73
}

node {
ring0_addr: 192.168.101.74
}
}

aisexec {
user:   root
group:  root
}


logging {
fileline: off
to_stderr: yes
to_logfile: yes
to_syslog: yes
syslog_facility: daemon
logfile: /var/log/corosync/corosync.log
debug: off
timestamp: on
logger_subsys {
subsys: AMF
debug: off
tags: enter|leave|trace1|trace2|trace3|trace4|trace6
}
}

And I have added 5 resources - 1 is VIP and 4 are upstart jobs
Node names are configured as -> sc-node-1(ACTIVE) and sc-node-2(PASSIVE)
Resources are running on ACTIVE node

Default cluster properties -

  






  


But sometimes after 2-3 migrations from ACTIVE to STANDBY and then from
STANDBY to ACTIVE,
both nodes become OFFLINE and Current DC becomes None, I have disabled the
stonith property and even quorum is ignored

root@sc-node-2:/usr/lib/python2.7/dist-packages/sc# crm status
Last updated: Sat Oct  3 00:01:40 2015
Last change: Fri Oct  2 23:38:28 2015 via crm_resource on sc-node-1
Stack: corosync
Current DC: NONE
2 Nodes configured
5 Resources configured

OFFLINE: [ sc-node-1 sc-node-2 ]

What is going wrong here ? What is the reason for node Current DC becoming
None suddenly ? Is corosync.conf okay ? Are default cluster properties fine
? Help will be appreciated.


*Issue 2)*
Command used to add upstart job is

crm configure primitive service upstart:service meta allow-migrate=true
migration-threshold=5 failure-timeout=30s op monitor interval=15s
 timeout=60s

But still sometimes I see fail count going to INFINITY. Why ? How can we
avoid it ? Resource should have migrated as soon as it reaches migration
threshold.

* Node sc-node-2:
   service: migration-threshold=5 fail-count=100 last-failure='Fri Oct
 2 23:38:53 2015'
   service1: migration-threshold=5 fail-count=100 last-failure='Fri Oct
 2 23:38:53 2015'

Failed actions:
service_start_0 (node=sc-node-2, call=-1, rc=1, status=Timed Out,
last-rc-change=Fri Oct  2 23:38:53 2015
, queued=0ms, exec=0ms
): unknown error
service1_start_0 (node=sc-node-2, call=-1, rc=1, status=Timed Out,
last-rc-change=Fri Oct  2 23:38:53 2015
, queued=0ms, exec=0ms




-- 
Thanks and Regards,
Pritam Kharat.
___
Users mailing list: Users@clusterlabs.org
http://clusterlabs.org/mailman/listinfo/users

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://bugs.clusterlabs.org


[ClusterLabs] Adding and removing a node dyamically

2015-10-02 Thread Vijay Partha
Hi,

I would like to add and remove a node dynamically in pacemaker. What
commands are to be given for this to be done.

Thanking you

-- 
With Regards
P.Vijay
___
Users mailing list: Users@clusterlabs.org
http://clusterlabs.org/mailman/listinfo/users

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://bugs.clusterlabs.org