[tickets] [opensaf:tickets] #1867 HEADLESS : Payloads went for reboot, in headless state as CPSV got TIMEOUT rc for CLM API

2017-10-24 Thread Alex Jones via Opensaf-tickets
- **Component**: unknown --> ckpt
- **Blocker**:  --> False



---

** [tickets:#1867] HEADLESS : Payloads went for reboot, in headless state as 
CPSV got TIMEOUT rc for CLM API**

**Status:** unassigned
**Milestone:** future
**Created:** Wed Jun 08, 2016 10:54 AM UTC by Srikanth R
**Last Updated:** Tue Sep 20, 2016 05:44 PM UTC
**Owner:** nobody


Version: OpenSAF 5.0 GA
Setup: Two payloads with three controllers.

 Steps performed:
 
 -> Initially all the nodes are part of the cluster.
 -> Induced failover by bringing down the active, standby and spare controllers, in that order.
Aug  7 20:30:08 SCALE_SLOT-94 kernel: [5993776.936794] TIPC: Lost contact with <1.1.1>
Aug  7 20:30:08 SCALE_SLOT-94 osafimmnd[2748]: NO Sleep done registering IMMND with MDS
Aug  7 20:30:08 SCALE_SLOT-94 osafimmnd[2748]: NO MDS: mds_register_callback: dest 2040fa5bb6016 already exist
Aug  7 20:30:08 SCALE_SLOT-94 osafimmnd[2748]: NO SUCCESS IN REGISTERING IMMND WITH MDS
Aug  7 20:30:08 SCALE_SLOT-94 osafimmnd[2748]: NO Re-introduce-me highestProcessed:6859 highestReceived:6859
Aug  7 20:30:13 SCALE_SLOT-94 osafimmnd[2748]: WA MDS Send Failed to service:IMMD rc:2
Aug  7 20:30:14 SCALE_SLOT-94 osafamfnd[2767]: WA AMF director unexpectedly crashed

 -> On both payloads, CKPTND restarted with the following error in syslog.
 
Aug  7 20:30:17 SCALE_SLOT-94 osafckptnd[2787]: ER cpnd clm node get failed with return value:5
Aug  7 20:30:17 SCALE_SLOT-94 osafamfnd[2767]: NO 'safComp=CPND,safSu=PL-4,safSg=NoRed,safApp=OpenSAF' faulted due to 'avaDown' : Recovery is 'componentRestart'
Aug  7 20:30:17 SCALE_SLOT-94 osafckptnd[14434]: Started

-> But CKPTND instantiation failed, and the node finally went for reboot.

Aug  7 20:30:27 SCALE_SLOT-94 osafimmnd[2748]: NO Re-introduce-me highestProcessed:6859 highestReceived:6859
Aug  7 20:30:27 SCALE_SLOT-94 osafimmnd[2748]: WA MDS Send Failed to service:IMMD rc:2
Aug  7 20:30:27 SCALE_SLOT-94 osafamfnd[2767]: NO Instantiation of 'safComp=CPND,safSu=PL-4,safSg=NoRed,safApp=OpenSAF' failed
Aug  7 20:30:27 SCALE_SLOT-94 osafamfnd[2767]: NO Reason: component registration timer expired
Aug  7 20:30:27 SCALE_SLOT-94 osafckptnd[14451]: Started
...

Aug  7 20:30:38 SCALE_SLOT-94 osafamfnd[2767]: NO Instantiation of 'safComp=CPND,safSu=PL-4,safSg=NoRed,safApp=OpenSAF' failed
Aug  7 20:30:38 SCALE_SLOT-94 osafamfnd[2767]: NO Reason: component registration timer expired
Aug  7 20:30:38 SCALE_SLOT-94 osafimmnd[2748]: NO Re-introduce-me highestProcessed:6859 highestReceived:6859
Aug  7 20:30:38 SCALE_SLOT-94 osafimmnd[2748]: WA MDS Send Failed to service:IMMD rc:2
Aug  7 20:30:38 SCALE_SLOT-94 osafamfnd[2767]: WA 'safComp=CPND,safSu=PL-4,safSg=NoRed,safApp=OpenSAF' Presence State RESTARTING => INSTANTIATION_FAILED
Aug  7 20:30:38 SCALE_SLOT-94 osafamfnd[2767]: NO avnd_di_oper_send() deferred as AMF director is offline
Aug  7 20:30:38 SCALE_SLOT-94 osafamfnd[2767]: WA Director is down. Remove all SIs from 'safSu=PL-4,safSg=NoRed,safApp=OpenSAF'
Aug  7 20:30:38 SCALE_SLOT-94 osafamfnd[2767]: NO Component Failover trigerred for 'safSu=PL-4,safSg=NoRed,safApp=OpenSAF': Failed component: 'safComp=CPND,safSu=PL-4,safSg=NoRed,safApp=OpenSAF'
Aug  7 20:30:38 SCALE_SLOT-94 osafamfnd[2767]: ER 'safComp=CPND,safSu=PL-4,safSg=NoRed,safApp=OpenSAF'got Inst failed
Aug  7 20:30:38 SCALE_SLOT-94 osafamfnd[2767]: Rebooting OpenSAF NodeId = 132111 EE Name = , Reason: NCS component Instantiation failed, OwnNodeId = 132111, SupervisionTime = 60



---

Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is 
subscribed to https://sourceforge.net/p/opensaf/tickets/

To unsubscribe from further messages, a project admin can change settings at 
https://sourceforge.net/p/opensaf/admin/tickets/options.  Or, if this is a 
mailing list, you can unsubscribe from the mailing list.--
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot___
Opensaf-tickets mailing list
Opensaf-tickets@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets


[tickets] [opensaf:tickets] #1782 PLM: entities can get permanently stuck in shutting-down state

2017-10-24 Thread Alex Jones via Opensaf-tickets
Oct 24 20:01:11.882559 osafplmd [19844:plms_adm_fsm.c:4826] >> plms_imm_adm_op_req_process: Entity: safHE=Switch_2,safDomain=Q50chassis. Admin_op id: 1
Oct 24 20:01:11.882570 osafplmd [19844:plms_adm_fsm.c:0840] >> plms_HE_adm_shutdown_state_lock_op: Entity: safHE=Switch_2,safDomain=Q50chassis. Operation: shutdown to lock.
Oct 24 20:01:11.882576 osafplmd [19844:plms_adm_fsm.c:3735] >> plms_ent_shutdown_to_lock: Entity: safHE=Switch_2,safDomain=Q50chassis. shutdown to lock
Oct 24 20:01:11.882608 osafplmd [19844:plms_adm_fsm.c:3838] ER Ent safHE=Switch_2,safDomain=Q50chassis is not in admin operation context
   but still its admin state is shutting down==>Parent
   or deps in shutting down state. Reject forced lock.
Oct 24 20:01:11.882621 osafplmd [19844:imma_oi_api.c:0895] >> admin_op_result_common
Oct 24 20:01:11.882645 osafplmd [19844:imma_oi_api.c:1137] << admin_op_result_common
Oct 24 20:01:11.882653 osafplmd [19844:plms_adm_fsm.c:3848] TR Sending admin response to IMMsuccessful.
Oct 24 20:01:11.882660 osafplmd [19844:plms_adm_fsm.c:3852] << plms_ent_shutdown_to_lock: ret_err: 1
Oct 24 20:01:11.882666 osafplmd [19844:plms_adm_fsm.c:0846] << plms_HE_adm_shutdown_state_lock_op: ret_err: 2
Oct 24 20:01:11.882672 osafplmd [19844:plms_adm_fsm.c:4945] << plms_imm_adm_op_req_process: ret_err: 2



---

** [tickets:#1782] PLM: entities can get permanently stuck in shutting-down 
state**

**Status:** accepted
**Milestone:** 5.17.08
**Created:** Mon Apr 25, 2016 01:00 PM UTC by Alex Jones
**Last Updated:** Mon Apr 10, 2017 01:40 PM UTC
**Owner:** Alex Jones


According to the PLM spec, section 5.4, Figure 9 and Figure 10, an entity in 
the shutting-down admin state should accept an explicit lock or unlock to 
transition it out. PLM currently forbids this explicit transition.

So, if the "shutdown" admin operation is called for an entity, and the automatic 
transition to locked never completes, the entity is permanently stuck.

PLM needs to accept an explicit lock/unlock in the shutting-down state.
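
The requested behaviour can be sketched as a small transition function. This is only an illustration of the rule described in this ticket: the type and function names are hypothetical and not taken from the PLM sources, and the state set is simplified to three admin states.

```c
#include <stdbool.h>

/* Hypothetical, simplified admin states and operations (illustration only). */
typedef enum { ADM_UNLOCKED, ADM_LOCKED, ADM_SHUTTING_DOWN } adm_state_t;
typedef enum { OP_LOCK, OP_UNLOCK, OP_SHUTDOWN } adm_op_t;

/* Returns true and updates *state if the operation is accepted. */
bool adm_apply(adm_state_t *state, adm_op_t op)
{
	switch (op) {
	case OP_LOCK:
		/* Accepted from unlocked and, as the ticket asks, from shutting-down. */
		if (*state == ADM_LOCKED)
			return false;
		*state = ADM_LOCKED;
		return true;
	case OP_UNLOCK:
		/* Accepted from locked and from shutting-down. */
		if (*state == ADM_UNLOCKED)
			return false;
		*state = ADM_UNLOCKED;
		return true;
	case OP_SHUTDOWN:
		/* Assumption: shutdown is only meaningful from unlocked. */
		if (*state != ADM_UNLOCKED)
			return false;
		*state = ADM_SHUTTING_DOWN;
		return true;
	}
	return false;
}
```

With this rule, an entity left in `ADM_SHUTTING_DOWN` can still be moved out by an explicit `OP_LOCK` or `OP_UNLOCK`, so it cannot stay stuck.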




[tickets] [opensaf:tickets] #1510 CKPT: cpnd crashes during checkpoint open timeout with large sections

2017-10-24 Thread Alex Jones via Opensaf-tickets
- **status**: review --> fixed
- **Milestone**: future --> 5.17.10
- **Comment**:

commit ee7cb901b8eb023502e3c21180afb78dd15c328b
Author: Alex Jones 
Date:   Tue Oct 24 09:10:15 2017 -0400




---

** [tickets:#1510] CKPT: cpnd crashes during checkpoint open timeout with large 
sections**

**Status:** fixed
**Milestone:** 5.17.10
**Created:** Thu Oct 01, 2015 04:14 PM UTC by Alex Jones
**Last Updated:** Tue Oct 17, 2017 02:20 PM UTC
**Owner:** Alex Jones


When opening a collocated checkpoint replica where the active has a large number 
of sections (~200k), the sync from the active can time out with error code 
SA_AIS_ERR_TRY_AGAIN. In this case the code frees the memory for the node, 
but does not remove the node from the db. When the checkpoint is accessed 
again, the db still references the freed node, and ckptnd crashes.
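
The general shape of the fix is to unlink the node from the database before freeing it. The sketch below illustrates that ordering with a hypothetical singly linked list standing in for the real database; none of these names come from the cpnd sources, and the actual committed fix may differ.

```c
#include <stdlib.h>

/* Hypothetical stand-ins for the checkpoint node and its database. */
struct ckpt_node {
	unsigned long long ckpt_id;
	struct ckpt_node *next;
};

struct ckpt_db {
	struct ckpt_node *head;
};

/* Insert a freshly allocated node; returns NULL on allocation failure. */
struct ckpt_node *db_add(struct ckpt_db *db, unsigned long long id)
{
	struct ckpt_node *n = calloc(1, sizeof(*n));
	if (n == NULL)
		return NULL;
	n->ckpt_id = id;
	n->next = db->head;
	db->head = n;
	return n;
}

/* Look up a node by id; returns NULL if it is not in the database. */
struct ckpt_node *db_get(const struct ckpt_db *db, unsigned long long id)
{
	for (struct ckpt_node *n = db->head; n != NULL; n = n->next)
		if (n->ckpt_id == id)
			return n;
	return NULL;
}

/* Error path: unlink the node first, then free it, so that a later
 * lookup can never return a pointer to freed memory. */
void db_remove_and_free(struct ckpt_db *db, struct ckpt_node *node)
{
	for (struct ckpt_node **pp = &db->head; *pp != NULL; pp = &(*pp)->next) {
		if (*pp == node) {
			*pp = node->next;
			break;
		}
	}
	free(node);
}
```

Calling only `free(node)` on the timeout path, without the unlink step, leaves a dangling entry that the next lookup dereferences.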

Valgrind analysis shows the following:

==53610== Thread 1:
==53610== Invalid read of size 4
==53610==    at 0x4E4D7C4: ncs_patricia_tree_get (patricia.c:93)
==53610==    by 0x40400D: cpnd_ckpt_node_get (cpnd_db.c:42)
==53610==    by 0x40D1A2: cpnd_process_evt (cpnd_evt.c:1957)
==53610==    by 0x40E9D6: cpnd_main_process (cpnd_init.c:568)
==53610==    by 0x403882: main (cpnd_main.c:72)
==53610==  Address 0x687de60 is 0 bytes inside a block of size 1,072 free'd
==53610==    at 0x4C29D4E: free (in /usr/lib64/valgrind/vgpreload_memcheck-amd64-linux.so)
==53610==    by 0x40A827: cpnd_evt_proc_ckpt_open (cpnd_evt.c:983)
==53610==    by 0x40D426: cpnd_process_evt (cpnd_evt.c:202)
==53610==    by 0x40E9D6: cpnd_main_process (cpnd_init.c:568)
==53610==    by 0x403882: main (cpnd_main.c:72)
==53610== 
==53610== Invalid read of size 8
==53610==    at 0x4E4D7C0: ncs_patricia_tree_get (patricia.c:90)
==53610==    by 0x40400D: cpnd_ckpt_node_get (cpnd_db.c:42)
==53610==    by 0x40D1A2: cpnd_process_evt (cpnd_evt.c:1957)
==53610==    by 0x40E9D6: cpnd_main_process (cpnd_init.c:568)
==53610==    by 0x403882: main (cpnd_main.c:72)
==53610==  Address 0x687de70 is 16 bytes inside a block of size 1,072 free'd
==53610==    at 0x4C29D4E: free (in /usr/lib64/valgrind/vgpreload_memcheck-amd64-linux.so)
==53610==    by 0x40A827: cpnd_evt_proc_ckpt_open (cpnd_evt.c:983)
==53610==    by 0x40D426: cpnd_process_evt (cpnd_evt.c:202)
==53610==    by 0x40E9D6: cpnd_main_process (cpnd_init.c:568)
==53610==    by 0x403882: main (cpnd_main.c:72)
==53610== 
==53610== Invalid read of size 8
==53610==    at 0x4E4D7FB: ncs_patricia_tree_get (patricia.c:435)
==53610==    by 0x40400D: cpnd_ckpt_node_get (cpnd_db.c:42)
==53610==    by 0x40D1A2: cpnd_process_evt (cpnd_evt.c:1957)
==53610==    by 0x40E9D6: cpnd_main_process (cpnd_init.c:568)
==53610==    by 0x403882: main (cpnd_main.c:72)
==53610==  Address 0x687de78 is 24 bytes inside a block of size 1,072 free'd
==53610==    at 0x4C29D4E: free (in /usr/lib64/valgrind/vgpreload_memcheck-amd64-linux.so)
==53610==    by 0x40A827: cpnd_evt_proc_ckpt_open (cpnd_evt.c:983)
==53610==    by 0x40D426: cpnd_process_evt (cpnd_evt.c:202)
==53610==    by 0x40E9D6: cpnd_main_process (cpnd_init.c:568)
==53610==    by 0x403882: main (cpnd_main.c:72)
==53610== 
==53610== Invalid read of size 1
==53610==    at 0x4C2D0B9: bcmp (in /usr/lib64/valgrind/vgpreload_memcheck-amd64-linux.so)
==53610==    by 0x4E4D803: ncs_patricia_tree_get (patricia.c:435)
==53610==    by 0x40400D: cpnd_ckpt_node_get (cpnd_db.c:42)
==53610==    by 0x40D1A2: cpnd_process_evt (cpnd_evt.c:1957)
==53610==    by 0x40E9D6: cpnd_main_process (cpnd_init.c:568)
==53610==    by 0x403882: main (cpnd_main.c:72)
==53610==  Address 0x687de80 is 32 bytes inside a block of size 1,072 free'd
==53610==    at 0x4C29D4E: free (in /usr/lib64/valgrind/vgpreload_memcheck-amd64-linux.so)
==53610==    by 0x40A827: cpnd_evt_proc_ckpt_open (cpnd_evt.c:983)
==53610==    by 0x40D426: cpnd_process_evt (cpnd_evt.c:202)
==53610==    by 0x40E9D6: cpnd_main_process (cpnd_init.c:568)
==53610==    by 0x403882: main (cpnd_main.c:72)
==53610== 
==53610== Invalid read of size 1
==53610==    at 0x4C2D0D0: bcmp (in /usr/lib64/valgrind/vgpreload_memcheck-amd64-linux.so)
==53610==    by 0x4E4D803: ncs_patricia_tree_get (patricia.c:435)
==53610==    by 0x40400D: cpnd_ckpt_node_get (cpnd_db.c:42)
==53610==    by 0x40D1A2: cpnd_process_evt (cpnd_evt.c:1957)
==53610==    by 0x40E9D6: cpnd_main_process (cpnd_init.c:568)
==53610==    by 0x403882: main (cpnd_main.c:72)
==53610==  Address 0x687de81 is 33 bytes inside a block of size 1,072 free'd
==53610==    at 0x4C29D4E: free (in /usr/lib64/valgrind/vgpreload_memcheck-amd64-linux.so)
==53610==    by 0x40A827: cpnd_evt_proc_ckpt_open (cpnd_evt.c:983)
==53610==    by 0x40D426: cpnd_process_evt (cpnd_evt.c:202)
==53610==    by 0x40E9D6: cpnd_main_process (cpnd_init.c:568)
==53610==    by 0x403882: main (cpnd_main.c:72)
==53610== 
==53610== Invalid read of size 4
==53610==    at 0x4E4D7C4: ncs_patricia_tree_get (patricia.c:93)
==53610==    by 0x40400D: cpnd_ckpt_node_get (cpnd_db.c:42)
==53610==

[tickets] [opensaf:tickets] #2623 ntf: Add support for code coverage to ntfimcn

2017-10-24 Thread elunlen via Opensaf-tickets
- **status**: accepted --> fixed
- **assigned_to**: elunlen -->  nobody 
- **Comment**:

commit 0e40c15a37b87195af7c5cd7b9c952d71021f2c2
Author: Lennart Lund 
Date:   Tue Oct 24 14:06:57 2017 +0200

ntf: Add support for code coverage to ntfimcn [#2623]

Opened again. Add missing static declaration of the inline version of the
create_gcov_flush_thread(void) function in the osaf_gcov.h file.
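
A minimal sketch of the header pattern this comment refers to, assuming a configure-time macro selects between the real function and a no-op inline version. `ENABLE_GCOV` is a hypothetical macro name and is not taken from osaf_gcov.h. In C99 and later, a plain `inline` definition in a header does not provide an external definition, so a call that the compiler does not inline can fail at link time; marking the inline version `static` gives each translation unit its own copy. Whether that is exactly the failure that reopened the ticket is an assumption.

```c
/* Sketch of a coverage header; ENABLE_GCOV is a hypothetical macro name. */
#ifdef ENABLE_GCOV
/* With coverage enabled, the real implementation lives in a .c file. */
void create_gcov_flush_thread(void);
#else
/* No-op fallback: 'static' keeps this definition local to each includer. */
static inline void create_gcov_flush_thread(void)
{
}
#endif
```

Callers can then invoke `create_gcov_flush_thread()` unconditionally, and builds without coverage compile the call away.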




---

** [tickets:#2623] ntf: Add support for code coverage to ntfimcn**

**Status:** fixed
**Milestone:** 5.17.10
**Created:** Wed Oct 11, 2017 01:49 PM UTC by elunlen
**Last Updated:** Tue Oct 24, 2017 11:41 AM UTC
**Owner:** nobody


Add support for code coverage using gcov. For all services calling daemonize() 
this is done automatically, but ntfimcn does not call that function, so coverage 
has to be activated explicitly when --enable-gcov is passed to the ./configure 
script. The update of ntfimcn shall be based on the functionality added via 
ticket [#2620].




[tickets] [opensaf:tickets] #2623 ntf: Add support for code coverage to ntfimcn

2017-10-24 Thread elunlen via Opensaf-tickets
- **status**: fixed --> accepted
- **assigned_to**: elunlen
- **Comment**:

Opened again. Add missing static declaration of the inline version of the 
create_gcov_flush_thread(void) function in the osaf_gcov.h file.



---

** [tickets:#2623] ntf: Add support for code coverage to ntfimcn**

**Status:** accepted
**Milestone:** 5.17.10
**Created:** Wed Oct 11, 2017 01:49 PM UTC by elunlen
**Last Updated:** Mon Oct 23, 2017 10:07 AM UTC
**Owner:** elunlen


Add support for code coverage using gcov. For all services calling daemonize() 
this is done automatically, but ntfimcn does not call that function, so coverage 
has to be activated explicitly when --enable-gcov is passed to the ./configure 
script. The update of ntfimcn shall be based on the functionality added via 
ticket [#2620].




[tickets] [opensaf:tickets] #2652 clm: return wrong error code

2017-10-24 Thread Vu Minh Nguyen via Opensaf-tickets
- **status**: assigned --> review



---

** [tickets:#2652] clm: return wrong error code**

**Status:** review
**Milestone:** 5.17.10
**Created:** Tue Oct 24, 2017 05:09 AM UTC by Vu Minh Nguyen
**Last Updated:** Tue Oct 24, 2017 05:09 AM UTC
**Owner:** Vu Minh Nguyen


saClmClusterNodeGet_4() returns `SA_AIS_ERR_UNAVAILABLE` (31) when information 
about a non-member node is queried from a member node.

According to AIS, chapter 3.5.5, it should return `SA_AIS_ERR_NOT_EXIST` 
instead.

> root@SC-1:~# clm-state
> safNode=PL-3,safCluster=myClmCluster
>    saClmNodeAdminState=LOCKED(2)
>    saClmNodeIsMember=NON_MEMBER(0)
>    saClmNodeID=131855(0x2030f)
> safNode=PL-4,safCluster=myClmCluster
>    saClmNodeAdminState=UNLOCKED(1)
>    saClmNodeIsMember=MEMBER(1)
>    saClmNodeID=132111(0x2040f)
> safNode=PL-5,safCluster=myClmCluster
>    saClmNodeAdminState=UNLOCKED(1)
>    saClmNodeIsMember=MEMBER(1)
>    saClmNodeID=132367(0x2050f)
> safNode=SC-1,safCluster=myClmCluster
>    saClmNodeAdminState=UNLOCKED(1)
>    saClmNodeIsMember=MEMBER(1)
>    saClmNodeID=131343(0x2010f)
> safNode=SC-2,safCluster=myClmCluster
>    saClmNodeAdminState=UNLOCKED(1)
>    saClmNodeIsMember=MEMBER(1)
>    saClmNodeID=131599(0x2020f)
> root@SC-1:~# clmprint -n 0x2030f
> node_id:131855(2030f)
> error - clmprint:: saClmClusterNodeGet_4 failed, rc = 31
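
The expected return-code selection can be sketched as below. This is a hedged illustration, not the CLM implementation: the function and enum names are hypothetical, the numeric values mirror the AIS codes (31 is the value shown in the output above), and returning UNAVAILABLE when the caller itself is not a member is an assumption about the intended split between the two codes.

```c
#include <stdbool.h>

/* Hypothetical mirror of the relevant AIS return codes. */
typedef enum {
	SKETCH_OK = 1,
	SKETCH_ERR_NOT_EXIST = 12,
	SKETCH_ERR_UNAVAILABLE = 31
} sketch_rc_t;

/* Choose the return code for a node query (illustration only). */
sketch_rc_t node_get_rc(bool caller_is_member, bool target_is_member)
{
	/* Assumption: UNAVAILABLE describes the invoking node, not the target. */
	if (!caller_is_member)
		return SKETCH_ERR_UNAVAILABLE;
	/* The case from this ticket: a member asks about a non-member node. */
	if (!target_is_member)
		return SKETCH_ERR_NOT_EXIST;
	return SKETCH_OK;
}
```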




[tickets] [opensaf:tickets] #2651 clm: clmprint does not work as expected

2017-10-24 Thread Vu Minh Nguyen via Opensaf-tickets
Here are the major changes in this ticket.

// printout when the timeout option is given in the wrong data format
$ clmprint -n 0x2020f -t 11safasd
error - input wrong data format!

// printout when querying non-member node information
$ clmprint -n 0x3060f
Node id 0x3060f is not in cluster membership

// printout when passing multiple options
$ clmprint  -b -m -a -n
error - multiple options!

// printout when passing a duplicated option
$ clmprint  -n 0x2030f -n 0x2030f
error - duplicated option!
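
Rejecting an input such as `11safasd` comes down to checking that the whole argument was consumed by the numeric conversion. A minimal sketch of such a check follows; `parse_timeout` is a hypothetical helper and not the code used in clmprint.

```c
#include <errno.h>
#include <stdbool.h>
#include <stdlib.h>

/* Parse a non-negative decimal integer; reject empty input, overflow,
 * negative values and trailing garbage such as "11safasd". */
bool parse_timeout(const char *arg, long long *out)
{
	char *end = NULL;

	if (arg == NULL || *arg == '\0')
		return false;

	errno = 0;
	long long value = strtoll(arg, &end, 10);

	/* end == arg: no digits at all; *end != '\0': leftover characters. */
	if (errno != 0 || end == arg || *end != '\0' || value < 0)
		return false;

	*out = value;
	return true;
}
```

A caller would print the error message and exit with a non-zero status when the function returns false.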


---

** [tickets:#2651] clm: clmprint does not work as expected**

**Status:** review
**Milestone:** 5.17.10
**Created:** Mon Oct 23, 2017 10:40 AM UTC by Vu Minh Nguyen
**Last Updated:** Tue Oct 24, 2017 07:44 AM UTC
**Owner:** Vu Minh Nguyen


1. clmprint returns 0 for the error case
> root@SC-1:~# clmprint -n 0x3060f
> node_id:198159(3060f)
> error - clmprint:: saClmClusterNodeGet_4 failed, rc = 12
> root@SC-1:~# echo $?

2. clmprint does not handle invalid inputs
> clmprint -b -m -a -n
> node_id:4294967295()
> node_id:4294967295()
> ...

3. clmprint is not able to print non-member node information
> root@SC-1:~# clm-adm -o lock safNode=PL-5,safCluster=myClmCluster
> root@SC-1:~# clm-state safNode=PL-5,safCluster=myClmCluster
> safNode=PL-5,safCluster=myClmCluster
> saClmNodeAdminState=LOCKED(2)
> saClmNodeIsMember=NON_MEMBER(0)
> saClmNodeID=132367(0x2050f)
> root@SC-1:~# clmprint -n 0x2050f
> node_id:132367(2050f)
> error - clmprint:: saClmClusterNodeGet_4 failed, rc = 31




[tickets] [opensaf:tickets] #2651 clm: clmprint does not work as expected

2017-10-24 Thread Vu Minh Nguyen via Opensaf-tickets
- **status**: assigned --> review



---

** [tickets:#2651] clm: clmprint does not work as expected**

**Status:** review
**Milestone:** 5.17.10
**Created:** Mon Oct 23, 2017 10:40 AM UTC by Vu Minh Nguyen
**Last Updated:** Mon Oct 23, 2017 10:40 AM UTC
**Owner:** Vu Minh Nguyen


1. clmprint returns 0 for the error case
> root@SC-1:~# clmprint -n 0x3060f
> node_id:198159(3060f)
> error - clmprint:: saClmClusterNodeGet_4 failed, rc = 12
> root@SC-1:~# echo $?

2. clmprint does not handle invalid inputs
> clmprint -b -m -a -n
> node_id:4294967295()
> node_id:4294967295()
> ...

3. clmprint is not able to print non-member node information
> root@SC-1:~# clm-adm -o lock safNode=PL-5,safCluster=myClmCluster
> root@SC-1:~# clm-state safNode=PL-5,safCluster=myClmCluster
> safNode=PL-5,safCluster=myClmCluster
> saClmNodeAdminState=LOCKED(2)
> saClmNodeIsMember=NON_MEMBER(0)
> saClmNodeID=132367(0x2050f)
> root@SC-1:~# clmprint -n 0x2050f
> node_id:132367(2050f)
> error - clmprint:: saClmClusterNodeGet_4 failed, rc = 31




[tickets] [opensaf:tickets] #2592 smf: Upgrade nodes without using node group

2017-10-24 Thread elunlen via Opensaf-tickets
- **status**: review --> fixed
- **assigned_to**: elunlen -->  nobody 
- **Comment**:

commit 48340ca7a3c45c1bd12fe37d7c05e026abd55531
Author: Lennart Lund 
Date:   Tue Oct 24 09:21:35 2017 +0200

smf: smf: Upgrade nodes without using node group [#2592]

Do not create a node group for lock operations on nodes if only one node
(deactivation/activation units). This is needed to make it possible to
upgrade old OpenSAF versions. On old versions AMF will crash and cause
cyclic node reboot




---

** [tickets:#2592] smf: Upgrade nodes without using node group**

**Status:** fixed
**Milestone:** 5.17.10
**Created:** Mon Sep 25, 2017 11:15 AM UTC by elunlen
**Last Updated:** Thu Sep 28, 2017 12:59 PM UTC
**Owner:** nobody


After parallel lock handling was introduced, a node group is always created when 
activation/deactivation units are nodes. This is also the case if the list of 
nodes contains only one node.
A problem is that old versions of AMF will crash and cause a cyclic node reboot, 
since node groups are not handled correctly.
Change lock handling so that a node group is not used if there is only one node. 
This will make it possible to upgrade using a rolling upgrade over nodes (only 
one node at a time).

