- **Milestone**: 4.5.2 --> 4.6.2


---

** [tickets:#1576] AMF : SU struck in terminating ( health check timeout - 
proxy proxied )**

**Status:** unassigned
**Milestone:** 4.6.2
**Created:** Thu Oct 29, 2015 05:49 AM UTC by Srikanth R
**Last Updated:** Thu Oct 29, 2015 05:49 AM UTC
**Owner:** nobody
**Attachments:**

- 
[1570.tgz](https://sourceforge.net/p/opensaf/tickets/1576/attachment/1570.tgz) 
(1.5 MB; application/x-compressed-tar)


Changeset : 6901
Application : SU1 mapped to SC-2 & SU2 mapped to SC-1.
                  Each SU consists of 3 Pre instantiable components ( one of 
the component is LOCAL & PROXIED and the other two components are SA_AWARE )
                  
                  
Steps :

 * Brought up two controllers in the cluster.
 * Performed unlock-in  operation on SU1.
 * Health check is started by both SA-AWARE components.
 *  One of the SA-AWARE components faulted in health check and as part of 
repair, SU is struck in terminating state.


Oct 29 10:30:35 SYSTEST-CNTLR-2 osafamfnd[3617]: NO 
'safSu=2nAdminRepair_SU_1,safSg=2nAdminRepair_SG,safApp=2nAdminRepair' Presence 
State INSTANTIATING => INSTANTIATED
Oct 29 10:31:21 SYSTEST-CNTLR-2 osafamfnd[3617]: NO saAmfSUFailover is true for 
'safSu=2nAdminRepair_SU_1,safSg=2nAdminRepair_SG,safApp=2nAdminRepair'
Oct 29 10:31:21 SYSTEST-CNTLR-2 osafamfnd[3617]: NO SU failover probation timer 
started (timeout: 1200000000000 ns)
Oct 29 10:31:21 SYSTEST-CNTLR-2 osafamfnd[3617]: NO Performing failover of 
'safSu=2nAdminRepair_SU_1,safSg=2nAdminRepair_SG,safApp=2nAdminRepair' (SU 
failover count: 1)
Oct 29 10:31:21 SYSTEST-CNTLR-2 osafamfnd[3617]: NO 
'safComp=2nAdminRepair,safSu=2nAdminRepair_SU_1,safSg=2nAdminRepair_SG,safApp=2nAdminRepair'
 recovery action escalated from 'noRecommendation' to 'suFailover'
Oct 29 10:31:21 SYSTEST-CNTLR-2 osafamfnd[3617]: NO 
'safComp=2nAdminRepair,safSu=2nAdminRepair_SU_1,safSg=2nAdminRepair_SG,safApp=2nAdminRepair'
 faulted due to 'healthCheckcallbackTimeout' : Recovery is 'suFailover'
Oct 29 10:31:21 SYSTEST-CNTLR-2 osafamfnd[3617]: NO Terminating components of 
'safSu=2nAdminRepair_SU_1,safSg=2nAdminRepair_SG,safApp=2nAdminRepair'(abruptly 
& unordered)
Oct 29 10:31:21 SYSTEST-CNTLR-2 osafamfnd[3617]: NO 
'safSu=2nAdminRepair_SU_1,safSg=2nAdminRepair_SG,safApp=2nAdminRepair' Presence 
State INSTANTIATED => TERMINATING
Oct 29 10:31:21 SYSTEST-CNTLR-2 osafamfnd[3617]: NO 
'safSu=2nAdminRepair_SU_1,safSg=2nAdminRepair_SG,safApp=2nAdminRepair' Presence 
State TERMINATING => TERMINATING
Oct 29 10:31:21 SYSTEST-CNTLR-2 osafamfnd[3617]: NO 
'safSu=2nAdminRepair_SU_1,safSg=2nAdminRepair_SG,safApp=2nAdminRepair' Presence 
State TERMINATING => TERMINATING

 * Amfd crashes during opensafd stop  on the SC-2,

Oct 29 11:27:46 SYSTEST-CNTLR-2 opensafd: Stopping OpenSAF Services
Oct 29 11:27:46 SYSTEST-CNTLR-2 osafamfnd[3617]: NO Shutdown initiated
Oct 29 11:27:46 SYSTEST-CNTLR-2 osafamfnd[3617]: NO Terminating all AMF 
components
...
Oct 29 11:27:46 SYSTEST-CNTLR-2 osafamfd[3607]: NO Re-initializing with IMM
...
Oct 29 11:28:46 SYSTEST-CNTLR-2 osafamfd[3607]: exiting for shutdown
Oct 29 11:28:46 SYSTEST-CNTLR-2 osafamfnd[3617]: ER AMF director unexpectedly 
crashed
Oct 29 11:28:46 SYSTEST-CNTLR-2 osafamfnd[3617]: Rebooting OpenSAF NodeId = 
131599 EE Name = , Reason: local AVD down(Adest) or both AVD down(Vdest) 
received, OwnNodeId = 131599, SupervisionTime = 60



---

Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is 
subscribed to https://sourceforge.net/p/opensaf/tickets/

To unsubscribe from further messages, a project admin can change settings at 
https://sourceforge.net/p/opensaf/admin/tickets/options.  Or, if this is a 
mailing list, you can unsubscribe from the mailing list.
------------------------------------------------------------------------------
_______________________________________________
Opensaf-tickets mailing list
Opensaf-tickets@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets

Reply via email to