- **Milestone**: 4.5.2 --> 4.6.2
---
** [tickets:#1576] AMF : SU struck in terminating ( health check timeout -
proxy proxied )**
**Status:** unassigned
**Milestone:** 4.6.2
**Created:** Thu Oct 29, 2015 05:49 AM UTC by Srikanth R
**Last Updated:** Thu Oct 29, 2015 05:49 AM UTC
**Owner:** nobody
**Attachments:**
-
[1570.tgz](https://sourceforge.net/p/opensaf/tickets/1576/attachment/1570.tgz)
(1.5 MB; application/x-compressed-tar)
Changeset : 6901
Application : SU1 mapped to SC-2 & SU2 mapped to SC-1.
Each SU consists of 3 Pre instantiable components ( one of
the component is LOCAL & PROXIED and the other two components are SA_AWARE )
Steps :
* Brought up two controllers in the cluster.
* Performed unlock-in operation on SU1.
* Health check is started by both SA-AWARE components.
* One of the SA-AWARE components faulted in health check and as part of
repair, SU is struck in terminating state.
Oct 29 10:30:35 SYSTEST-CNTLR-2 osafamfnd[3617]: NO
'safSu=2nAdminRepair_SU_1,safSg=2nAdminRepair_SG,safApp=2nAdminRepair' Presence
State INSTANTIATING => INSTANTIATED
Oct 29 10:31:21 SYSTEST-CNTLR-2 osafamfnd[3617]: NO saAmfSUFailover is true for
'safSu=2nAdminRepair_SU_1,safSg=2nAdminRepair_SG,safApp=2nAdminRepair'
Oct 29 10:31:21 SYSTEST-CNTLR-2 osafamfnd[3617]: NO SU failover probation timer
started (timeout: 1200000000000 ns)
Oct 29 10:31:21 SYSTEST-CNTLR-2 osafamfnd[3617]: NO Performing failover of
'safSu=2nAdminRepair_SU_1,safSg=2nAdminRepair_SG,safApp=2nAdminRepair' (SU
failover count: 1)
Oct 29 10:31:21 SYSTEST-CNTLR-2 osafamfnd[3617]: NO
'safComp=2nAdminRepair,safSu=2nAdminRepair_SU_1,safSg=2nAdminRepair_SG,safApp=2nAdminRepair'
recovery action escalated from 'noRecommendation' to 'suFailover'
Oct 29 10:31:21 SYSTEST-CNTLR-2 osafamfnd[3617]: NO
'safComp=2nAdminRepair,safSu=2nAdminRepair_SU_1,safSg=2nAdminRepair_SG,safApp=2nAdminRepair'
faulted due to 'healthCheckcallbackTimeout' : Recovery is 'suFailover'
Oct 29 10:31:21 SYSTEST-CNTLR-2 osafamfnd[3617]: NO Terminating components of
'safSu=2nAdminRepair_SU_1,safSg=2nAdminRepair_SG,safApp=2nAdminRepair'(abruptly
& unordered)
Oct 29 10:31:21 SYSTEST-CNTLR-2 osafamfnd[3617]: NO
'safSu=2nAdminRepair_SU_1,safSg=2nAdminRepair_SG,safApp=2nAdminRepair' Presence
State INSTANTIATED => TERMINATING
Oct 29 10:31:21 SYSTEST-CNTLR-2 osafamfnd[3617]: NO
'safSu=2nAdminRepair_SU_1,safSg=2nAdminRepair_SG,safApp=2nAdminRepair' Presence
State TERMINATING => TERMINATING
Oct 29 10:31:21 SYSTEST-CNTLR-2 osafamfnd[3617]: NO
'safSu=2nAdminRepair_SU_1,safSg=2nAdminRepair_SG,safApp=2nAdminRepair' Presence
State TERMINATING => TERMINATING
* Amfd crashes during opensafd stop on the SC-2,
Oct 29 11:27:46 SYSTEST-CNTLR-2 opensafd: Stopping OpenSAF Services
Oct 29 11:27:46 SYSTEST-CNTLR-2 osafamfnd[3617]: NO Shutdown initiated
Oct 29 11:27:46 SYSTEST-CNTLR-2 osafamfnd[3617]: NO Terminating all AMF
components
...
Oct 29 11:27:46 SYSTEST-CNTLR-2 osafamfd[3607]: NO Re-initializing with IMM
...
Oct 29 11:28:46 SYSTEST-CNTLR-2 osafamfd[3607]: exiting for shutdown
Oct 29 11:28:46 SYSTEST-CNTLR-2 osafamfnd[3617]: ER AMF director unexpectedly
crashed
Oct 29 11:28:46 SYSTEST-CNTLR-2 osafamfnd[3617]: Rebooting OpenSAF NodeId =
131599 EE Name = , Reason: local AVD down(Adest) or both AVD down(Vdest)
received, OwnNodeId = 131599, SupervisionTime = 60
---
Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is
subscribed to https://sourceforge.net/p/opensaf/tickets/
To unsubscribe from further messages, a project admin can change settings at
https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a
mailing list, you can unsubscribe from the mailing list.
------------------------------------------------------------------------------
_______________________________________________
Opensaf-tickets mailing list
Opensaf-tickets@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets