- **status**: unassigned --> accepted
- **assigned_to**: Nagendra Kumar
- **Version**: --> 5.2 FC
- **Comment**:
This is reproducible on eb8089acf533+ (opensaf-5.1.x) 5.1.GA/5.1.0 release also.
Reproducible steps:
1. The following code changes were done for reproducing on standby controller:
diff --git a/osaf/services/saf/amf/amfd/svctype.cc
b/osaf/services/saf/amf/amfd/svctype.cc
--- a/osaf/services/saf/amf/amfd/svctype.cc
+++ b/osaf/services/saf/amf/amfd/svctype.cc
@@ -230,6 +230,9 @@ SaAisErrorT avd_svctype_config_get(void)
searchParam.searchOneAttr.attrName =
const_cast<SaImmAttrNameT>("SaImmAttrClassName");
searchParam.searchOneAttr.attrValueType = SA_IMM_ATTR_SASTRINGT;
searchParam.searchOneAttr.attrValue = &className;
+ LOG_ER("1. Sleeping .........................");
+ sleep(1);
+ LOG_ER("2. Sleeping .........................");
if (immutil_saImmOmSearchInitialize_2(avd_cb->immOmHandle, nullptr,
SA_IMM_SUBTREE,
SA_IMM_SEARCH_ONE_ATTR | SA_IMM_SEARCH_GET_ALL_ATTR,
&searchParam,
2. Start Act(SC-1) and Standby(SC-2) controller.
3. Kill immnd on SC-2 and when when following errors comes again kill Immnd:
"1. Sleeping ........................."
4. Amfd exists:
Mar 14 13:02:58 PM_SC-2 osafimmd[1586]: NO SBY: New Epoch for IMMND process at
node 2010f old epoch: 26 new epoch:27
Mar 14 13:02:58 PM_SC-2 osafimmd[1586]: NO IMMND coord at 2010f
Mar 14 13:02:58 PM_SC-2 osafamfd[1637]: ER No objects found (1)
Mar 14 13:02:58 PM_SC-2 osafamfd[1637]: ER Failed to read configuration, AMF
will not start
Mar 14 13:02:58 PM_SC-2 osafamfd[1637]: ER avd_imm_config_get FAILED
Mar 14 13:02:58 PM_SC-2 osafamfnd[1647]: WA AMF director unexpectedly crashed
Mar 14 13:02:58 PM_SC-2 osafamfnd[1647]: Rebooting OpenSAF NodeId = 131599 EE
Name = , Reason: local AVD down(Adest) or both AVD down(Vdest) received,
OwnNodeId = 131599, SupervisionTime = 60
Mar 14 13:02:58 PM_SC-2 opensaf_reboot: Rebooting local node; timeout=60
---
** [tickets:#2361] AMFD: amfd crashed with healthCheckcallbackTimeout causing
both controllers to reboot**
**Status:** accepted
**Milestone:** 5.0.2
**Created:** Fri Mar 10, 2017 09:08 AM UTC by Chani Srivastava
**Last Updated:** Fri Mar 10, 2017 10:29 AM UTC
**Owner:** Nagendra Kumar
**Environment details**
OS : Suse 64bit
Changeset : 8634 ( 5.2.FC)
Setup : 4 nodes ( 2 controllers and 2 payloads with 1PBE enabled )
**Step**
1. Bringu opensaf on four nodes and create a load of 1 lakh objects
2. Imm test cases running on standby controller
SC-1 syslog
Mar 7 19:45:58 OSAF-SC1 osafamfnd[4720]: NO
'safComp=IMMD,safSu=SC-1,safSg=2N,safApp=OpenSAF' recovery action escalated
from 'componentFailover' to 'suFailover'
Mar 7 19:45:58 OSAF-SC1 osafamfnd[4720]: NO
'safComp=IMMD,safSu=SC-1,safSg=2N,safApp=OpenSAF' faulted due to
'healthCheckcallbackTimeout' : Recovery is 'suFailover'
**Mar 7 19:45:58 OSAF-SC1 osafamfnd[4720]: ER
safComp=IMMD,safSu=SC-1,safSg=2N,safApp=OpenSAF Faulted due
to:healthCheckcallbackTimeout Recovery is:suFailover
Mar 7 19:45:58 OSAF-SC1 osafamfnd[4720]: Rebooting OpenSAF NodeId = 131343 EE
Name = , Reason: Component faulted: recovery is node failfast, OwnNodeId =
131343, SupervisionTime = 60**
Mar 7 19:45:58 OSAF-SC1 opensaf_reboot: Rebooting local node; timeout=60
SC-2 syslog
Mar 7 19:41:00 OSAF-SC2 osafamfd[4339]: ER Failed to read configuration, AMF
will not start
Mar 7 19:41:00 OSAF-SC2 osafamfd[4339]: ER avd_imm_config_get FAILED
**Mar 7 19:41:00 OSAF-SC2 osafamfnd[4349]: ER AMFD has unexpectedly crashed.
Rebooting node**
Mar 7 19:41:00 OSAF-SC2 osafamfnd[4349]: Rebooting OpenSAF NodeId = 131599 EE
Name = , Reason: AMFD has unexpectedly crashed. Rebooting node, OwnNodeId =
131599, SupervisionTime = 60
Mar 7 19:41:00 OSAF-SC2 opensaf_reboot: Rebooting local node; timeout=60
amfd, immnd and immd traces are shared seperately as those are huge in size
---
Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is
subscribed to https://sourceforge.net/p/opensaf/tickets/
To unsubscribe from further messages, a project admin can change settings at
https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a
mailing list, you can unsubscribe from the mailing list.
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
Opensaf-tickets mailing list
Opensaf-tickets@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets