---

** [tickets:#2733] ntf: NTFD has unexpectedly crashed due to ntfimcnd exit**

**Status:** unassigned
**Milestone:** 5.18.01
**Created:** Thu Dec 07, 2017 11:38 AM UTC by Canh Truong
**Last Updated:** Thu Dec 07, 2017 11:38 AM UTC
**Owner:** nobody


When the cluster is started,  NTFD has unexpectedly crashed. The issue happen 
because the ntfimcnd exit when saImmOiDispatch() fails with BAD_HANDLE error

syslog:
017-12-07 17:42:39.278 SC-1 osafckptnd[531]: Started
2017-12-07 17:42:39.355 SC-1 osafsmfnd[573]: Started
2017-12-07 17:42:39.356 SC-1 osafsmfnd[573]: NO MDS initialize_smfnd: 
smfnd_mds_init()
2017-12-07 17:42:39.356 SC-1 osafsmfnd[573]: NO MDS smfnd_mds_init: 
mds_get_handle()
2017-12-07 17:42:39.357 SC-1 osafsmfnd[573]: NO MDS mds_get_handle: Done
2017-12-07 17:42:39.357 SC-1 osafsmfnd[573]: NO MDS smfnd_mds_init: 
mds_register()
2017-12-07 17:42:39.358 SC-1 osafsmfnd[573]: NO MDS smfnd_mds_init: Done
2017-12-07 17:42:39.394 SC-1 osafamfnd[501]: NO 
'safSu=SC-1,safSg=NoRed,safApp=OpenSAF' Presence State INSTANTIATING => 
INSTANTIATED
2017-12-07 17:42:39.425 SC-1 osafsmfd[615]: Started
2017-12-07 17:42:39.472 SC-1 osafamfnd[501]: NO Assigning 
'safSi=NoRed1,safApp=OpenSAF' ACTIVE to 'safSu=SC-1,safSg=NoRed,safApp=OpenSAF'
2017-12-07 17:42:39.473 SC-1 osafamfnd[501]: NO Assigned 
'safSi=NoRed1,safApp=OpenSAF' ACTIVE to 'safSu=SC-1,safSg=NoRed,safApp=OpenSAF'
2017-12-07 17:42:39.663 SC-1 osafimmnd[443]: NO Implementer connected: 19 
(safCheckPointService) <265, 2010f>
2017-12-07 17:42:39.665 SC-1 osafimmnd[443]: NO Implementer disconnected 19 
<265, 2010f> (safCheckPointService)
2017-12-07 17:42:49.558 SC-1 osafamfd[487]: NO Received node_up from 2030f: 
msg_id 1
2017-12-07 17:42:49.659 SC-1 osafamfnd[501]: NO Instantiation of 
'safComp=NTF,safSu=SC-1,safSg=2N,safApp=OpenSAF' failed
2017-12-07 17:42:49.659 SC-1 osafamfnd[501]: NO Reason: component registration 
timer expired
2017-12-07 17:42:54.695 SC-1 osafimmnd[443]: NO Implementer locally 
disconnected. Marking it as doomed 16 <15, 2010f> (@OpenSafImmReplicatorA)
2017-12-07 17:42:54.695 SC-1 osafimmnd[443]: NO Implementer disconnected 16 
<15, 2010f> (@OpenSafImmReplicatorA)
2017-12-07 17:42:54.696 SC-1 opensafd[384]: ER Service NTFD has unexpectedly 
crashed. Unable to continue, exiting


2017-12-07 17:42:54.712 SC-1 osafntfd[672]: mkfifo already exists: 
/var/lib/opensaf/osafntfd.fifo File exists
2017-12-07 17:42:54.712 SC-1 osafntfd[672]: Started
2017-12-07 17:42:54.714 SC-1 osafamfnd[501]: NO 
'safSu=SC-1,safSg=2N,safApp=OpenSAF' Presence State INSTANTIATING => 
INSTANTIATED
2017-12-07 17:42:54.717 SC-1 
2017-12-07 17:42:54.725 SC-1 osafntfimcnd[677]: logtrace: trace enabled to file 
'osafntfimcn', mask=0xffffffff
2017-12-07 17:42:54.728 SC-1 osafimmnd[443]: NO Implementer (applier) 
connected: 20 (@OpenSafImmReplicatorA) <267, 2010f>
2017-12-07 17:42:54.731 SC-1 osafntfimcnd[677]: NO Started
2017-12-07 17:42:59.564 SC-1 osafamfd[487]: exiting for shutdown
2017-12-07 17:42:59.565 SC-1 osafamfnd[501]: ER AMFD has unexpectedly crashed. 
Rebooting node
2017-12-07 17:42:59.565 SC-1 osafamfnd[501]: Rebooting OpenSAF NodeId = 131343 
EE Name = , Reason: AMFD has unexpectedly crashed. Rebooting node, OwnNodeId = 
131343, SupervisionTime = 60
2017-12-07 17:42:59.573 SC-1 opensaf_reboot: Rebooting local node; timeout=60
2017-12-07 17:42:59.624 SC-1 upstart-socket-bridge[353]: Disconnected from 
Upstart


ntfimcnd:
<143>1 2017-12-07T17:43:21.529917+07:00 SC-1 osafntfimcnd 474 osafntfimcn [meta 
sequenceId="136"] 477:imm/agent/imma_mds.cc:402 T3 IMMND DOWN

<143>1 2017-12-07T17:43:21.529922+07:00 SC-1 osafntfimcnd 474 osafntfimcn [meta 
sequenceId="137"] 477:imm/agent/imma_db.cc:680 >> imma_mark_clients_stale 
<143>1 2017-12-07T17:43:21.529926+07:00 SC-1 osafntfimcnd 474 osafntfimcn [meta 
sequenceId="138"] 477:imm/agent/imma_db.cc:741 TR Stale marked client cl:16 
node:2010f
<143>1 2017-12-07T17:43:21.52993+07:00 SC-1 osafntfimcnd 474 osafntfimcn [meta 
sequenceId="139"] 477:imm/agent/imma_db.cc:824 >> isExposed 
<143>1 2017-12-07T17:43:21.529934+07:00 SC-1 osafntfimcnd 474 osafntfimcn [meta 
sequenceId="140"] 477:imm/agent/imma_db.cc:853 TR OM CLIENT
<143>1 2017-12-07T17:43:21.529937+07:00 SC-1 osafntfimcnd 474 osafntfimcn [meta 
sequenceId="141"] 477:imm/agent/imma_db.cc:886 TR isExposed Returning Exposed:0
<143>1 2017-12-07T17:43:21.529941+07:00 SC-1 osafntfimcnd 474 osafntfimcn [meta 
sequenceId="142"] 477:imm/agent/imma_db.cc:887 << isExposed 
<143>1 2017-12-07T17:43:21.529945+07:00 SC-1 osafntfimcnd 474 osafntfimcn [meta 
sequenceId="143"] 477:imm/agent/imma_db.cc:755 << imma_mark_clients_stale 
<143>1 2017-12-07T17:43:21.52995+07:00 SC-1 osafntfimcnd 474 osafntfimcn [meta 
sequenceId="144"] 477:mds/mds_dt_trans.c:755 >> mdtm_process_poll_recv_data_tcp 
<143>1 2017-12-07T17:43:21.529956+07:00 SC-1 osafntfimcnd 474 osafntfimcn [meta 
sequenceId="145"] 477:imm/agent/imma_mds.cc:402 T3 IMMND DOWN
<143>1 2017-12-07T17:43:21.52996+07:00 SC-1 osafntfimcnd 474 osafntfimcn [meta 
sequenceId="146"] 477:imm/agent/imma_db.cc:680 >> imma_mark_clients_stale 
<143>1 2017-12-07T17:43:21.530215+07:00 SC-1 osafntfimcnd 474 osafntfimcn [meta 
sequenceId="147"] 477:imm/agent/imma_db.cc:741 TR Stale marked client cl:15 
node:2010f
<143>1 2017-12-07T17:43:21.530235+07:00 SC-1 osafntfimcnd 474 osafntfimcn [meta 
sequenceId="148"] 477:imm/agent/imma_db.cc:824 >> isExposed 
<143>1 2017-12-07T17:43:21.530247+07:00 SC-1 osafntfimcnd 474 osafntfimcn [meta 
sequenceId="149"] 477:imm/agent/imma_db.cc:877 TR OI CLIENT
<143>1 2017-12-07T17:43:21.530259+07:00 SC-1 osafntfimcnd 474 osafntfimcn [meta 
sequenceId="150"] 477:imm/agent/imma_db.cc:886 TR isExposed Returning Exposed:1
<143>1 2017-12-07T17:43:21.53027+07:00 SC-1 osafntfimcnd 474 osafntfimcn [meta 
sequenceId="151"] 477:imm/agent/imma_db.cc:887 << isExposed 
<143>1 2017-12-07T17:43:21.530281+07:00 SC-1 osafntfimcnd 474 osafntfimcn [meta 
sequenceId="152"] 477:imm/agent/imma_proc.cc:617 >> imma_proc_stale_dispatch 
<143>1 2017-12-07T17:43:21.530302+07:00 SC-1 osafntfimcnd 474 osafntfimcn [meta 
sequenceId="153"] 477:imm/agent/imma_proc.cc:634 T3 Posted stale handle 
ipc-message
<143>1 2017-12-07T17:43:21.530326+07:00 SC-1 osafntfimcnd 474 osafntfimcn [meta 
sequenceId="154"] 477:imm/agent/imma_proc.cc:685 << imma_proc_stale_dispatch 
<143>1 2017-12-07T17:43:21.530339+07:00 SC-1 osafntfimcnd 474 osafntfimcn [meta 
sequenceId="155"] 477:imm/agent/imma_db.cc:755 << imma_mark_clients_stale 
<143>1 2017-12-07T17:43:21.530368+07:00 SC-1 osafntfimcnd 474 osafntfimcn [meta 
sequenceId="156"] 474:imm/agent/imma_oi_api.cc:539 >> saImmOiDispatch 
<143>1 2017-12-07T17:43:21.530377+07:00 SC-1 osafntfimcnd 474 osafntfimcn [meta 
sequenceId="157"] 474:imm/agent/imma_oi_api.cc:569 T1 Handle f0002010f is 
stale, trying to resurrect it.
<143>1 2017-12-07T17:43:21.530384+07:00 SC-1 osafntfimcnd 474 osafntfimcn [meta 
sequenceId="158"] 474:imm/agent/imma_oi_api.cc:674 << saImmOiDispatch 
<141>1 2017-12-07T17:43:21.5304+07:00 SC-1 osafntfimcnd 474 osafntfimcn [meta 
sequenceId="159"] 474:ntf/ntfimcnd/ntfimcn_main.c:185 NO saImmOiDispatch() Fail 
SA_AIS_ERR_BAD_HANDLE (9)




---

Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is 
subscribed to https://sourceforge.net/p/opensaf/tickets/

To unsubscribe from further messages, a project admin can change settings at 
https://sourceforge.net/p/opensaf/admin/tickets/options.  Or, if this is a 
mailing list, you can unsubscribe from the mailing list.
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
Opensaf-tickets mailing list
Opensaf-tickets@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets

Reply via email to