- **status**: review --> fixed
- **Comment**:
changeset: 8682:50a2033a8a8d
branch: opensaf-5.0.x
parent: 8679:7ec6c15c249f
user: Praveen Malviya <praveen.malv...@oracle.com>
date: Fri Mar 10 10:48:17 2017 +0530
summary: clmd: try to re-read node config from IMM if BAD_HANDLE is
returned [#2325].
changeset: 8683:59e265654232
branch: opensaf-5.1.x
parent: 8680:e02390320bbb
user: Praveen Malviya <praveen.malv...@oracle.com>
date: Fri Mar 10 10:49:06 2017 +0530
summary: clmd: try to re-read node config from IMM if BAD_HANDLE is
returned [#2325].
changeset: 8684:9338ad3cacc0
tag: tip
parent: 8681:0e9c5da42416
user: Praveen Malviya <praveen.malv...@oracle.com>
date: Fri Mar 10 10:49:44 2017 +0530
summary: clmd: try to re-read node config from IMM if BAD_HANDLE is
returned [#2325].
---
** [tickets:#2325] clm: standby clmd crashed after failing to read node
configuration from IMM.**
**Status:** fixed
**Milestone:** 5.0.2
**Created:** Fri Feb 24, 2017 09:32 AM UTC by Praveen
**Last Updated:** Fri Mar 03, 2017 10:40 AM UTC
**Owner:** Praveen
Issue is not reproducible.
While coming up as standby, CLMD successfully initializes with IMM. It
successfuly reads cluster related configuration. While reading node related
configuration from IMM, CLMD make a calls to saImmOmSearchNext_2(). This API
could not send any message to IMMND and failed:
Feb 15 06:32:17 SC-2-2 osafclmd[3972]: WA OpenSAF imm lib: Message loss
detected for dest 565213425675031 service id:25
Feb 15 06:32:17 SC-2-2 osafimmnd[3930]: WA IMMND - Client Node Get Failed for
cli_hdl:932008034831
Feb 15 06:32:17 SC-2-2 osafclmd[3972]: WA OpenSAF imm lib: Message loss
detected for dest 565213425675031 service id:25
Feb 15 06:32:17 SC-2-2 osafclmd[3972]: WA marking handle as exposed
CLMD does not explicitly check whether node config read was sucessful or not.
It comes and completes the cold sync. When a payload joins the cluster, active
CLMD checkpoints run time data for the node. Since node is not present on
standby CLMD, it crashes:
Feb 15 06:33:26 SC-2-2 osafimmd[3915]: NO SBY: New Epoch for IMMND process at
node 2020f old epoch: 22 new epoch:23
Feb 15 06:33:26 SC-2-2 osafclmd[3972]: ER Node is NULL,problem with the
database.
Feb 15 06:33:26 SC-2-2 osafclmd[3972]:
../../opensaf/src/clm/clmd/clms_mbcsv.c:468: ckpt_proc_node_rec: Assertion '0'
failed.
Feb 15 06:33:27 SC-2-2 osafamfnd[4002]: NO
'safComp=CLM,safSu=SC-2,safSg=2N,safApp=OpenSAF' faulted due to 'avaDown' :
Recovery is 'nodeFailfast'
---
Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is
subscribed to https://sourceforge.net/p/opensaf/tickets/
To unsubscribe from further messages, a project admin can change settings at
https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a
mailing list, you can unsubscribe from the mailing list.
------------------------------------------------------------------------------
Announcing the Oxford Dictionaries API! The API offers world-renowned
dictionary content that is easy and intuitive to access. Sign up for an
account today to start using our lexical data to power your apps and
projects. Get started today and enter our developer competition.
http://sdm.link/oxford
_______________________________________________
Opensaf-tickets mailing list
Opensaf-tickets@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets