Hi Ritu,
If 'length' in non-extended SaNameT has value 256, then you can see asserts as
the one reported in the ticket.
Extended names feature does not support non-extended SaNameT with length of 256.
For safe use of value length bigger than 255, use osaf_extended_* functions.
Please check your test and confirm the statement above.
Thanks,
Zoran
-----Original Message-----
From: Ritu Raj [mailto:ritu-...@users.sf.net]
Sent: den 7 september 2016 07:57
To: [opensaf:tickets]
Subject: [opensaf:tickets] #2000 msg: Cluster reset happend due to msgd crashed
on both the controller
I attahced the bt and msgd trace file, below is the snippet of bt:
2 0x00007f44089ef197 in __osafassert_fail (__file=0x7f4408a41987
"osaf_extended_name.c", __line=139, __func=0x7f4408a419f0 <__FUNCTION__.2883>
"osaf_extended_name_length",
__assertion=0x7f4408a41960 "length < SA_MAX_UNEXTENDED_NAME_LENGTH") at
sysf_def.c:281
#3 0x00007f44089ead1e in osaf_extended_name_length (name=0x67a72c) at
osaf_extended_name.c:139
#4 0x00007f44089fe7ff in osaf_encode_sanamet (ub=0x7fff9f4f09d0,
name=0x67a72c) at hj_enc.c:403
#5 0x00007f44089eb275 in ncs_edp_sanamet (hdl=0x6654c0, edu_tkn=0x0,
ptr=0x67a72c, ptr_data_len=0x7fff9f4eee14, buf_env=0x7fff9f4f0130,
op=EDP_OP_TYPE_ENC, o_err=0x7fff9f4f0238) at saf_edu.c:62
#6 0x00007f44089f8ca1 in ncs_edu_run_edp (edu_hdl=0x6654c0, edu_tkn=0x0,
rule=0x7fff9f4ef190, edp=0x404f40 <ncs_edp_sanamet@plt>, ptr=0x67a72c,
dcnt=0x7fff9f4eee14, buf_env=0x7fff9f4f0130,
optype=EDP_OP_TYPE_ENC, o_err=0x7fff9f4f0238) at hj_edu.c:499
#7 0x00007f44089f99b2 in ncs_edu_prfm_enc_on_non_ptr (edu_hdl=0x6654c0,
edu_tkn=0x0, hdl_node=0x0, rule=0x7fff9f4ef190, ptr=0x67a72c,
ptr_data_len=0x7fff9f4ef364, buf_env=0x7fff9f4f0130, o_err=0x7fff9f4f0238)
at hj_edu.c:972
#8 0x00007f44089f9302 in ncs_edu_perform_exec_action_on_non_ptr
(edu_hdl=0x6654c0, edu_tkn=0x0, hdl_node=0x0, rule=0x7fff9f4ef190,
optype=EDP_OP_TYPE_ENC, ptr=0x67a72c, ptr_data_len=0x7fff9f4ef364,
buf_env=0x7fff9f4f0130, o_err=0x7fff9f4f0238) at hj_edu.c:805
#9 0x00007f44089f92a0 in ncs_edu_perform_exec_action (edu_hdl=0x6654c0,
edu_tkn=0x0, hdl_node=0x0, rule=0x7fff9f4ef190, optype=EDP_OP_TYPE_ENC,
ptr=0x67a72c, ptr_data_len=0x7fff9f4ef364,
buf_env=0x7fff9f4f0130, o_err=0x7fff9f4f0238) at hj_edu.c:780
#10 0x00007f44089f9041 in ncs_edu_exec_rule (edu_hdl=0x6654c0, edu_tkn=0x0,
hdl_node=0x0, rule=0x7fff9f4ef190, ptr=0x67a72c, ptr_data_len=0x7fff9f4ef364,
buf_env=0x7fff9f4f0130, optype=EDP_OP_TYPE_ENC,
o_err=0x7fff9f4f0238) at hj_edu.c:627
#11 0x00007f44089fa8db in ncs_edu_run_rules_for_enc (edu_hdl=0x6654c0,
edu_tkn=0x0, hdl_node=0x0, prog=0x7fff9f4ef150, ptr=0x67a72c,
ptr_data_len=0x7fff9f4ef364, buf_env=0x7fff9f4f0130, o_err=0x7fff9f4f0238,
instr_count=4) at hj_edu.c:1666
Attachments:
-
[bt_msgd.tar](https://sourceforge.net/p/opensaf/tickets/_discuss/thread/1ff9fd44/ec64/attachment/bt_msgd.tar)
(20.5 kB; application/x-tar)
-
[osafmsgd](https://sourceforge.net/p/opensaf/tickets/_discuss/thread/1ff9fd44/ec64/attachment/osafmsgd)
(280.6 kB; application/octet-stream)
---
** [tickets:#2000] msg: Cluster reset happend due to msgd crashed on both the
controller**
**Status:** unassigned
**Milestone:** 4.7.2
**Created:** Tue Sep 06, 2016 06:04 AM UTC by Ritu Raj **Last Updated:** Tue
Sep 06, 2016 06:04 AM UTC
**Owner:** nobody
**Attachments:**
-
[Active_syslog](https://sourceforge.net/p/opensaf/tickets/2000/attachment/Active_syslog)
(716.7 kB; application/octet-stream)
-
[Standby_syslog](https://sourceforge.net/p/opensaf/tickets/2000/attachment/Standby_syslog)
(696.4 kB; application/octet-stream)
Environment details
------------------
OS : Suse 64bit
Changeset : 7997 ( 5.1.FC)
Setup : 4 nodes ( 2 controllers and 2 payloads with headless feature disabled &
1PBE enabled with 30K objects )
Summary :
------------------
Cluster reset happend due to assertion SA_MAX_UNEXTENDED_NAME_LENGTH failed in
msgd
Steps followed & Observed behaviour
------------------
1. Invoked failover
2. After, few successful failover, New Active Controller rebooted beacuse of
Assertion 'length < SA_MAX_UNEXTENDED_NAME_LENGTH' failed in msgd. While
previous Active joinig the cluster as a Standby Role resulted cluster reset
happend.
[Timeline: Sep 6 00:13:02 sofo-s2]
Sep 6 00:13:02 sofo-s2 osafimmd[3985]: NO MDS event from svc_id 24 (change:5,
dest:13) Sep 6 00:13:02 sofo-s2 osafmsgd[4145]: osaf_extended_name.c:139:
osaf_extended_name_length: Assertion 'length < SA_MAX_UNEXTENDED_NAME_LENGTH'
failed.
Sep 6 00:13:02 sofo-s2 osafamfnd[4046]: NO
'safComp=MQD,safSu=SC-2,safSg=2N,safApp=OpenSAF' faulted due to 'avaDown' :
Recovery is 'nodeFailfast'
Sep 6 00:13:02 sofo-s2 osafamfnd[4046]: ER
safComp=MQD,safSu=SC-2,safSg=2N,safApp=OpenSAF Faulted due to:avaDown Recovery
is:nodeFailfast Sep 6 00:13:02 sofo-s2 osafamfnd[4046]: Rebooting OpenSAF
NodeId = 131599 EE Name = , Reason: Component faulted: recovery is node
failfast, OwnNodeId = 131599, SupervisionTime = 60 Sep 6 00:13:02 sofo-s2
opensaf_reboot: Rebooting local node; timeout=60
Notes:
1. Syslog attached
2 msgnd & msgd trace not enabled
---
Sent from sourceforge.net because you indicated interest in
<https://sourceforge.net/p/opensaf/tickets/2000/>
To unsubscribe from further messages, please visit
<https://sourceforge.net/auth/subscriptions/>
---
** [tickets:#2000] msg: Cluster reset happend due to msgd crashed on both the
controller**
**Status:** unassigned
**Milestone:** 4.7.2
**Created:** Tue Sep 06, 2016 06:04 AM UTC by Ritu Raj
**Last Updated:** Wed Sep 07, 2016 05:57 AM UTC
**Owner:** nobody
**Attachments:**
-
[Active_syslog](https://sourceforge.net/p/opensaf/tickets/2000/attachment/Active_syslog)
(716.7 kB; application/octet-stream)
-
[Standby_syslog](https://sourceforge.net/p/opensaf/tickets/2000/attachment/Standby_syslog)
(696.4 kB; application/octet-stream)
Environment details
------------------
OS : Suse 64bit
Changeset : 7997 ( 5.1.FC)
Setup : 4 nodes ( 2 controllers and 2 payloads with headless feature disabled &
1PBE enabled with 30K objects )
Summary :
------------------
Cluster reset happend due to assertion SA_MAX_UNEXTENDED_NAME_LENGTH failed in
msgd
Steps followed & Observed behaviour
------------------
1. Invoked failover
2. After, few successful failover, New Active Controller rebooted beacuse of
Assertion 'length < SA_MAX_UNEXTENDED_NAME_LENGTH' failed in msgd. While
previous Active joinig the cluster as a Standby Role resulted cluster reset
happend.
[Timeline: Sep 6 00:13:02 sofo-s2]
Sep 6 00:13:02 sofo-s2 osafimmd[3985]: NO MDS event from svc_id 24 (change:5,
dest:13)
Sep 6 00:13:02 sofo-s2 osafmsgd[4145]: osaf_extended_name.c:139:
osaf_extended_name_length: Assertion 'length < SA_MAX_UNEXTENDED_NAME_LENGTH'
failed.
Sep 6 00:13:02 sofo-s2 osafamfnd[4046]: NO
'safComp=MQD,safSu=SC-2,safSg=2N,safApp=OpenSAF' faulted due to 'avaDown' :
Recovery is 'nodeFailfast'
Sep 6 00:13:02 sofo-s2 osafamfnd[4046]: ER
safComp=MQD,safSu=SC-2,safSg=2N,safApp=OpenSAF Faulted due to:avaDown Recovery
is:nodeFailfast
Sep 6 00:13:02 sofo-s2 osafamfnd[4046]: Rebooting OpenSAF NodeId = 131599 EE
Name = , Reason: Component faulted: recovery is node failfast, OwnNodeId =
131599, SupervisionTime = 60
Sep 6 00:13:02 sofo-s2 opensaf_reboot: Rebooting local node; timeout=60
Notes:
1. Syslog attached
2 msgnd & msgd trace not enabled
---
Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is
subscribed to https://sourceforge.net/p/opensaf/tickets/
To unsubscribe from further messages, a project admin can change settings at
https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a
mailing list, you can unsubscribe from the mailing list.
------------------------------------------------------------------------------
_______________________________________________
Opensaf-tickets mailing list
Opensaf-tickets@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets