[tickets] [opensaf:tickets] #1486 smf : SMFD asserted in csi active callback during switchovers ( ncs_sel_obj_create: socketpair failed )

2016-09-16 Thread Srikanth R
- **summary**: SMFD faulted in active callback during switchovers --> smf : 
SMFD asserted in  csi active callback during switchovers ( ncs_sel_obj_create: 
socketpair failed )
- **Component**: unknown --> smf



---

** [tickets:#1486] smf : SMFD asserted in  csi active callback during 
switchovers ( ncs_sel_obj_create: socketpair failed )**

**Status:** unassigned
**Milestone:** 4.7.2
**Created:** Wed Sep 16, 2015 10:04 AM UTC by Ritu Raj
**Last Updated:** Wed May 04, 2016 07:27 PM UTC
**Owner:** nobody


Setup
4.6GA with changeset 6490
4 nodes (OEL 6.4 with TIPC version 1.7.7), no PBE configured

Issues Observed:
> The cluster rebooted during a switchover because SMFD faulted with 
'csiSetcallbackFailed'.

Steps Performed:

 * Continuous switchovers were invoked on the setup.
 * After more than 1000 switchovers, the standby controller (SC-2) rebooted 
while being promoted to ACTIVE, because SMFD failed in the active callback.

Sep 16 06:25:00 SLOT-2 osafsmfd[1926]: ER amf_active_state_handler oi activate 
FAIL
Sep 16 06:25:00 SLOT-2 osafamfnd[1802]: NO 
'safComp=SMF,safSu=SC-2,safSg=2N,safApp=OpenSAF' faulted due to 
'csiSetcallbackFailed' : Recovery is 'nodeFailfast'
Sep 16 06:25:00 SLOT-2 osafamfnd[1802]: ER 
safComp=SMF,safSu=SC-2,safSg=2N,safApp=OpenSAF Faulted due 
to:csiSetcallbackFailed Recovery is:nodeFailfast
Sep 16 06:25:00 SLOT-2 osafamfnd[1802]: Rebooting OpenSAF NodeId = 131599 EE 
Name = , Reason: Component faulted: recovery is node failfast, OwnNodeId = 
131599, SupervisionTime = 60


* After SC-2 rebooted, SC-1 tried to become active again, and smfd also 
faulted on this newly re-promoted controller.

Sep 16 06:25:00 SLOT-1 root: Invoking switchover from invoke_switchover.sh
Sep 16 06:25:00 SLOT-1 osafamfd[3830]: NO safSi=SC-2N,safApp=OpenSAF Swap 
initiated
Sep 16 06:25:00 SLOT-1 osafamfnd[3845]: NO Assigning 
'safSi=SC-2N,safApp=OpenSAF' QUIESCED to 'safSu=SC-1,safSg=2N,safApp=OpenSAF'
Sep 16 06:25:00 SLOT-1 osafsmfd[3871]: ncs_sel_obj_create: socketpair failed - 
Too many open files

Sep 16 06:25:05 SLOT-1 kernel: TIPC: Resetting link <1.1.1:eth0-1.1.2:eth1>, 
peer not responding
Sep 16 06:25:05 SLOT-1 kernel: TIPC: Lost link <1.1.1:eth0-1.1.2:eth1> on 
network plane A
Sep 16 06:25:05 SLOT-1 kernel: TIPC: Lost contact with <1.1.2>
Sep 16 06:25:05 SLOT-1 osaffmd[3716]: NO Node Down event for node id 2020f:

Sep 16 06:25:06 SLOT-1 osafimmnd[3746]: NO This IMMND re-elected coord 
redundantly, failover ?
Sep 16 06:25:06 SLOT-1 osafsmfd[3871]: ncs_sel_obj_create: socketpair failed - 
Too many open files
Sep 16 06:25:06 SLOT-1 osafsmfd[3871]: ER immutil_saImmOiInitialize_2 fail, rc 
= 2
...
Sep 16 06:25:06 SLOT-1 osafamfnd[3845]: ER 
safComp=SMF,safSu=SC-1,safSg=2N,safApp=OpenSAF Faulted due 
to:csiSetcallbackFailed Recovery is:nodeFailfast
Sep 16 06:25:06 SLOT-1 osafamfnd[3845]: Rebooting OpenSAF NodeId = 131343 EE 
Name = , Reason: Component faulted: recovery is node failfast, OwnNodeId = 
131343, SupervisionTime = 60
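
The osafamfnd lines above print `NodeId` in decimal, while the osaffmd "Node Down" line prints the node id in hexadecimal. A minimal Python sketch for cross-referencing the two; the helper name `node_id_hex` is illustrative and not part of OpenSAF:

```python
def node_id_hex(node_id: int) -> str:
    """Format a decimal node id as lowercase hex without a prefix,
    matching the style of the 'Node Down event for node id 2020f' line."""
    return format(node_id, "x")


# The two NodeId values taken from the reboot messages above.
for decimal_id in (131343, 131599):
    print(decimal_id, "->", node_id_hex(decimal_id))
# prints:
# 131343 -> 2010f
# 131599 -> 2020f
```

So the `2020f` in the Node Down event corresponds to NodeId 131599, the node that logged its own reboot on SLOT-2.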



---

Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is 
subscribed to https://sourceforge.net/p/opensaf/tickets/

To unsubscribe from further messages, a project admin can change settings at 
https://sourceforge.net/p/opensaf/admin/tickets/options.  Or, if this is a 
mailing list, you can unsubscribe from the mailing list.--
___
Opensaf-tickets mailing list
Opensaf-tickets@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets


[tickets] [opensaf:tickets] #1486 smf : SMFD asserted in csi active callback during switchovers ( ncs_sel_obj_create: socketpair failed )

2016-09-16 Thread Anders Widell
Sounds like SMFD or a library linked into it is leaking file descriptors...
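
One way to check the leak hypothesis would be to sample the number of open descriptors of the osafsmfd process between switchovers. The sketch below is only an illustration under stated assumptions: it relies on the Linux `/proc/<pid>/fd` directory, the helper names are hypothetical, and it inspects its own process as a stand-in for the osafsmfd pid.

```python
import os
import resource


def count_open_fds(pid: int) -> int:
    """Count the entries in /proc/<pid>/fd (Linux only).

    Reading another user's process requires sufficient privileges.
    """
    return len(os.listdir(f"/proc/{pid}/fd"))


def soft_fd_limit() -> int:
    """Soft RLIMIT_NOFILE of the current process, for comparison only.

    The limit of osafsmfd itself may differ from this value.
    """
    soft, _hard = resource.getrlimit(resource.RLIMIT_NOFILE)
    return soft


# Stand-in pid: replace with the osafsmfd pid when sampling a real node.
pid = os.getpid()
print(f"pid {pid}: {count_open_fds(pid)} open fds, soft limit {soft_fd_limit()}")
```

If the count sampled after each switchover keeps growing, that would be consistent with the "Too many open files" failure appearing only after many iterations.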


---

** [tickets:#1486] smf : SMFD asserted in  csi active callback during 
switchovers ( ncs_sel_obj_create: socketpair failed )**

**Status:** unassigned
**Milestone:** 4.7.2
**Created:** Wed Sep 16, 2015 10:04 AM UTC by Ritu Raj
**Last Updated:** Fri Sep 16, 2016 08:31 AM UTC
**Owner:** nobody




[tickets] [opensaf:tickets] #1864 log: get ER syslog when running logtest 5 2

2016-09-16 Thread Canh Truong
- **status**: accepted --> review



---

** [tickets:#1864] log: get ER syslog when running logtest 5 2**

**Status:** review
**Milestone:** 4.7.2
**Created:** Wed Jun 08, 2016 02:32 AM UTC by Vu Minh Nguyen
**Last Updated:** Wed Jun 22, 2016 11:18 AM UTC
**Owner:** Canh Truong


When running `logtest 5 2`, the following ER messages sometimes appear in syslog:
> 2016-04-27 02:39:57 SC-1 osaflogd[462]: ER Old log files could not be renamed 
> and closed for stream: safLgStrCfg=saLogNotification,safApp=safLogService
> 2016-04-27 02:39:57 SC-1 osaflogd[462]: ER Old log files could not be renamed 
> and closed for stream: safLgStrCfg=saLogSystem,safApp=safLogService

The problem is in the logtest app, test case #2 of suite #5.

`logtest 5 2` performs the following steps:
1) Create the `xxtest` directory 
2) Change the default `logRootDirectory` to that directory
3) Change `logRootDirectory` back to the default value
4) **Remove all log files in the `xxtest` directory**

Steps #2 and #3 each rename the open log files in the `IMM apply callback` by 
appending the close time to the log file names. The problem occurs when step #4 
runs before the LOG service has completed the `IMM apply callback` triggered by 
step #2 or #3.

The problem can be reproduced by putting logsv under high load while running 
`logtest 5 2`:
1) Open 2 terminals
2) On terminal #1, use the `saflogger` tool in a loop to send log records to the 
alarm stream, while running `logtest 5 2` in a loop on the other terminal.
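
The ordering problem described above can be illustrated without OpenSAF at all. The following Python sketch is a simplified model, not the logtest or logsv code: the file name, the close-time suffix and both function names are made up for illustration. It shows that a rename-on-close fails once the cleanup step has already removed the file.

```python
import os
import tempfile


def rename_with_close_time(path: str, close_time: str) -> bool:
    """Append a close-time suffix to a log file name.

    Returns False if the file has already disappeared.
    """
    try:
        os.rename(path, f"{path}_{close_time}")
        return True
    except FileNotFoundError:
        return False


def simulate(cleanup_first: bool) -> bool:
    """Model step 4 (cleanup) running before or after the rename."""
    with tempfile.TemporaryDirectory() as root:
        test_dir = os.path.join(root, "xxtest")
        os.mkdir(test_dir)
        log_file = os.path.join(test_dir, "example_stream.log")  # hypothetical name
        open(log_file, "w").close()
        if cleanup_first:
            # The test removes the files before the rename has happened.
            for name in os.listdir(test_dir):
                os.remove(os.path.join(test_dir, name))
        return rename_with_close_time(log_file, "20160427_023957")


print(simulate(cleanup_first=False))  # True: rename succeeds
print(simulate(cleanup_first=True))   # False: file was removed too early
```

In this model the fix belongs on the test side: the cleanup has to wait until the rename has completed.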




[tickets] [opensaf:tickets] #2034 imm: IMMsv README changes fro 5.1

2016-09-16 Thread Neelakanta Reddy
changeset:   8090:628fcbb0c110
branch:      opensaf-5.1.x
parent:      8088:aa40d984dee1
user:        Neelakanta Reddy
date:        Fri Sep 16 16:00:53 2016 +0530
summary:     imm: corrected typos in README [#2034]

changeset:   8091:95d8784d1d0c
tag:         tip
parent:      8089:4cb5345507df
user:        Neelakanta Reddy
date:        Fri Sep 16 16:00:53 2016 +0530
summary:     imm: corrected typos in README [#2034]



---

** [tickets:#2034] imm: IMMsv README changes fro 5.1**

**Status:** fixed
**Milestone:** 5.1.RC2
**Created:** Wed Sep 14, 2016 08:35 AM UTC by Neelakanta Reddy
**Last Updated:** Thu Sep 15, 2016 07:02 AM UTC
**Owner:** Neelakanta Reddy


This ticket is to update the IMM README for the 5.1 IMM enhancements.




[tickets] [opensaf:tickets] #2040 clmd seg faulted on active controller during switchover

2016-09-16 Thread Ritu Raj



---

** [tickets:#2040] clmd seg faulted on active controller during switchover**

**Status:** unassigned
**Milestone:** 4.7.2
**Created:** Fri Sep 16, 2016 11:30 AM UTC by Ritu Raj
**Last Updated:** Fri Sep 16, 2016 11:30 AM UTC
**Owner:** nobody
**Attachments:**

- [SC-1.tar.bz2](https://sourceforge.net/p/opensaf/tickets/2040/attachment/SC-1.tar.bz2) (3.6 MB; application/x-bzip)
- [clmd_bt](https://sourceforge.net/p/opensaf/tickets/2040/attachment/clmd_bt) (3.2 kB; application/octet-stream)


# Environment details
OS : Suse 64bit
Changeset : 7997 ( 5.1.FC)
Setup : 4 nodes ( 2 controllers and 2 payloads with headless feature disabled & 
 1PBE with 30K objects)


# Summary
clmd seg faulted on the active controller during a controller switchover.

# Steps followed & Observed behaviour
1. Invoked a controller switchover (SC-1 is the Active).
2. During the role change, clmd crashed on SC-1 and the node rebooted because 
'safComp=CLM,safSu=SC-1,safSg=2N,safApp=OpenSAF' faulted due to 'avaDown'.
3. After the active controller rebooted, NTFD crashed on the standby controller 
and a cluster reset happened. A ticket is already raised for the NTFD crash: 
https://sourceforge.net/p/opensaf/tickets/1999/

*Syslog :

Sep 16 15:33:30 sofo-s1 osafamfnd[2162]: NO 
'safComp=CLM,safSu=SC-1,safSg=2N,safApp=OpenSAF' faulted due to 'avaDown' : 
Recovery is 'nodeFailfast'
Sep 16 15:33:30 sofo-s1 osafamfnd[2162]: ER 
safComp=CLM,safSu=SC-1,safSg=2N,safApp=OpenSAF Faulted due to:avaDown Recovery 
is:nodeFailfast
Sep 16 15:33:30 sofo-s1 osafamfnd[2162]: Rebooting OpenSAF NodeId = 131343 EE 
Name = , Reason: Component faulted: recovery is node failfast, OwnNodeId = 
131343, SupervisionTime = 60


*Below is the bt:

0  0x7f3db18e1b55 in raise () from /lib64/libc.so.6
1  0x7f3db18e3131 in abort () from /lib64/libc.so.6
2  0x7f3db191ec2f in __libc_message () from /lib64/libc.so.6
3  0x7f3db1924358 in malloc_printerr () from /lib64/libc.so.6
4  0x7f3db19292fc in free () from /lib64/libc.so.6
5  0x7f3db223db52 in timer_delete@@GLIBC_2.3.3 () from /lib64/librt.so.1
6  0x004055b3 in amf_quiesced_state_handler (cb=0x633820 <_clms_cb>, 
invocation=4288675847) at clms_amf.c:123
7  0x00405795 in clms_amf_csi_set_callback (invocation=4288675847, 
compName=0x6bac88, new_haState=SA_AMF_HA_QUIESCED, csiDescriptor=...) at 
clms_amf.c:223
8  0x7f3db332e1f1 in ava_hdl_cbk_rec_prc (info=0x6bac70, 
reg_cbk=0x7fff3e5fafe0) at ava_hdl.cc:645
9  0x7f3db332d896 in ava_hdl_cbk_dispatch_all (cb=0x7fff3e5fb0b0, 
hdl_rec=0x7fff3e5fb0b8) at ava_hdl.cc:446
10 0x7f3db332d376 in ava_hdl_cbk_dispatch (cb=0x7fff3e5fb0b0, 
hdl_rec=0x7fff3e5fb0b8, flags=SA_DISPATCH_ALL) at ava_hdl.cc:320
11 0x7f3db3325a49 in AmfAgent::Dispatch (hdl=4285530114, 
flags=SA_DISPATCH_ALL) at amf_agent.cc:283
12 0x7f3db332588e in saAmfDispatch (hdl=4285530114, flags=SA_DISPATCH_ALL) 
at amf_agent.cc:244
13 0x00413966 in main (argc=2, argv=0x7fff3e5fb208) at clms_main.c:515


*Notes:
1. The issue occurs randomly.
2. Syslog, clmd trace and bt file are attached.




[tickets] [opensaf:tickets] #2041 Msg: saMsgInitialize is returning continuous TRY_AGAINS after mqsv ndrestarts in backward compatability.

2016-09-16 Thread Madhurika Koppula



---

** [tickets:#2041] Msg: saMsgInitialize is returning continuous TRY_AGAINS 
after mqsv ndrestarts in backward compatability.**

**Status:** unassigned
**Milestone:** 4.7.2
**Created:** Fri Sep 16, 2016 12:13 PM UTC by Madhurika Koppula
**Last Updated:** Fri Sep 16, 2016 12:13 PM UTC
**Owner:** nobody
**Attachments:**

- [messages-20160921.bz2](https://sourceforge.net/p/opensaf/tickets/2041/attachment/messages-20160921.bz2) (240.9 kB; application/octet-stream)


**Environment Details:**
OS : Suse 64bit
Setup : 4 nodes ( 2 controllers and 2 payloads with headless feature disabled & 
1PBE enabled ).

Backward compatibility:
OpenSAF versions on nodes:
SC-1 (5.0), SC-2 (5.1 FC), PL-3 (5.0), PL-4 (5.1 FC).

**Summary:** saMsgInitialize keeps returning TRY_AGAIN after 
mqnd_imm_initialize failed with ERR_TIMEOUT.

**Steps followed & Observed behaviour:**

The mqsv test application was run while mqnd was killed continuously.

Observations:

saMsgInitialize failed with continuous TRY_AGAIN. Below is a snapshot of the 
test output.

100|0| Version : B.3.1
100|0| RETRY   : saMsgInitialize with all valid parameters
100|0| Return Value: SA_AIS_ERR_TRY_AGAIN
100|0|
100|0|
100|0| Version : B.3.1
100|0| Version : B.3.1
100|0| Version : B.3.1
100|0| Version : B.3.1
100|0| Version : B.3.1
100|0| Version : B.3.1
100|0| Version : B.3.1
100|0| Version : B.3.1
100|0| Version : B.3.1 Retry Count : 10
100|0|
100|0| Version : B.3.1
100|0| Version : B.3.1
100|0| Version : B.3.1
100|0| Version : B.3.1
100|0| Version : B.3.1
100|0| Version : B.3.1
100|0| Version : B.3.1
100|0| Version : B.3.1
100|0| Version : B.3.1
100|0| Version : B.3.1 Retry Count : 20
100|0|
100|0| Version : B.3.1
100|0| Version  Sun Sep 18 11:51:19 IST 2016
100|0|Sun Sep 18 11:51:19 IST 2016
100|0|Sun Sep 18 11:51:59 IST 2016
100|0|Sun Sep 18 11:51:59 IST 2016
100|0|Sun Sep 18 11:52:39 IST 2016
100|0|Sun Sep 18 11:52:39 IST 2016

100|0| Version : B.3.1
100|0| Version : B.3.1
100|0| Version : B.3.1
100|0| Version : B.3.1
100|0| Version : B.3.1
100|0| Version : B.3.1
100|0| Version : B.3.1
100|0| Version : B.3.1 Retry Count : 30
100|0|
100|0| Version : B.3.1
100|0| Version : B.3.1
100|0| Version : B.3.1
100|0| Version : B.3.1
100|0| Version : B.3.1
100|0| Version : B.3.1
100|0| Version : B.3.1
100|0| Version : B.3.1
100|0| Version : B.3.1
100|0| Version : B.3.1 Retry Count : 40
100|0| Try again count exceeded* TEST CASE FAILED 
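
The output above reflects a bounded retry on `SA_AIS_ERR_TRY_AGAIN` that gives up after 40 attempts. The test application's actual loop is not shown in this ticket, so the following Python sketch is only an assumed shape of such a loop. `call_with_retry` and the fake operations are hypothetical, and the numeric values are the `SaAisErrorT` codes as I understand them from `saAis.h`.

```python
import time

SA_AIS_OK = 1
SA_AIS_ERR_TRY_AGAIN = 6


def call_with_retry(operation, max_retries=40, delay_s=0.0):
    """Call operation() until it stops returning TRY_AGAIN or retries run out.

    Returns a tuple (last_return_code, retries_used).
    """
    rc = operation()
    retries = 0
    while rc == SA_AIS_ERR_TRY_AGAIN and retries < max_retries:
        time.sleep(delay_s)
        retries += 1
        rc = operation()
    return rc, retries


def always_try_again():
    """Stand-in for an initialize call whose service never recovers."""
    return SA_AIS_ERR_TRY_AGAIN


# Stand-in for a service that recovers on the third call.
responses = iter([SA_AIS_ERR_TRY_AGAIN, SA_AIS_ERR_TRY_AGAIN, SA_AIS_OK])

print(call_with_retry(always_try_again))         # (6, 40)
print(call_with_retry(lambda: next(responses)))  # (1, 2)
```

A retry budget like this only helps when the service eventually recovers. In the failing run the return code never changes, so the budget is exhausted.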

Below is a syslog snippet from SC-1:


Sep 18 11:48:32 SCALE_SLOT-41 osafimmnd[19813]: NO Implementer (applier) 
connected: 2462 (@OpenSafImmReplicatorA) <20504, 2010f>
Sep 18 11:48:32 SCALE_SLOT-41 osafntfimcnd[19819]: NO Started
Sep 18 11:48:39 SCALE_SLOT-41 osafamfd[1816]: NO Re-initializing with IMM
Sep 18 11:48:39 SCALE_SLOT-41 osafimmnd[19813]: NO Implementer connected: 2463 
(safAmfService) <20506, 2010f>
Sep 18 11:48:39 SCALE_SLOT-41 osafamfd[1816]: NO Finished re-initializing with 
IMM

**Sep 18 11:48:39 SCALE_SLOT-41 osafmsgnd[19792]: ER mqnd_imm_initialize 
Failed: 5**

Sep 18 11:48:39 SCALE_SLOT-41 osafamfnd[1826]: 
'safComp=MQND,safSu=SC-1,safSg=NoRed,safApp=OpenSAF'unregistered
Sep 18 11:48:39 SCALE_SLOT-41 osafmsgnd[19792]: CR Destroying the shared memory 
segment failed
Sep 18 11:48:39 SCALE_SLOT-41 osafmsgnd[19792]: ER saAmfComponentUnregister 
Failed with error 9
Sep 18 11:48:39 SCALE_SLOT-41 osafmsgnd[19792]: ER Cb is NULL
Sep 18 11:48:49 SCALE_SLOT-41 osafimmnd[19813]: NO Implementer connected: 2464 
(MsgQueueService131343) <20507, 2010f>
Sep 18 11:48:49 SCALE_SLOT-41 osafimmnd[19813]: NO Implementer locally 
disconnected. Marking it as doomed 2464 <20507, 2010f> (MsgQueueService131343)
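
The numeric codes in the syslog lines above are easier to read as names. The sketch below assumes the logged numbers are `SaAisErrorT` values and lists only the first nine codes as I recall them from `saAis.h`; check them against the header in your tree. The helper name `ais_error_name` is hypothetical.

```python
# First nine SaAisErrorT values (assumed from saAis.h; verify locally).
SA_AIS_ERROR_NAMES = {
    1: "SA_AIS_OK",
    2: "SA_AIS_ERR_LIBRARY",
    3: "SA_AIS_ERR_VERSION",
    4: "SA_AIS_ERR_INIT",
    5: "SA_AIS_ERR_TIMEOUT",
    6: "SA_AIS_ERR_TRY_AGAIN",
    7: "SA_AIS_ERR_INVALID_PARAM",
    8: "SA_AIS_ERR_NO_MEMORY",
    9: "SA_AIS_ERR_BAD_HANDLE",
}


def ais_error_name(rc: int) -> str:
    """Translate a numeric return code into its symbolic name, if known."""
    return SA_AIS_ERROR_NAMES.get(rc, f"unknown ({rc})")


print(ais_error_name(5))  # SA_AIS_ERR_TIMEOUT
print(ais_error_name(9))  # SA_AIS_ERR_BAD_HANDLE
```

Under that assumption, `mqnd_imm_initialize Failed: 5` matches the ERR_TIMEOUT named in the summary, and the `saAmfComponentUnregister` failure with error 9 would be a bad handle.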

Attachments:
1)Syslog of SC-1.





[tickets] [opensaf:tickets] #2042 EVT : Application segfaulted during

2016-09-16 Thread Srikanth R



---

** [tickets:#2042] EVT : Application segfaulted during **

**Status:** unassigned
**Milestone:** 4.7.2
**Created:** Fri Sep 16, 2016 12:22 PM UTC by Srikanth R
**Last Updated:** Fri Sep 16, 2016 12:22 PM UTC
**Owner:** nobody
**Attachments:**

- [eda_bt](https://sourceforge.net/p/opensaf/tickets/2042/attachment/eda_bt) 
(59.5 kB; application/octet-stream)


Setup : 7997 5.1.FC 

Issue :
 The application segfaulted on a payload in MDS callback processing by the EVT thread.
 Below is the backtrace.
 
 0  0x7ff2f5282d64 in ncs_decode_32bit (stream=0x7ff2f6b95c98) at 
hj_dec.c:197
1  0x7ff2f5f181e4 in eda_mds_dec (info=0x7ff2f6b95dd0) at eda_mds.c:1285
2  0x7ff2f5f185fa in eda_mds_callback (info=0x7ff2f6b95dd0) at 
eda_mds.c:1440
3  0x7ff2f52b887b in mds_mcm_do_decode_full_or_flat (svccb=0x639c40, 
cbinfo=0x7ff2f6b95dd0, recv_msg=0x7aace8, orig_msg=0x0) at mds_c_sndrcv.c:4915
4  0x7ff2f52b7841 in mds_mcm_process_recv_snd_msg_common (svccb=0x639c40, 
recv=0x7aace8) at mds_c_sndrcv.c:4255
5  0x7ff2f52b7f24 in mcm_recv_normal_snd (svccb=0x639c40, recv=0x7aace8) at 
mds_c_sndrcv.c:4389
6  0x7ff2f52b7305 in mds_mcm_ll_data_rcv (recv=0x7aace8) at 
mds_c_sndrcv.c:4067
7  0x7ff2f52a54ac in mdtm_process_recv_message_common (flag=0 '\000', 
buffer=0x61424a "\252", len=167, transport_adest=72075191086465088, 
seq_num_check=30108, buff_dump=0x7ff2f6b961bc) at mds_dt_common.c:505
8  0x7ff2f52a626f in mdtm_process_recv_data (buffer=0x614242 "", len=175, 
transport_adest=72075191086465088, buff_dump=0x7ff2f6b961bc) at 
mds_dt_common.c:949
9  0x7ff2f52c952f in mdtm_process_recv_events () at mds_dt_tipc.c:793
10 0x7ff2f586c7b6 in start_thread () from /lib64/libpthread.so.0
11 0x7ff2f55c89cd in clone () from /lib64/libc.so.6

The full backtrace is attached. This issue has also been observed in earlier 
releases.





[tickets] [opensaf:tickets] #2042 EVT : Application segfaulted in MDS callback processing

2016-09-16 Thread Srikanth R
- **summary**: EVT : Application segfaulted during  --> EVT : Application 
segfaulted in MDS callback processing


