[tickets] [opensaf:tickets] #2508 ntfd: coredump due to failure of pthread_mutex_destroy

2017-07-05 Thread Minh Hon Chau via Opensaf-tickets
- **status**: review --> fixed
- **assigned_to**: Minh Hon Chau -->  nobody 
- **Comment**:

commits
release:[645a7fca7576e437782da96197b56096e1afbd8c]
develop:[cb04247b15858eadfeeb5c385b7544d0049b9e67]



---

** [tickets:#2508] ntfd: coredump due to failure of pthread_mutex_destroy**

**Status:** fixed
**Milestone:** 5.17.08
**Created:** Wed Jun 21, 2017 06:12 AM UTC by Minh Hon Chau
**Last Updated:** Wed Jul 05, 2017 09:23 AM UTC
**Owner:** nobody


syslog:

2017-05-23 18:05:36 SC-3 systemd[1]: Stopping OpenSAF daemon...
2017-05-23 18:05:36 SC-3 opensafd: Stopping OpenSAF Services
2017-05-23 18:05:36 SC-3 osafamfnd[255]: NO Shutdown initiated
2017-05-23 18:05:36 SC-3 osafamfnd[255]: NO Terminating all AMF components
2017-05-23 18:05:36 SC-3 osafclmna[166]: exiting for shutdown
2017-05-23 18:05:36 SC-3 osafclmd[235]: exiting for shutdown
2017-05-23 18:05:36 SC-3 osafckptd[381]: exiting for shutdown
2017-05-23 18:05:36 SC-3 osafimmd[194]: exiting for shutdown
2017-05-23 18:05:36 SC-3 osaffmd[184]: exiting for shutdown
2017-05-23 18:05:36 SC-3 osafamfwd[335]: exiting for shutdown
2017-05-23 18:05:36 SC-3 osafckptnd[314]: exiting for shutdown
2017-05-23 18:05:36 SC-3 osafsmfnd[273]: exiting for shutdown
2017-05-23 18:05:36 SC-3 osaflogd[215]: exiting for shutdown
2017-05-23 18:05:36 SC-3 osafimmpbed: NO IMM PBE received SIG_TERM, closing db 
handle
2017-05-23 18:05:36 SC-3 osafimmpbed: IN IMM PBE process EXITING...
2017-05-23 18:05:36 SC-3 osafimmnd[205]: NO Implementer locally disconnected. 
Marking it as doomed 25 <303, 2030f> (OpenSafImmPBE)
2017-05-23 18:05:36 SC-3 osafrded[175]: exiting for shutdown
2017-05-23 18:05:36 SC-3 osafimmnd[205]: NO Implementer disconnected 25 <303, 
2030f> (OpenSafImmPBE)
2017-05-23 18:05:36 SC-3 osafimmnd[205]: exiting for shutdown
2017-05-23 18:05:36 SC-3 osafsmfd[271]: exiting for shutdown
2017-05-23 18:05:36 SC-3 osafntfimcnd[473]: NO saImmOiDispatch() Fail 
SA_AIS_ERR_BAD_HANDLE (9)
2017-05-23 18:05:36 SC-3 osafamfd[245]: NO Re-initializing with IMM

2017-05-23 18:05:36 SC-3 osafntfd[225]: osaf_abort(16) called from 
0x5611b80d2a69 with errno=4


backtrace:

~~~
[New LWP 225]
[New LWP 227]
[New LWP 228]
[New LWP 229]
[Thread debugging using libthread_db enabled]
Using host libthread_db library "/lib/x86_64-linux-gnu/libthread_db.so.1".
Core was generated by `/usr/local/lib/opensaf/osafntfd'.
Program terminated with signal SIGABRT, Aborted.
#0  0x7f8fd1780428 in __GI_raise (sig=sig@entry=6) at 
../sysdeps/unix/sysv/linux/raise.c:54
[Current thread is 1 (Thread 0x7f8fd32cc740 (LWP 225))]

Thread 4 (Thread 0x7f8fd3289b00 (LWP 229)):
#0  __clock_nanosleep (clock_id=, flags=1, req=0x7f8fd3289240, 
rem=0x0) at ../sysdeps/unix/sysv/linux/clock_nanosleep.c:48
oldstate = 0
r = 
rem = 0x0
req = 0x7f8fd3289240
flags = 1
clock_id = 
#1  0x7f8fd233157d in osaf_nanosleep (sleep_duration=0x7f8fd23b9c10 
<_ZN4baseL10kOneSecondE>) at src/base/osaf_time.c:39
wakeup_time = {tv_sec = 19013, tv_nsec = 484354498}
retval = 32655
#2  0x7f8fd23a0336 in base::Sleep (duration=...) at ./src/base/time.h:135
No locals.
#3  0x7f8fd23a043e in rda_callback_task (rda_callback_cb=0x5611b8fa61b0) at 
src/rde/agent/rda_papi.cc:117
msg = "10 2", '\000' 
rc = PCSRDA_RC_FATAL_IPC_CONNECTION_LOST
value = 2
retry_count = 0
conn_lost = true
cmd_type = RDE_RDA_HA_ROLE
cb_info = {cb_type = PCS_RDA_ROLE_CHG_IND, info = {io_role = 
PCS_RDA_STANDBY}}
#4  0x7f8fd1b1b6ba in start_thread (arg=0x7f8fd3289b00) at 
pthread_create.c:333
__res = 
pd = 0x7f8fd3289b00
now = 
unwind_buf = {cancel_jmp_buf = {{jmp_buf = {140255699704576, 
6214217193111308760, 1, 140732091062047, 140255699705280, 0, 
-6259172466962839080, -6259168954657519144}, mask_was_saved = 0}}, priv = {pad 
= {0x0, 0x0, 0x0, 0x0}, data = {prev = 0x0, cleanup = 0x0, canceltype = 0}}}
not_first_call = 
pagesize_m1 = 
sp = 
freesize = 
__PRETTY_FUNCTION__ = "start_thread"
#5  0x7f8fd185182d in clone () at 
../sysdeps/unix/sysv/linux/x86_64/clone.S:109
No locals.

Thread 3 (Thread 0x7f8fd32a9b00 (LWP 228)):
#0  0x7f8fd1845b5d in poll () at ../sysdeps/unix/syscall-template.S:84
No locals.
#1  0x7f8fd2388473 in mdtm_process_recv_events_tcp () at 
src/mds/mds_dt_trans.c:986
pollres = 1
#2  0x7f8fd1b1b6ba in start_thread (arg=0x7f8fd32a9b00) at 
pthread_create.c:333
__res = 
pd = 0x7f8fd32a9b00
now = 
unwind_buf = {cancel_jmp_buf = {{jmp_buf = {140255699835648, 
6214217193111308760, 1, 140732091060655, 140255699836352, 0, 
-6259172484142708264, -6259168954657519144}, mask_was_saved = 0}}, priv = {pad 
= {0x0, 0x0, 0x0, 0x0}, data = {prev = 0x0, cleanup = 0x0, canceltype = 0}}}
not_first_call = 
pagesize_m1 = 
sp = 
freesize = 
__PRETTY_FUNCTION__ = "start_thread"

[tickets] [opensaf:tickets] #2520 clm: make CLM tests more independent from other CLM tests

2017-07-05 Thread Zoran Milinkovic via Opensaf-tickets
- **Comment**:

https://sourceforge.net/p/opensaf/mailman/message/35929376/



---

** [tickets:#2520] clm: make CLM tests more independent from other CLM tests**

**Status:** review
**Milestone:** 5.17.08
**Created:** Tue Jul 04, 2017 01:50 PM UTC by Zoran Milinkovic
**Last Updated:** Wed Jul 05, 2017 02:53 PM UTC
**Owner:** Zoran Milinkovic


Today's CLM tests depend on other tests, request them to revert CLM state to 
the starting state.
If CLM tests are done manually, then they must be done in an order. Also, some 
tests cannot be called twice.

For example test 7 21:
~~~
$ clmtest 7 21

Suite 7: Test case for saClmClusterTrack. ** For all tests to pass, Run a 
payload with node_name PL-3 **

waiting on poll
Inside TrackCallback4
invocation : 0
Step : 4
error = 1
numberOfMembers = 2
No of items = 1

Value of i = 0
Cluster Change = 3
Node Name length = 36, value = safNode=PL-3,safCluster=myClmCluster
Node Member = 0
Node  view number  = 5
Node  eename length = 0,value  = 
Node  boottimestamp  = 1499175648545988548
Node  nodeAddress family  = 1,node address length = 0, node address value = 
Node  nodeid  = 131855

   21  PASSED   saClmClusterTrack_4 with SA_TRACK_CHANGES_ONLY track flags - 
admin lock

=

   Test Result:
  Total:  1
  Passed: 1
  Failed: 0
~~~

And if we repeat the test, it fails:

~~~
$ clmtest 7 21

Suite 7: Test case for saClmClusterTrack. ** For all tests to pass, Run a 
payload with node_name PL-3 **

error - saImmOmAdminOperationInvoke_2 admin-op RETURNED: SA_AIS_ERR_NO_OP (28)
clmtest: src/clm/apitest/tet_saClmClusterTrack.c:604: saClmClusterTrack_21: 
Assertion `ret == 1' failed.
waiting on pollAborted
~~~

Or if we call another test after executing test 7 21:

~~~
$ clmtest 7 23

Suite 7: Test case for saClmClusterTrack. ** For all tests to pass, Run a 
payload with node_name PL-3 **

error - saImmOmAdminOperationInvoke_2 admin-op RETURNED: SA_AIS_ERR_NO_OP (28)
clmtest: src/clm/apitest/tet_saClmClusterTrack.c:672: saClmClusterTrack_23: 
Assertion `ret == 1' failed.
Aborted
~~~



---

Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is 
subscribed to https://sourceforge.net/p/opensaf/tickets/

To unsubscribe from further messages, a project admin can change settings at 
https://sourceforge.net/p/opensaf/admin/tickets/options.  Or, if this is a 
mailing list, you can unsubscribe from the mailing list.--
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot___
Opensaf-tickets mailing list
Opensaf-tickets@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets


[tickets] [opensaf:tickets] #2520 clm: make CLM tests more independent from other CLM tests

2017-07-05 Thread Zoran Milinkovic via Opensaf-tickets
- **status**: accepted --> review



---

** [tickets:#2520] clm: make CLM tests more independent from other CLM tests**

**Status:** review
**Milestone:** 5.17.08
**Created:** Tue Jul 04, 2017 01:50 PM UTC by Zoran Milinkovic
**Last Updated:** Tue Jul 04, 2017 01:50 PM UTC
**Owner:** Zoran Milinkovic


Today's CLM tests depend on other tests, request them to revert CLM state to 
the starting state.
If CLM tests are done manually, then they must be done in an order. Also, some 
tests cannot be called twice.

For example test 7 21:
~~~
$ clmtest 7 21

Suite 7: Test case for saClmClusterTrack. ** For all tests to pass, Run a 
payload with node_name PL-3 **

waiting on poll
Inside TrackCallback4
invocation : 0
Step : 4
error = 1
numberOfMembers = 2
No of items = 1

Value of i = 0
Cluster Change = 3
Node Name length = 36, value = safNode=PL-3,safCluster=myClmCluster
Node Member = 0
Node  view number  = 5
Node  eename length = 0,value  = 
Node  boottimestamp  = 1499175648545988548
Node  nodeAddress family  = 1,node address length = 0, node address value = 
Node  nodeid  = 131855

   21  PASSED   saClmClusterTrack_4 with SA_TRACK_CHANGES_ONLY track flags - 
admin lock

=

   Test Result:
  Total:  1
  Passed: 1
  Failed: 0
~~~

And if we repeat the test, it fails:

~~~
$ clmtest 7 21

Suite 7: Test case for saClmClusterTrack. ** For all tests to pass, Run a 
payload with node_name PL-3 **

error - saImmOmAdminOperationInvoke_2 admin-op RETURNED: SA_AIS_ERR_NO_OP (28)
clmtest: src/clm/apitest/tet_saClmClusterTrack.c:604: saClmClusterTrack_21: 
Assertion `ret == 1' failed.
waiting on pollAborted
~~~

Or if we call another test after executing test 7 21:

~~~
$ clmtest 7 23

Suite 7: Test case for saClmClusterTrack. ** For all tests to pass, Run a 
payload with node_name PL-3 **

error - saImmOmAdminOperationInvoke_2 admin-op RETURNED: SA_AIS_ERR_NO_OP (28)
clmtest: src/clm/apitest/tet_saClmClusterTrack.c:672: saClmClusterTrack_23: 
Assertion `ret == 1' failed.
Aborted
~~~



---

Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is 
subscribed to https://sourceforge.net/p/opensaf/tickets/

To unsubscribe from further messages, a project admin can change settings at 
https://sourceforge.net/p/opensaf/admin/tickets/options.  Or, if this is a 
mailing list, you can unsubscribe from the mailing list.--
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot___
Opensaf-tickets mailing list
Opensaf-tickets@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets


[tickets] [opensaf:tickets] #2508 ntfd: coredump due to failure of pthread_mutex_destroy

2017-07-05 Thread Minh Hon Chau via Opensaf-tickets
- **status**: fixed --> review
- **assigned_to**: Minh Hon Chau
- **Comment**:

there's a bug in previous commit that allows to destroy uninitialized mutex, 
sending out for reveiw the updated patch



---

** [tickets:#2508] ntfd: coredump due to failure of pthread_mutex_destroy**

**Status:** review
**Milestone:** 5.17.08
**Created:** Wed Jun 21, 2017 06:12 AM UTC by Minh Hon Chau
**Last Updated:** Tue Jul 04, 2017 04:03 AM UTC
**Owner:** Minh Hon Chau


syslog:

2017-05-23 18:05:36 SC-3 systemd[1]: Stopping OpenSAF daemon...
2017-05-23 18:05:36 SC-3 opensafd: Stopping OpenSAF Services
2017-05-23 18:05:36 SC-3 osafamfnd[255]: NO Shutdown initiated
2017-05-23 18:05:36 SC-3 osafamfnd[255]: NO Terminating all AMF components
2017-05-23 18:05:36 SC-3 osafclmna[166]: exiting for shutdown
2017-05-23 18:05:36 SC-3 osafclmd[235]: exiting for shutdown
2017-05-23 18:05:36 SC-3 osafckptd[381]: exiting for shutdown
2017-05-23 18:05:36 SC-3 osafimmd[194]: exiting for shutdown
2017-05-23 18:05:36 SC-3 osaffmd[184]: exiting for shutdown
2017-05-23 18:05:36 SC-3 osafamfwd[335]: exiting for shutdown
2017-05-23 18:05:36 SC-3 osafckptnd[314]: exiting for shutdown
2017-05-23 18:05:36 SC-3 osafsmfnd[273]: exiting for shutdown
2017-05-23 18:05:36 SC-3 osaflogd[215]: exiting for shutdown
2017-05-23 18:05:36 SC-3 osafimmpbed: NO IMM PBE received SIG_TERM, closing db 
handle
2017-05-23 18:05:36 SC-3 osafimmpbed: IN IMM PBE process EXITING...
2017-05-23 18:05:36 SC-3 osafimmnd[205]: NO Implementer locally disconnected. 
Marking it as doomed 25 <303, 2030f> (OpenSafImmPBE)
2017-05-23 18:05:36 SC-3 osafrded[175]: exiting for shutdown
2017-05-23 18:05:36 SC-3 osafimmnd[205]: NO Implementer disconnected 25 <303, 
2030f> (OpenSafImmPBE)
2017-05-23 18:05:36 SC-3 osafimmnd[205]: exiting for shutdown
2017-05-23 18:05:36 SC-3 osafsmfd[271]: exiting for shutdown
2017-05-23 18:05:36 SC-3 osafntfimcnd[473]: NO saImmOiDispatch() Fail 
SA_AIS_ERR_BAD_HANDLE (9)
2017-05-23 18:05:36 SC-3 osafamfd[245]: NO Re-initializing with IMM

2017-05-23 18:05:36 SC-3 osafntfd[225]: osaf_abort(16) called from 
0x5611b80d2a69 with errno=4


backtrace:

~~~
[New LWP 225]
[New LWP 227]
[New LWP 228]
[New LWP 229]
[Thread debugging using libthread_db enabled]
Using host libthread_db library "/lib/x86_64-linux-gnu/libthread_db.so.1".
Core was generated by `/usr/local/lib/opensaf/osafntfd'.
Program terminated with signal SIGABRT, Aborted.
#0  0x7f8fd1780428 in __GI_raise (sig=sig@entry=6) at 
../sysdeps/unix/sysv/linux/raise.c:54
[Current thread is 1 (Thread 0x7f8fd32cc740 (LWP 225))]

Thread 4 (Thread 0x7f8fd3289b00 (LWP 229)):
#0  __clock_nanosleep (clock_id=, flags=1, req=0x7f8fd3289240, 
rem=0x0) at ../sysdeps/unix/sysv/linux/clock_nanosleep.c:48
oldstate = 0
r = 
rem = 0x0
req = 0x7f8fd3289240
flags = 1
clock_id = 
#1  0x7f8fd233157d in osaf_nanosleep (sleep_duration=0x7f8fd23b9c10 
<_ZN4baseL10kOneSecondE>) at src/base/osaf_time.c:39
wakeup_time = {tv_sec = 19013, tv_nsec = 484354498}
retval = 32655
#2  0x7f8fd23a0336 in base::Sleep (duration=...) at ./src/base/time.h:135
No locals.
#3  0x7f8fd23a043e in rda_callback_task (rda_callback_cb=0x5611b8fa61b0) at 
src/rde/agent/rda_papi.cc:117
msg = "10 2", '\000' 
rc = PCSRDA_RC_FATAL_IPC_CONNECTION_LOST
value = 2
retry_count = 0
conn_lost = true
cmd_type = RDE_RDA_HA_ROLE
cb_info = {cb_type = PCS_RDA_ROLE_CHG_IND, info = {io_role = 
PCS_RDA_STANDBY}}
#4  0x7f8fd1b1b6ba in start_thread (arg=0x7f8fd3289b00) at 
pthread_create.c:333
__res = 
pd = 0x7f8fd3289b00
now = 
unwind_buf = {cancel_jmp_buf = {{jmp_buf = {140255699704576, 
6214217193111308760, 1, 140732091062047, 140255699705280, 0, 
-6259172466962839080, -6259168954657519144}, mask_was_saved = 0}}, priv = {pad 
= {0x0, 0x0, 0x0, 0x0}, data = {prev = 0x0, cleanup = 0x0, canceltype = 0}}}
not_first_call = 
pagesize_m1 = 
sp = 
freesize = 
__PRETTY_FUNCTION__ = "start_thread"
#5  0x7f8fd185182d in clone () at 
../sysdeps/unix/sysv/linux/x86_64/clone.S:109
No locals.

Thread 3 (Thread 0x7f8fd32a9b00 (LWP 228)):
#0  0x7f8fd1845b5d in poll () at ../sysdeps/unix/syscall-template.S:84
No locals.
#1  0x7f8fd2388473 in mdtm_process_recv_events_tcp () at 
src/mds/mds_dt_trans.c:986
pollres = 1
#2  0x7f8fd1b1b6ba in start_thread (arg=0x7f8fd32a9b00) at 
pthread_create.c:333
__res = 
pd = 0x7f8fd32a9b00
now = 
unwind_buf = {cancel_jmp_buf = {{jmp_buf = {140255699835648, 
6214217193111308760, 1, 140732091060655, 140255699836352, 0, 
-6259172484142708264, -6259168954657519144}, mask_was_saved = 0}}, priv = {pad 
= {0x0, 0x0, 0x0, 0x0}, data = {prev = 0x0, cleanup = 0x0, canceltype = 0}}}
not_first_call = 
pagesize_m1 = 
sp = 
freesize = 
__PRETTY_FUNCTION__ = 

[tickets] [opensaf:tickets] #2521 smf: remove node locking with empty procedures

2017-07-05 Thread Rafael Odzakow via Opensaf-tickets



---

** [tickets:#2521] smf: remove node locking with empty procedures**

**Status:** unassigned
**Milestone:** 5.17.10
**Created:** Wed Jul 05, 2017 09:13 AM UTC by Rafael Odzakow
**Last Updated:** Wed Jul 05, 2017 09:13 AM UTC
**Owner:** Rafael Odzakow





---

Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is 
subscribed to https://sourceforge.net/p/opensaf/tickets/

To unsubscribe from further messages, a project admin can change settings at 
https://sourceforge.net/p/opensaf/admin/tickets/options.  Or, if this is a 
mailing list, you can unsubscribe from the mailing list.--
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot___
Opensaf-tickets mailing list
Opensaf-tickets@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets


[tickets] [opensaf:tickets] #2515 log: logtest 2 51 take long time (over 3 minutes)

2017-07-05 Thread Vu Minh Nguyen via Opensaf-tickets
- **status**: review --> fixed
- **assigned_to**: Canh Truong -->  nobody 
- **Comment**:

[Develop]
commit 7f82ddccb739eaae22aad5857520f90072bcb06f
Author: Canh Van Truong 
Date:   Thu Jun 29 18:42:23 2017 +0700

log: update test case logtest 2 51 [#2515]

Test case logtest 2 51 takes more than 3 minutes. The patch updates this
test case as following:
1/ Thread reports fail immeditely exit the test case with fail .
2/ Print information if any operation fails in each thread
3/ Wait thread terminate with timeout

[Release]
commit 60f7747c415e135b2c5b854447ca6396abf463bc
Author: Canh Van Truong 
Date:   Thu Jun 29 18:42:23 2017 +0700

log: update test case logtest 2 51 [#2515]

Test case logtest 2 51 takes more than 3 minutes. The patch updates this
test case as following:
1/ Thread reports fail immeditely exit the test case with fail .
2/ Print information if any operation fails in each thread
3/ Wait thread terminate with timeout




---

** [tickets:#2515] log: logtest 2 51 take long time (over 3 minutes)**

**Status:** fixed
**Milestone:** 5.17.08
**Created:** Fri Jun 30, 2017 02:56 AM UTC by Canh Truong
**Last Updated:** Sat Jul 01, 2017 04:17 PM UTC
**Owner:** nobody


Test case logtest 2 51 take more than 3 minutes.

The test case should be improved as following:
 1/ Do not wait for all threads to terminate before checking the result of each 
thread. If a thread reports fail immeditely exit the test case with fail .
 2/ In each thread. Log or print information if any operation fails.



---

Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is 
subscribed to https://sourceforge.net/p/opensaf/tickets/

To unsubscribe from further messages, a project admin can change settings at 
https://sourceforge.net/p/opensaf/admin/tickets/options.  Or, if this is a 
mailing list, you can unsubscribe from the mailing list.--
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot___
Opensaf-tickets mailing list
Opensaf-tickets@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets