[tickets] [opensaf:tickets] #2508 ntfd: coredump due to failure of pthread_mutex_destroy
- **status**: review --> fixed - **assigned_to**: Minh Hon Chau --> nobody - **Comment**: commits release:[645a7fca7576e437782da96197b56096e1afbd8c] develop:[cb04247b15858eadfeeb5c385b7544d0049b9e67] --- ** [tickets:#2508] ntfd: coredump due to failure of pthread_mutex_destroy** **Status:** fixed **Milestone:** 5.17.08 **Created:** Wed Jun 21, 2017 06:12 AM UTC by Minh Hon Chau **Last Updated:** Wed Jul 05, 2017 09:23 AM UTC **Owner:** nobody syslog: 2017-05-23 18:05:36 SC-3 systemd[1]: Stopping OpenSAF daemon... 2017-05-23 18:05:36 SC-3 opensafd: Stopping OpenSAF Services 2017-05-23 18:05:36 SC-3 osafamfnd[255]: NO Shutdown initiated 2017-05-23 18:05:36 SC-3 osafamfnd[255]: NO Terminating all AMF components 2017-05-23 18:05:36 SC-3 osafclmna[166]: exiting for shutdown 2017-05-23 18:05:36 SC-3 osafclmd[235]: exiting for shutdown 2017-05-23 18:05:36 SC-3 osafckptd[381]: exiting for shutdown 2017-05-23 18:05:36 SC-3 osafimmd[194]: exiting for shutdown 2017-05-23 18:05:36 SC-3 osaffmd[184]: exiting for shutdown 2017-05-23 18:05:36 SC-3 osafamfwd[335]: exiting for shutdown 2017-05-23 18:05:36 SC-3 osafckptnd[314]: exiting for shutdown 2017-05-23 18:05:36 SC-3 osafsmfnd[273]: exiting for shutdown 2017-05-23 18:05:36 SC-3 osaflogd[215]: exiting for shutdown 2017-05-23 18:05:36 SC-3 osafimmpbed: NO IMM PBE received SIG_TERM, closing db handle 2017-05-23 18:05:36 SC-3 osafimmpbed: IN IMM PBE process EXITING... 2017-05-23 18:05:36 SC-3 osafimmnd[205]: NO Implementer locally disconnected. Marking it as doomed 25 <303, 2030f> (OpenSafImmPBE) 2017-05-23 18:05:36 SC-3 osafrded[175]: exiting for shutdown 2017-05-23 18:05:36 SC-3 osafimmnd[205]: NO Implementer disconnected 25 <303, 2030f> (OpenSafImmPBE) 2017-05-23 18:05:36 SC-3 osafimmnd[205]: exiting for shutdown 2017-05-23 18:05:36 SC-3 osafsmfd[271]: exiting for shutdown 2017-05-23 18:05:36 SC-3 osafntfimcnd[473]: NO saImmOiDispatch() Fail SA_AIS_ERR_BAD_HANDLE (9) 2017-05-23 18:05:36 SC-3 osafamfd[245]: NO Re-initializing with IMM 2017-05-23 18:05:36 SC-3 osafntfd[225]: osaf_abort(16) called from 0x5611b80d2a69 with errno=4 backtrace: ~~~ [New LWP 225] [New LWP 227] [New LWP 228] [New LWP 229] [Thread debugging using libthread_db enabled] Using host libthread_db library "/lib/x86_64-linux-gnu/libthread_db.so.1". Core was generated by `/usr/local/lib/opensaf/osafntfd'. Program terminated with signal SIGABRT, Aborted. #0 0x7f8fd1780428 in __GI_raise (sig=sig@entry=6) at ../sysdeps/unix/sysv/linux/raise.c:54 [Current thread is 1 (Thread 0x7f8fd32cc740 (LWP 225))] Thread 4 (Thread 0x7f8fd3289b00 (LWP 229)): #0 __clock_nanosleep (clock_id=, flags=1, req=0x7f8fd3289240, rem=0x0) at ../sysdeps/unix/sysv/linux/clock_nanosleep.c:48 oldstate = 0 r = rem = 0x0 req = 0x7f8fd3289240 flags = 1 clock_id = #1 0x7f8fd233157d in osaf_nanosleep (sleep_duration=0x7f8fd23b9c10 <_ZN4baseL10kOneSecondE>) at src/base/osaf_time.c:39 wakeup_time = {tv_sec = 19013, tv_nsec = 484354498} retval = 32655 #2 0x7f8fd23a0336 in base::Sleep (duration=...) at ./src/base/time.h:135 No locals. #3 0x7f8fd23a043e in rda_callback_task (rda_callback_cb=0x5611b8fa61b0) at src/rde/agent/rda_papi.cc:117 msg = "10 2", '\000' rc = PCSRDA_RC_FATAL_IPC_CONNECTION_LOST value = 2 retry_count = 0 conn_lost = true cmd_type = RDE_RDA_HA_ROLE cb_info = {cb_type = PCS_RDA_ROLE_CHG_IND, info = {io_role = PCS_RDA_STANDBY}} #4 0x7f8fd1b1b6ba in start_thread (arg=0x7f8fd3289b00) at pthread_create.c:333 __res = pd = 0x7f8fd3289b00 now = unwind_buf = {cancel_jmp_buf = {{jmp_buf = {140255699704576, 6214217193111308760, 1, 140732091062047, 140255699705280, 0, -6259172466962839080, -6259168954657519144}, mask_was_saved = 0}}, priv = {pad = {0x0, 0x0, 0x0, 0x0}, data = {prev = 0x0, cleanup = 0x0, canceltype = 0}}} not_first_call = pagesize_m1 = sp = freesize = __PRETTY_FUNCTION__ = "start_thread" #5 0x7f8fd185182d in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:109 No locals. Thread 3 (Thread 0x7f8fd32a9b00 (LWP 228)): #0 0x7f8fd1845b5d in poll () at ../sysdeps/unix/syscall-template.S:84 No locals. #1 0x7f8fd2388473 in mdtm_process_recv_events_tcp () at src/mds/mds_dt_trans.c:986 pollres = 1 #2 0x7f8fd1b1b6ba in start_thread (arg=0x7f8fd32a9b00) at pthread_create.c:333 __res = pd = 0x7f8fd32a9b00 now = unwind_buf = {cancel_jmp_buf = {{jmp_buf = {140255699835648, 6214217193111308760, 1, 140732091060655, 140255699836352, 0, -6259172484142708264, -6259168954657519144}, mask_was_saved = 0}}, priv = {pad = {0x0, 0x0, 0x0, 0x0}, data = {prev = 0x0, cleanup = 0x0, canceltype = 0}}} not_first_call = pagesize_m1 = sp = freesize = __PRETTY_FUNCTION__ = "start_thread"
[tickets] [opensaf:tickets] #2520 clm: make CLM tests more independent from other CLM tests
- **Comment**: https://sourceforge.net/p/opensaf/mailman/message/35929376/ --- ** [tickets:#2520] clm: make CLM tests more independent from other CLM tests** **Status:** review **Milestone:** 5.17.08 **Created:** Tue Jul 04, 2017 01:50 PM UTC by Zoran Milinkovic **Last Updated:** Wed Jul 05, 2017 02:53 PM UTC **Owner:** Zoran Milinkovic Today's CLM tests depend on other tests, request them to revert CLM state to the starting state. If CLM tests are done manually, then they must be done in an order. Also, some tests cannot be called twice. For example test 7 21: ~~~ $ clmtest 7 21 Suite 7: Test case for saClmClusterTrack. ** For all tests to pass, Run a payload with node_name PL-3 ** waiting on poll Inside TrackCallback4 invocation : 0 Step : 4 error = 1 numberOfMembers = 2 No of items = 1 Value of i = 0 Cluster Change = 3 Node Name length = 36, value = safNode=PL-3,safCluster=myClmCluster Node Member = 0 Node view number = 5 Node eename length = 0,value = Node boottimestamp = 1499175648545988548 Node nodeAddress family = 1,node address length = 0, node address value = Node nodeid = 131855 21 PASSED saClmClusterTrack_4 with SA_TRACK_CHANGES_ONLY track flags - admin lock = Test Result: Total: 1 Passed: 1 Failed: 0 ~~~ And if we repeat the test, it fails: ~~~ $ clmtest 7 21 Suite 7: Test case for saClmClusterTrack. ** For all tests to pass, Run a payload with node_name PL-3 ** error - saImmOmAdminOperationInvoke_2 admin-op RETURNED: SA_AIS_ERR_NO_OP (28) clmtest: src/clm/apitest/tet_saClmClusterTrack.c:604: saClmClusterTrack_21: Assertion `ret == 1' failed. waiting on pollAborted ~~~ Or if we call another test after executing test 7 21: ~~~ $ clmtest 7 23 Suite 7: Test case for saClmClusterTrack. ** For all tests to pass, Run a payload with node_name PL-3 ** error - saImmOmAdminOperationInvoke_2 admin-op RETURNED: SA_AIS_ERR_NO_OP (28) clmtest: src/clm/apitest/tet_saClmClusterTrack.c:672: saClmClusterTrack_23: Assertion `ret == 1' failed. Aborted ~~~ --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- Check out the vibrant tech community on one of the world's most engaging tech sites, Slashdot.org! http://sdm.link/slashdot___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #2520 clm: make CLM tests more independent from other CLM tests
- **status**: accepted --> review --- ** [tickets:#2520] clm: make CLM tests more independent from other CLM tests** **Status:** review **Milestone:** 5.17.08 **Created:** Tue Jul 04, 2017 01:50 PM UTC by Zoran Milinkovic **Last Updated:** Tue Jul 04, 2017 01:50 PM UTC **Owner:** Zoran Milinkovic Today's CLM tests depend on other tests, request them to revert CLM state to the starting state. If CLM tests are done manually, then they must be done in an order. Also, some tests cannot be called twice. For example test 7 21: ~~~ $ clmtest 7 21 Suite 7: Test case for saClmClusterTrack. ** For all tests to pass, Run a payload with node_name PL-3 ** waiting on poll Inside TrackCallback4 invocation : 0 Step : 4 error = 1 numberOfMembers = 2 No of items = 1 Value of i = 0 Cluster Change = 3 Node Name length = 36, value = safNode=PL-3,safCluster=myClmCluster Node Member = 0 Node view number = 5 Node eename length = 0,value = Node boottimestamp = 1499175648545988548 Node nodeAddress family = 1,node address length = 0, node address value = Node nodeid = 131855 21 PASSED saClmClusterTrack_4 with SA_TRACK_CHANGES_ONLY track flags - admin lock = Test Result: Total: 1 Passed: 1 Failed: 0 ~~~ And if we repeat the test, it fails: ~~~ $ clmtest 7 21 Suite 7: Test case for saClmClusterTrack. ** For all tests to pass, Run a payload with node_name PL-3 ** error - saImmOmAdminOperationInvoke_2 admin-op RETURNED: SA_AIS_ERR_NO_OP (28) clmtest: src/clm/apitest/tet_saClmClusterTrack.c:604: saClmClusterTrack_21: Assertion `ret == 1' failed. waiting on pollAborted ~~~ Or if we call another test after executing test 7 21: ~~~ $ clmtest 7 23 Suite 7: Test case for saClmClusterTrack. ** For all tests to pass, Run a payload with node_name PL-3 ** error - saImmOmAdminOperationInvoke_2 admin-op RETURNED: SA_AIS_ERR_NO_OP (28) clmtest: src/clm/apitest/tet_saClmClusterTrack.c:672: saClmClusterTrack_23: Assertion `ret == 1' failed. Aborted ~~~ --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- Check out the vibrant tech community on one of the world's most engaging tech sites, Slashdot.org! http://sdm.link/slashdot___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #2508 ntfd: coredump due to failure of pthread_mutex_destroy
- **status**: fixed --> review - **assigned_to**: Minh Hon Chau - **Comment**: there's a bug in previous commit that allows to destroy uninitialized mutex, sending out for reveiw the updated patch --- ** [tickets:#2508] ntfd: coredump due to failure of pthread_mutex_destroy** **Status:** review **Milestone:** 5.17.08 **Created:** Wed Jun 21, 2017 06:12 AM UTC by Minh Hon Chau **Last Updated:** Tue Jul 04, 2017 04:03 AM UTC **Owner:** Minh Hon Chau syslog: 2017-05-23 18:05:36 SC-3 systemd[1]: Stopping OpenSAF daemon... 2017-05-23 18:05:36 SC-3 opensafd: Stopping OpenSAF Services 2017-05-23 18:05:36 SC-3 osafamfnd[255]: NO Shutdown initiated 2017-05-23 18:05:36 SC-3 osafamfnd[255]: NO Terminating all AMF components 2017-05-23 18:05:36 SC-3 osafclmna[166]: exiting for shutdown 2017-05-23 18:05:36 SC-3 osafclmd[235]: exiting for shutdown 2017-05-23 18:05:36 SC-3 osafckptd[381]: exiting for shutdown 2017-05-23 18:05:36 SC-3 osafimmd[194]: exiting for shutdown 2017-05-23 18:05:36 SC-3 osaffmd[184]: exiting for shutdown 2017-05-23 18:05:36 SC-3 osafamfwd[335]: exiting for shutdown 2017-05-23 18:05:36 SC-3 osafckptnd[314]: exiting for shutdown 2017-05-23 18:05:36 SC-3 osafsmfnd[273]: exiting for shutdown 2017-05-23 18:05:36 SC-3 osaflogd[215]: exiting for shutdown 2017-05-23 18:05:36 SC-3 osafimmpbed: NO IMM PBE received SIG_TERM, closing db handle 2017-05-23 18:05:36 SC-3 osafimmpbed: IN IMM PBE process EXITING... 2017-05-23 18:05:36 SC-3 osafimmnd[205]: NO Implementer locally disconnected. Marking it as doomed 25 <303, 2030f> (OpenSafImmPBE) 2017-05-23 18:05:36 SC-3 osafrded[175]: exiting for shutdown 2017-05-23 18:05:36 SC-3 osafimmnd[205]: NO Implementer disconnected 25 <303, 2030f> (OpenSafImmPBE) 2017-05-23 18:05:36 SC-3 osafimmnd[205]: exiting for shutdown 2017-05-23 18:05:36 SC-3 osafsmfd[271]: exiting for shutdown 2017-05-23 18:05:36 SC-3 osafntfimcnd[473]: NO saImmOiDispatch() Fail SA_AIS_ERR_BAD_HANDLE (9) 2017-05-23 18:05:36 SC-3 osafamfd[245]: NO Re-initializing with IMM 2017-05-23 18:05:36 SC-3 osafntfd[225]: osaf_abort(16) called from 0x5611b80d2a69 with errno=4 backtrace: ~~~ [New LWP 225] [New LWP 227] [New LWP 228] [New LWP 229] [Thread debugging using libthread_db enabled] Using host libthread_db library "/lib/x86_64-linux-gnu/libthread_db.so.1". Core was generated by `/usr/local/lib/opensaf/osafntfd'. Program terminated with signal SIGABRT, Aborted. #0 0x7f8fd1780428 in __GI_raise (sig=sig@entry=6) at ../sysdeps/unix/sysv/linux/raise.c:54 [Current thread is 1 (Thread 0x7f8fd32cc740 (LWP 225))] Thread 4 (Thread 0x7f8fd3289b00 (LWP 229)): #0 __clock_nanosleep (clock_id=, flags=1, req=0x7f8fd3289240, rem=0x0) at ../sysdeps/unix/sysv/linux/clock_nanosleep.c:48 oldstate = 0 r = rem = 0x0 req = 0x7f8fd3289240 flags = 1 clock_id = #1 0x7f8fd233157d in osaf_nanosleep (sleep_duration=0x7f8fd23b9c10 <_ZN4baseL10kOneSecondE>) at src/base/osaf_time.c:39 wakeup_time = {tv_sec = 19013, tv_nsec = 484354498} retval = 32655 #2 0x7f8fd23a0336 in base::Sleep (duration=...) at ./src/base/time.h:135 No locals. #3 0x7f8fd23a043e in rda_callback_task (rda_callback_cb=0x5611b8fa61b0) at src/rde/agent/rda_papi.cc:117 msg = "10 2", '\000' rc = PCSRDA_RC_FATAL_IPC_CONNECTION_LOST value = 2 retry_count = 0 conn_lost = true cmd_type = RDE_RDA_HA_ROLE cb_info = {cb_type = PCS_RDA_ROLE_CHG_IND, info = {io_role = PCS_RDA_STANDBY}} #4 0x7f8fd1b1b6ba in start_thread (arg=0x7f8fd3289b00) at pthread_create.c:333 __res = pd = 0x7f8fd3289b00 now = unwind_buf = {cancel_jmp_buf = {{jmp_buf = {140255699704576, 6214217193111308760, 1, 140732091062047, 140255699705280, 0, -6259172466962839080, -6259168954657519144}, mask_was_saved = 0}}, priv = {pad = {0x0, 0x0, 0x0, 0x0}, data = {prev = 0x0, cleanup = 0x0, canceltype = 0}}} not_first_call = pagesize_m1 = sp = freesize = __PRETTY_FUNCTION__ = "start_thread" #5 0x7f8fd185182d in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:109 No locals. Thread 3 (Thread 0x7f8fd32a9b00 (LWP 228)): #0 0x7f8fd1845b5d in poll () at ../sysdeps/unix/syscall-template.S:84 No locals. #1 0x7f8fd2388473 in mdtm_process_recv_events_tcp () at src/mds/mds_dt_trans.c:986 pollres = 1 #2 0x7f8fd1b1b6ba in start_thread (arg=0x7f8fd32a9b00) at pthread_create.c:333 __res = pd = 0x7f8fd32a9b00 now = unwind_buf = {cancel_jmp_buf = {{jmp_buf = {140255699835648, 6214217193111308760, 1, 140732091060655, 140255699836352, 0, -6259172484142708264, -6259168954657519144}, mask_was_saved = 0}}, priv = {pad = {0x0, 0x0, 0x0, 0x0}, data = {prev = 0x0, cleanup = 0x0, canceltype = 0}}} not_first_call = pagesize_m1 = sp = freesize = __PRETTY_FUNCTION__ =
[tickets] [opensaf:tickets] #2521 smf: remove node locking with empty procedures
--- ** [tickets:#2521] smf: remove node locking with empty procedures** **Status:** unassigned **Milestone:** 5.17.10 **Created:** Wed Jul 05, 2017 09:13 AM UTC by Rafael Odzakow **Last Updated:** Wed Jul 05, 2017 09:13 AM UTC **Owner:** Rafael Odzakow --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- Check out the vibrant tech community on one of the world's most engaging tech sites, Slashdot.org! http://sdm.link/slashdot___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #2515 log: logtest 2 51 take long time (over 3 minutes)
- **status**: review --> fixed - **assigned_to**: Canh Truong --> nobody - **Comment**: [Develop] commit 7f82ddccb739eaae22aad5857520f90072bcb06f Author: Canh Van TruongDate: Thu Jun 29 18:42:23 2017 +0700 log: update test case logtest 2 51 [#2515] Test case logtest 2 51 takes more than 3 minutes. The patch updates this test case as following: 1/ Thread reports fail immeditely exit the test case with fail . 2/ Print information if any operation fails in each thread 3/ Wait thread terminate with timeout [Release] commit 60f7747c415e135b2c5b854447ca6396abf463bc Author: Canh Van Truong Date: Thu Jun 29 18:42:23 2017 +0700 log: update test case logtest 2 51 [#2515] Test case logtest 2 51 takes more than 3 minutes. The patch updates this test case as following: 1/ Thread reports fail immeditely exit the test case with fail . 2/ Print information if any operation fails in each thread 3/ Wait thread terminate with timeout --- ** [tickets:#2515] log: logtest 2 51 take long time (over 3 minutes)** **Status:** fixed **Milestone:** 5.17.08 **Created:** Fri Jun 30, 2017 02:56 AM UTC by Canh Truong **Last Updated:** Sat Jul 01, 2017 04:17 PM UTC **Owner:** nobody Test case logtest 2 51 take more than 3 minutes. The test case should be improved as following: 1/ Do not wait for all threads to terminate before checking the result of each thread. If a thread reports fail immeditely exit the test case with fail . 2/ In each thread. Log or print information if any operation fails. --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- Check out the vibrant tech community on one of the world's most engaging tech sites, Slashdot.org! http://sdm.link/slashdot___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets