[tickets] [opensaf:tickets] #315 SU restart not according to spec
For default branch 315.patch rebased on parent: 6844:ff30714b16bb tip. Attachments: - [315.patch](https://sourceforge.net/p/opensaf/tickets/_discuss/thread/f316c8a6/ddd6/attachment/315.patch) (93.8 kB; application/octet-stream) --- ** [tickets:#315] SU restart not according to spec** **Status:** review **Milestone:** 4.5.2 **Created:** Fri May 24, 2013 08:35 AM UTC by Nagendra Kumar **Last Updated:** Fri Sep 18, 2015 06:19 AM UTC **Owner:** Praveen Migrated from http://devel.opensaf.org/ticket/3061 First issue: = When testing http://devel.opensaf.org/ticket/3056 I found the problem that SU restart does not follow the instantiation level as supposed to: spec 3.8.2 "Within a service unit, the Availability Management Framework terminates the pre-instantiable components according to the configured instantiation level. All pre-instantiable components with the same instantiation level are terminated by the Avail- ability Management Framework in parallel. Pre-instantiable components of a given level are only terminated by the Availability Management Framework when all pre-instantiable components with a higher instantiation level have been terminated. As has been said previously, the instantiation level is only applicable during service unit instantiation and termination. As restarting a service unit means terminating the service unit and instantiating it again, the instantiation level also applies when restart-ing a service unit." It is obvious from the code in avnd_su_pres_inst_surestart_hdler(): /* •If pi su, pick the first pi comp & trigger it's FSM with RestartEv?. */ if (m_AVND_SU_IS_PREINSTANTIABLE(su)) { TRACE("PI SU:'%s'",su->name.value); for (curr_comp = m_AVND_COMP_FROM_SU_DLL_NODE_GET(m_NCS_DBLIST_FIND_FIRST(&su->comp_list)); should pick the last component since this list is sorted by the instantiation level. Second issue: 3.11.1.2: "Restarting a service unit is achieved by the following actions: • First, all components in the service unit are terminated in the order dictated by their instantiation-levels. • In a second step, all components in the service unit are instantiated in the order dictated by their instantiation-levels." That is not the case today since each component is restarted (not terminated and instantiated) --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- ___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #315 SU restart not according to spec
Attached are the patches for 4.5 and 4.6 branches. 1)4.5 : 315_4.5.patch (parent: 6841:0198c81ad4ad). 2)4.6: 315_4.6.patch (generated on parent: 6842:e9b051d8a81e) Attachments: - [315_4.5.patch](https://sourceforge.net/p/opensaf/tickets/_discuss/thread/f316c8a6/ace0/attachment/315_4.5.patch) (93.8 kB; application/octet-stream) - [315_4.6.patch](https://sourceforge.net/p/opensaf/tickets/_discuss/thread/f316c8a6/ace0/attachment/315_4.6.patch) (93.8 kB; application/octet-stream) --- ** [tickets:#315] SU restart not according to spec** **Status:** review **Milestone:** 4.5.2 **Created:** Fri May 24, 2013 08:35 AM UTC by Nagendra Kumar **Last Updated:** Fri Sep 11, 2015 04:35 AM UTC **Owner:** Praveen Migrated from http://devel.opensaf.org/ticket/3061 First issue: = When testing http://devel.opensaf.org/ticket/3056 I found the problem that SU restart does not follow the instantiation level as supposed to: spec 3.8.2 "Within a service unit, the Availability Management Framework terminates the pre-instantiable components according to the configured instantiation level. All pre-instantiable components with the same instantiation level are terminated by the Avail- ability Management Framework in parallel. Pre-instantiable components of a given level are only terminated by the Availability Management Framework when all pre-instantiable components with a higher instantiation level have been terminated. As has been said previously, the instantiation level is only applicable during service unit instantiation and termination. As restarting a service unit means terminating the service unit and instantiating it again, the instantiation level also applies when restart-ing a service unit." It is obvious from the code in avnd_su_pres_inst_surestart_hdler(): /* •If pi su, pick the first pi comp & trigger it's FSM with RestartEv?. */ if (m_AVND_SU_IS_PREINSTANTIABLE(su)) { TRACE("PI SU:'%s'",su->name.value); for (curr_comp = m_AVND_COMP_FROM_SU_DLL_NODE_GET(m_NCS_DBLIST_FIND_FIRST(&su->comp_list)); should pick the last component since this list is sorted by the instantiation level. Second issue: 3.11.1.2: "Restarting a service unit is achieved by the following actions: • First, all components in the service unit are terminated in the order dictated by their instantiation-levels. • In a second step, all components in the service unit are instantiated in the order dictated by their instantiation-levels." That is not the case today since each component is restarted (not terminated and instantiated) --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- ___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #464 Verify duplicate TIPC node_id before configuring TIPC
- **status**: review --> fixed - **Comment**: changeset: 6844:ff30714b16bb tag: tip user:A V Mahesh date:Fri Sep 18 11:00:14 2015 +0530 summary: opensaf: prevented cluster re-start on misconfigured slot_id [#464/#414] --- ** [tickets:#464] Verify duplicate TIPC node_id before configuring TIPC** **Status:** fixed **Milestone:** 4.7.FC **Created:** Tue Jun 18, 2013 10:06 AM UTC by Mathi Naickan **Last Updated:** Thu Sep 03, 2015 08:56 AM UTC **Owner:** A V Mahesh (AVM) Currently, there is no direct way to detect if there is already another node running with the same TIPC node_id. If we could query say the TIPC routing table before doing stuff that within nid_tipc, then we could fail the opensaf startup when duplicate slot_id is configured. This can fix known issues like http://devel.opensaf.org/ticket/2802 --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- ___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #427 amf: Dependent sis are still in assigned state when tolerence timer expires during controller failover.
- **status**: review --> fixed - **Comment**: changeset: 6841:0198c81ad4ad branch: opensaf-4.5.x parent: 6838:9ff3ee6e28df user:praveen.malv...@oracle.com date:Fri Sep 18 10:13:36 2015 +0530 summary: amfd: act on dep SIs if tol timer expires during controller fail-over [#427] changeset: 6842:e9b051d8a81e branch: opensaf-4.6.x parent: 6839:964e043fa545 user:praveen.malv...@oracle.com date:Fri Sep 18 10:13:52 2015 +0530 summary: amfd: act on dep SIs if tol timer expires during controller fail-over [#427] changeset: 6843:6a62544eb8ac tag: tip parent: 6840:a2c9e8a31cb4 user:praveen.malv...@oracle.com date:Fri Sep 18 10:14:03 2015 +0530 summary: amfd: act on dep SIs if tol timer expires during controller fail-over [#427] --- ** [tickets:#427] amf: Dependent sis are still in assigned state when tolerence timer expires during controller failover.** **Status:** fixed **Milestone:** 4.5.2 **Labels:** SI Deps **Created:** Fri May 31, 2013 06:32 AM UTC by Praveen **Last Updated:** Tue May 19, 2015 10:08 AM UTC **Owner:** Praveen Migrated from http://devel.opensaf.org/ticket/2834. Changeset : 3784 4.2.2 RC1 Found on SLES 32 Bit Setup 2 Controllers and 2 Payloads Model - 2n SUs are hosted on payloads. Scenario: Created Dependency like SI1 as sponsor and SI2 as dependent with tolerance time as 60 sec. Lock SI1 SI1 has been unassigned. Waited for 56 sec Did "pkill osafamfd" on Active Controller and Active controller went for reboot Controller node joined successfully after reboot. SI2 unassignment never happened. SI assignment status is showing as: safSi=SI1,safApp=App1 saAmfSIAdminState=LOCKED(2) saAmfSIAssignmentState=UNASSIGNED(1) safSi=SI2,safApp=App1 saAmfSIAdminState=UNLOCKED(1) saAmfSIAssignmentState=FULLY_ASSIGNED(2) safSi=SI3,safApp=App1 saAmfSIAdminState=UNLOCKED(1) saAmfSIAssignmentState=FULLY_ASSIGNED(2) safSi=SI4,safApp=App1 saAmfSIAdminState=UNLOCKED(1) saAmfSIAssignmentState=FULLY_ASSIGNED(2) Expected behavior is when tolerance timer is expires during controller failover, tolerance timer should restart after failover is done. Attached /var/log/messages of all nodes --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- ___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #1488 LCK: if master GLND reboots deadlock can occur for currently held locks
- **status**: assigned --> review --- ** [tickets:#1488] LCK: if master GLND reboots deadlock can occur for currently held locks** **Status:** review **Milestone:** 4.5.2 **Created:** Wed Sep 16, 2015 02:54 PM UTC by Alex Jones **Last Updated:** Wed Sep 16, 2015 02:54 PM UTC **Owner:** Alex Jones If the master GLND is rebooted while an exclusive lock (or locks) is held, when the new master is elected and the other GLNDs send over the current lock information held by them to the new master, they do not send all information needed by the new master to lock/unlock currently held locks. When this happens the lock(s) can never be unlocked or granted. --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- Monitor Your Dynamic Infrastructure at Any Scale With Datadog! Get real-time metrics from all of your servers, apps and tools in one place. SourceForge users - Click here to start your Free Trial of Datadog now! http://pubads.g.doubleclick.net/gampad/clk?id=241902991&iu=/4140___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #1473 AMF: Component Termination Failed alarm is raised then cleared immediately
- **status**: accepted --> review --- ** [tickets:#1473] AMF: Component Termination Failed alarm is raised then cleared immediately** **Status:** review **Milestone:** 4.5.2 **Created:** Thu Sep 10, 2015 09:52 AM UTC by Quyen Dao **Last Updated:** Thu Sep 17, 2015 11:43 AM UTC **Owner:** Praveen **Attachments:** - [osafamfd](https://sourceforge.net/p/opensaf/tickets/1473/attachment/osafamfd) (161.3 kB; application/octet-stream) - [AppConfig-2N.xml](https://sourceforge.net/p/opensaf/tickets/1473/attachment/AppConfig-2N.xml) (9.3 kB; text/xml) Steps to reproduce: * Load the attached model * Change the clc-cli script of the component to always return 1 for cleanup * Kill the component Result: Component Termination Failed alarm is raised then cleared immediately and presence state of the failed component is UNINSTANTIATED instead TERMINATION-FAILED. Log root@PL-3:~# ntfread === Sep 10 15:51:42 - Alarm === eventType = SA_NTF_ALARM_PROCESSING notificationObject = "safComp=AmfDemo,safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1" notifyingObject = "safApp=safAmfService" notificationClassId = SA_NTF_VENDOR_ID_SAF.SA_SVC_AMF.3 (0x3) additionalText = "Cleanup of Component safComp=AmfDemo,safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1 failed" - additionalInfo: 0 - infoId = 1 infoType = 10 infoValue = "safAmfNode=PL-3,safAmfCluster=myAmfCluster" probableCause = SA_NTF_SOFTWARE_ERROR perceivedSeverity = SA_NTF_SEVERITY_MAJOR === Sep 10 15:51:42 - Alarm === eventType = SA_NTF_ALARM_PROCESSING notificationObject = "safComp=AmfDemo,safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1" notifyingObject = "safApp=safAmfService" notificationClassId = SA_NTF_VENDOR_ID_SAF.SA_SVC_AMF.3 (0x3) additionalText = "Previous raised alarm of safComp=AmfDemo,safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1 is now cleared" probableCause = SA_NTF_SOFTWARE_ERROR perceivedSeverity = SA_NTF_SEVERITY_CLEARED root@PL-3:~# amf-state su all safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1 safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1 saAmfSUAdminState=UNLOCKED(1) saAmfSUOperState=DISABLED(2) saAmfSUPresenceState=TERMINATION-FAILED(7) saAmfSUReadinessState=OUT-OF-SERVICE(1) root@PL-3:~# root@PL-3:~# amf-state comp all safComp=AmfDemo,safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1 safComp=AmfDemo,safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1 saAmfCompOperState=DISABLED(2) saAmfCompPresenceState=UNINSTANTIATED(1) saAmfCompReadinessState=OUT-OF-SERVICE(1) root@PL-3:~# --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- Monitor Your Dynamic Infrastructure at Any Scale With Datadog! Get real-time metrics from all of your servers, apps and tools in one place. SourceForge users - Click here to start your Free Trial of Datadog now! http://pubads.g.doubleclick.net/gampad/clk?id=241902991&iu=/4140___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #78 amf: support for si-swap admin op for Nway model.
Attached is a sample configuration for Nway model. Attachments: - [AppConfig-N-Way_2Comp_2SI_2CSI.xml](https://sourceforge.net/p/opensaf/tickets/_discuss/thread/fc168502/dd29/attachment/AppConfig-N-Way_2Comp_2SI_2CSI.xml) (16.1 kB; text/xml) --- ** [tickets:#78] amf: support for si-swap admin op for Nway model.** **Status:** review **Milestone:** 4.7.FC **Created:** Mon May 13, 2013 04:31 AM UTC by Nagendra Kumar **Last Updated:** Wed Sep 16, 2015 10:54 AM UTC **Owner:** Praveen Migrated from http://devel.opensaf.org/ticket/2073 performing "si-swap" on an si of models NplusM, Nway,NwayActive? is returning "ERR_NOT SUPPORTED" Also there is no mentioning about what the return code should be for "si-swap" on NwayActive? and NoRed? Models. console output: amf-adm si-swap safSi=NWayActiveXAppSiorder_SI1,safApp=NWayActiveXAppSiorder error - saImmOmAdminOperationInvoke_2 admin-op RETURNED: SA_AIS_ERR_NOT_SUPPORTED (19) **Implementation details:** In this implemntaion for NWAY model, si-swap admin op will be supported with following restrictions: 1)Operation will be rejected if Equal distribution feature is enabled. 2)operation will be rejected if SIRankedSU is configured and AutoAdjust flag is enabled. However it will be allowed if AutoAdjust is disabled. **Reason for rejecting the operation in above two cases:** In both of these cases AutoAdjust flag is eanbled and is used to partially adjust the SG. At the same time, autoadjust feature is used without using autoadjustprob timer. In the equal distribution cases both active assignments and standby assignments are equally distributed among the available SUs without waiting for any autoadjustprob timer expiry. Thus if si-swap is supported in this case, SG will be adjusted to same configuration. Same explanation is valid for other case also. --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- Monitor Your Dynamic Infrastructure at Any Scale With Datadog! Get real-time metrics from all of your servers, apps and tools in one place. SourceForge users - Click here to start your Free Trial of Datadog now! http://pubads.g.doubleclick.net/gampad/clk?id=241902991&iu=/4140___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #520 Mds: Tune MDS logging to minimal & informative
- **status**: assigned --> review --- ** [tickets:#520] Mds: Tune MDS logging to minimal & informative ** **Status:** review **Milestone:** 4.7.FC **Created:** Thu Jul 25, 2013 01:19 PM UTC by hano **Last Updated:** Tue Aug 25, 2015 04:08 PM UTC **Owner:** A V Mahesh (AVM) Minimize the MDS logging to only in case of required so that it can not reach 1 Mb of log rotation range/size sooner . amfnd core dump is produced when amfnd main thread (10720) is waiting for a pthread mutex, gl_mds_library_mutex, which is held by the mds thread (10723). The amf watchdog detects this (no healthchecks received) and sends an abort signal to the amfnd. Holding a mutex during file operations in MDS is not correct and should be corrected. (HR50165) #0 0x7f7830d70294 in __lll_lock_wait () from /lib64/libpthread.so.0 (gdb) p gl_mds_library_mutex $1 = {__data = {__lock = 2, __count = 1, __owner = 10723, __nusers = 1, __kind = 1, __spins = 0, __list = {__prev = 0x0, __next = 0x0}}, __size = " ã ) ", ' ' , __align = 4294967298} (gdb) info thr Id Target Id Frame 4Thread 0x7f7832263b00 (LWP 10723) 0x7f783083e20d in write () from /lib64/libc.so.6 3Thread 0x7f7832283b00 (LWP 10722) 0x7f7830844f53 in select () from /lib64/libc.so.6 2Thread 0x7f7832243b00 (LWP 10724) 0x7f7830d7076d in read () from /lib64/libpthread.so.0 * 1Thread 0x7f7832286700 (LWP 10720) 0x7f7830d70294 in __lll_lock_wait () from /lib64/libpthread.so.0 --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- Monitor Your Dynamic Infrastructure at Any Scale With Datadog! Get real-time metrics from all of your servers, apps and tools in one place. SourceForge users - Click here to start your Free Trial of Datadog now! http://pubads.g.doubleclick.net/gampad/clk?id=241902991&iu=/4140___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #1473 AMF: Component Termination Failed alarm is raised then cleared immediately
- **status**: unassigned --> accepted - **assigned_to**: Praveen - Attachments has changed: Diff: --- old +++ new @@ -1 +1,2 @@ +osafamfd (161.3 kB; application/octet-stream) AppConfig-2N.xml (9.3 kB; text/xml) - **Part**: - --> d - **Comment**: I reproduced the issue by using the attached configuration. Only change I did is hosting the Sus on controllers. There is a case when amfd is clearing the alram raised on comp in term_failed state. When all components are cleaned up, amfnd sends a su-failvoer request to amfd. In this case alram is being cleared and component is marked uninstantiated, even in the case when saAmfNodeFailfastOnTerminationFailure is false on the node hosting the term_failed comp. Amfd should not mark comp uninstantiated and hence should not clear alarm when saAmfNodeFailfastOnTerminationFailure is false and admin repair is pending. Attached is the amfd trace. --- ** [tickets:#1473] AMF: Component Termination Failed alarm is raised then cleared immediately** **Status:** accepted **Milestone:** 4.5.2 **Created:** Thu Sep 10, 2015 09:52 AM UTC by Quyen Dao **Last Updated:** Thu Sep 10, 2015 09:52 AM UTC **Owner:** Praveen **Attachments:** - [osafamfd](https://sourceforge.net/p/opensaf/tickets/1473/attachment/osafamfd) (161.3 kB; application/octet-stream) - [AppConfig-2N.xml](https://sourceforge.net/p/opensaf/tickets/1473/attachment/AppConfig-2N.xml) (9.3 kB; text/xml) Steps to reproduce: * Load the attached model * Change the clc-cli script of the component to always return 1 for cleanup * Kill the component Result: Component Termination Failed alarm is raised then cleared immediately and presence state of the failed component is UNINSTANTIATED instead TERMINATION-FAILED. Log root@PL-3:~# ntfread === Sep 10 15:51:42 - Alarm === eventType = SA_NTF_ALARM_PROCESSING notificationObject = "safComp=AmfDemo,safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1" notifyingObject = "safApp=safAmfService" notificationClassId = SA_NTF_VENDOR_ID_SAF.SA_SVC_AMF.3 (0x3) additionalText = "Cleanup of Component safComp=AmfDemo,safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1 failed" - additionalInfo: 0 - infoId = 1 infoType = 10 infoValue = "safAmfNode=PL-3,safAmfCluster=myAmfCluster" probableCause = SA_NTF_SOFTWARE_ERROR perceivedSeverity = SA_NTF_SEVERITY_MAJOR === Sep 10 15:51:42 - Alarm === eventType = SA_NTF_ALARM_PROCESSING notificationObject = "safComp=AmfDemo,safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1" notifyingObject = "safApp=safAmfService" notificationClassId = SA_NTF_VENDOR_ID_SAF.SA_SVC_AMF.3 (0x3) additionalText = "Previous raised alarm of safComp=AmfDemo,safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1 is now cleared" probableCause = SA_NTF_SOFTWARE_ERROR perceivedSeverity = SA_NTF_SEVERITY_CLEARED root@PL-3:~# amf-state su all safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1 safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1 saAmfSUAdminState=UNLOCKED(1) saAmfSUOperState=DISABLED(2) saAmfSUPresenceState=TERMINATION-FAILED(7) saAmfSUReadinessState=OUT-OF-SERVICE(1) root@PL-3:~# root@PL-3:~# amf-state comp all safComp=AmfDemo,safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1 safComp=AmfDemo,safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1 saAmfCompOperState=DISABLED(2) saAmfCompPresenceState=UNINSTANTIATED(1) saAmfCompReadinessState=OUT-OF-SERVICE(1) root@PL-3:~# --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- Monitor Your Dynamic Infrastructure at Any Scale With Datadog! Get real-time metrics from all of your servers, apps and tools in one place. SourceForge users - Click here to start your Free Trial of Datadog now! http://pubads.g.doubleclick.net/gampad/clk?id=241902991&iu=/4140___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #1490 log: cannot delete value of logStreamFileFormat
- **status**: accepted --> review --- ** [tickets:#1490] log: cannot delete value of logStreamFileFormat** **Status:** review **Milestone:** 4.7.FC **Created:** Thu Sep 17, 2015 09:07 AM UTC by Vu Minh Nguyen **Last Updated:** Thu Sep 17, 2015 09:07 AM UTC **Owner:** Vu Minh Nguyen At start up, `logStreamFileFormat` attribute in configuration obj has value. Means that, built-in file format will be used as default value for app stream. If user sets an value to `logStreamFileFormat`, users can not delete the value of this attribute. This is not correct. If user want to use built-in file format, logsv has to allow user to delete the value of `logStreamFileFormat`. --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- Monitor Your Dynamic Infrastructure at Any Scale With Datadog! Get real-time metrics from all of your servers, apps and tools in one place. SourceForge users - Click here to start your Free Trial of Datadog now! http://pubads.g.doubleclick.net/gampad/clk?id=241902991&iu=/4140___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #1490 log: cannot delete value of logStreamFileFormat
- **status**: unassigned --> accepted - **assigned_to**: Vu Minh Nguyen --- ** [tickets:#1490] log: cannot delete value of logStreamFileFormat** **Status:** accepted **Milestone:** 4.7.FC **Created:** Thu Sep 17, 2015 09:07 AM UTC by Vu Minh Nguyen **Last Updated:** Thu Sep 17, 2015 09:07 AM UTC **Owner:** Vu Minh Nguyen At start up, `logStreamFileFormat` attribute in configuration obj has value. Means that, built-in file format will be used as default value for app stream. If user sets an value to `logStreamFileFormat`, users can not delete the value of this attribute. This is not correct. If user want to use built-in file format, logsv has to allow user to delete the value of `logStreamFileFormat`. --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- Monitor Your Dynamic Infrastructure at Any Scale With Datadog! Get real-time metrics from all of your servers, apps and tools in one place. SourceForge users - Click here to start your Free Trial of Datadog now! http://pubads.g.doubleclick.net/gampad/clk?id=241902991&iu=/4140___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #1490 log: cannot delete value of logStreamFileFormat
--- ** [tickets:#1490] log: cannot delete value of logStreamFileFormat** **Status:** unassigned **Milestone:** 4.7.FC **Created:** Thu Sep 17, 2015 09:07 AM UTC by Vu Minh Nguyen **Last Updated:** Thu Sep 17, 2015 09:07 AM UTC **Owner:** nobody At start up, `logStreamFileFormat` attribute in configuration obj has value. Means that, built-in file format will be used as default value for app stream. If user sets an value to `logStreamFileFormat`, users can not delete the value of this attribute. This is not correct. If user want to use built-in file format, logsv has to allow user to delete the value of `logStreamFileFormat`. --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- Monitor Your Dynamic Infrastructure at Any Scale With Datadog! Get real-time metrics from all of your servers, apps and tools in one place. SourceForge users - Click here to start your Free Trial of Datadog now! http://pubads.g.doubleclick.net/gampad/clk?id=241902991&iu=/4140___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #1489 amf: setting saAmfSIPrefActiveAssignments to empty in runtime causes amfd to crash.
--- ** [tickets:#1489] amf: setting saAmfSIPrefActiveAssignments to empty in runtime causes amfd to crash.** **Status:** unassigned **Milestone:** future **Created:** Thu Sep 17, 2015 08:50 AM UTC by Rafael **Last Updated:** Thu Sep 17, 2015 08:50 AM UTC **Owner:** nobody To reproduce it run: immcfg -a saAmfSIPrefActiveAssignments="" safSi=All-NWayActive,safApp=DemoApp This will cause a amfd crash, from the system log: Sep 17 10:00:46 SC-2-1 osafamfd[15466]: si.cc:1291: update_ass_state: Assertion 'saAmfSINumCurrActiveAssignments <= saAmfSIPrefActiveAssignments' failed. Sep 17 10:00:47 SC-2-1 osafamfnd[15479]: WA AMF director unexpectedly crashed Sep 17 10:00:47 SC-2-1 osafamfnd[15479]: Rebooting OpenSAF NodeId = 131343 EE Name = , Reason: local AVD down(Adest) or both AVD down(Vdest) received, OwnNodeId = 131343, SupervisionTime = 60 --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- Monitor Your Dynamic Infrastructure at Any Scale With Datadog! Get real-time metrics from all of your servers, apps and tools in one place. SourceForge users - Click here to start your Free Trial of Datadog now! http://pubads.g.doubleclick.net/gampad/clk?id=241902991&iu=/4140___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #1396 log: issue with priority of messages adding to mailbox in the server
- **Type**: defect --> enhancement - **Milestone**: 4.5.2 --> 5.0 --- ** [tickets:#1396] log: issue with priority of messages adding to mailbox in the server** **Status:** accepted **Milestone:** 5.0 **Created:** Mon Jun 29, 2015 09:14 AM UTC by Vu Minh Nguyen **Last Updated:** Thu Sep 17, 2015 07:41 AM UTC **Owner:** do truong giang If there are several initialize and finalize done in a row in the same process, there is possibility getting SA_AIS_ERR_BAD_HANDLE. [Test sequence] 1. saLogInitialize() 2. saLogFinalize() 3. saLogInitialize() 4. saLogFinalize() --> get error SA_AIS_ERR_BAD_HANDLE Here is the problem: The finalize request will delete the requested client and the agent will wait for the answer (ack). If the last client initialized by this process is finalized the agent will free the resources used by the agent which means that the mds also is finalized which results in an agent down event on the server side. The agent down message to the event handler is placed in the mailbox queue with the lowest priority. In this test sequence the next thing that happens is that the agent is started again and a new initiate request is sent. When this request is received on the server the initialize message to the event handler is placed in the queue with the highest priority. What happen in this case is that the event handler has not yet fetched the agent down message when the initiate message is queued which means that the order of handling the messages will be initialize (highest priority) and then agent down (lowest priority). - The initialize will create a client - The agent down will remove all clients for the agent process including the just initialized - The agent sends a finalize and gets BAD_HANDLE as a reply in the ack from the server since the client is already deleted. --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- Monitor Your Dynamic Infrastructure at Any Scale With Datadog! Get real-time metrics from all of your servers, apps and tools in one place. SourceForge users - Click here to start your Free Trial of Datadog now! http://pubads.g.doubleclick.net/gampad/clk?id=241902991&iu=/4140___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #1396 log: issue with priority of messages adding to mailbox in the server
- **status**: assigned --> accepted --- ** [tickets:#1396] log: issue with priority of messages adding to mailbox in the server** **Status:** accepted **Milestone:** 4.5.2 **Created:** Mon Jun 29, 2015 09:14 AM UTC by Vu Minh Nguyen **Last Updated:** Thu Jul 23, 2015 03:25 AM UTC **Owner:** do truong giang If there are several initialize and finalize done in a row in the same process, there is possibility getting SA_AIS_ERR_BAD_HANDLE. [Test sequence] 1. saLogInitialize() 2. saLogFinalize() 3. saLogInitialize() 4. saLogFinalize() --> get error SA_AIS_ERR_BAD_HANDLE Here is the problem: The finalize request will delete the requested client and the agent will wait for the answer (ack). If the last client initialized by this process is finalized the agent will free the resources used by the agent which means that the mds also is finalized which results in an agent down event on the server side. The agent down message to the event handler is placed in the mailbox queue with the lowest priority. In this test sequence the next thing that happens is that the agent is started again and a new initiate request is sent. When this request is received on the server the initialize message to the event handler is placed in the queue with the highest priority. What happen in this case is that the event handler has not yet fetched the agent down message when the initiate message is queued which means that the order of handling the messages will be initialize (highest priority) and then agent down (lowest priority). - The initialize will create a client - The agent down will remove all clients for the agent process including the just initialized - The agent sends a finalize and gets BAD_HANDLE as a reply in the ack from the server since the client is already deleted. --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- Monitor Your Dynamic Infrastructure at Any Scale With Datadog! Get real-time metrics from all of your servers, apps and tools in one place. SourceForge users - Click here to start your Free Trial of Datadog now! http://pubads.g.doubleclick.net/gampad/clk?id=241902991&iu=/4140___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets