[tickets] [opensaf:tickets] #1988 AMF: Admin operation continuation does not work with short cluster init timeout
- **status**: review --> fixed - **assigned_to**: Minh Hon Chau --> nobody - **Comment**: changeset: 8103:aac8fd955b93 tag: tip parent: 8101:9ffc9d219684 user:minh-chau date:Tue Sep 20 15:40:20 2016 +1000 summary: AMFD: Sync all nodes presence state before starting application assignment [#1988] changeset: 8102:1794ee4f2e48 branch: opensaf-5.1.x parent: 8100:9f1767961132 user:minh-chau date:Tue Sep 20 15:36:43 2016 +1000 summary: AMFD: Sync all nodes presence state before starting application assignment [#1988] --- ** [tickets:#1988] AMF: Admin operation continuation does not work with short cluster init timeout** **Status:** fixed **Milestone:** 5.1.RC2 **Created:** Wed Aug 31, 2016 12:04 AM UTC by Minh Hon Chau **Last Updated:** Mon Sep 19, 2016 01:08 PM UTC **Owner:** nobody In scenario of admin continuation after headless, if saAmfClusterStartupTimeout configures short value, then the admin continuation will initiate when saAmfClusterStartupTimeout expires but the SU is still in OUT OF SERVICE. The eventual result is failure of admin operation after headless. --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- ___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #1988 AMF: Admin operation continuation does not work with short cluster init timeout
Hi Minh, Ack code review only. Thanks, Praveen --- ** [tickets:#1988] AMF: Admin operation continuation does not work with short cluster init timeout** **Status:** review **Milestone:** 5.1.RC2 **Created:** Wed Aug 31, 2016 12:04 AM UTC by Minh Hon Chau **Last Updated:** Wed Sep 14, 2016 02:34 AM UTC **Owner:** Minh Hon Chau In scenario of admin continuation after headless, if saAmfClusterStartupTimeout configures short value, then the admin continuation will initiate when saAmfClusterStartupTimeout expires but the SU is still in OUT OF SERVICE. The eventual result is failure of admin operation after headless. --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- ___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #1988 AMF: Admin operation continuation does not work with short cluster init timeout
- **status**: assigned --> review --- ** [tickets:#1988] AMF: Admin operation continuation does not work with short cluster init timeout** **Status:** review **Milestone:** 5.1.RC2 **Created:** Wed Aug 31, 2016 12:04 AM UTC by Minh Hon Chau **Last Updated:** Tue Sep 13, 2016 11:02 AM UTC **Owner:** Minh Hon Chau In scenario of admin continuation after headless, if saAmfClusterStartupTimeout configures short value, then the admin continuation will initiate when saAmfClusterStartupTimeout expires but the SU is still in OUT OF SERVICE. The eventual result is failure of admin operation after headless. --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- ___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #1988 AMF: Admin operation continuation does not work with short cluster init timeout
- **Milestone**: 5.2.FC --> 5.1.RC2 --- ** [tickets:#1988] AMF: Admin operation continuation does not work with short cluster init timeout** **Status:** assigned **Milestone:** 5.1.RC2 **Created:** Wed Aug 31, 2016 12:04 AM UTC by Minh Hon Chau **Last Updated:** Mon Sep 12, 2016 01:26 AM UTC **Owner:** Minh Hon Chau In scenario of admin continuation after headless, if saAmfClusterStartupTimeout configures short value, then the admin continuation will initiate when saAmfClusterStartupTimeout expires but the SU is still in OUT OF SERVICE. The eventual result is failure of admin operation after headless. --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- ___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #1988 AMF: Admin operation continuation does not work with short cluster init timeout
- **Milestone**: 5.1.RC1 --> 5.2.FC - **Comment**: Temporary change to 5.2 FC for 5.1 RC release process, will back to this after release. --- ** [tickets:#1988] AMF: Admin operation continuation does not work with short cluster init timeout** **Status:** assigned **Milestone:** 5.2.FC **Created:** Wed Aug 31, 2016 12:04 AM UTC by Minh Hon Chau **Last Updated:** Fri Sep 09, 2016 06:23 AM UTC **Owner:** Minh Hon Chau In scenario of admin continuation after headless, if saAmfClusterStartupTimeout configures short value, then the admin continuation will initiate when saAmfClusterStartupTimeout expires but the SU is still in OUT OF SERVICE. The eventual result is failure of admin operation after headless. --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- ___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #1988 AMF: Admin operation continuation does not work with short cluster init timeout
Attach a patch to fix problem in this ticket Attachments: - [1988.diff](https://sourceforge.net/p/opensaf/tickets/_discuss/thread/254c8488/6b32/attachment/1988.diff) (7.8 kB; text/x-patch) --- ** [tickets:#1988] AMF: Admin operation continuation does not work with short cluster init timeout** **Status:** assigned **Milestone:** 5.1.RC1 **Created:** Wed Aug 31, 2016 12:04 AM UTC by Minh Hon Chau **Last Updated:** Tue Sep 06, 2016 08:28 AM UTC **Owner:** Minh Hon Chau In scenario of admin continuation after headless, if saAmfClusterStartupTimeout configures short value, then the admin continuation will initiate when saAmfClusterStartupTimeout expires but the SU is still in OUT OF SERVICE. The eventual result is failure of admin operation after headless. --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- ___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #1988 AMF: Admin operation continuation does not work with short cluster init timeout
The attr saAmfClusterStartupTimeout currently is set as 10 sec by default. It's only started if all NCS SUs of active controller get assigned. In big clusters, if this timeout is still set as 10secs, when it times out there are still many nodes hasn't joined cluster, many SU out-of-service. AMFD could not start assignment when cluster init timeout. Aug 19 12:32:05.923649 osafamfd [6705:timer.cc:0066] >> avd_start_tmr: 1 Aug 19 12:32:15.987858 osafamfd [6705:cluster.cc:0055] >> avd_cluster_tmr_init_evh Aug 19 12:32:15.988226 osafamfd [6705:sg_2n_fsm.cc:2808] >> realign: 'safSg=2N,safApp=ABC-01' Aug 19 12:32:15.988254 osafamfd [6705:sg_2n_fsm.cc:0606] TR No in service SUs available in the SG Aug 19 12:32:15.988640 osafamfd [6705:sg_2n_fsm.cc:2808] >> realign: 'safSg=2N,safApp=ABC-02' Aug 19 12:32:15.988661 osafamfd [6705:sg_2n_fsm.cc:0606] TR No in service SUs available in the SG However, this does not cause any problem in cluster start-up scenario because AMFD will also start assignment up on receiving avd_su_oper_state_evh() by calling su_insvc(). This happen after a node completes joining cluster. The one joins cluster earlier, the better chance that its SU been assigned active. Also, if all NCS SUs of active controller have not been assigned, the cb state is not INIT_DONE, AMFD will reject node_up msg of all other nodes. In admin operation continuation after headless, AMFD can't do a similiar sequence as above, because the way SU has fresh assignment (su_insvc) is different from SU continues its pending assignment (susi_success). AMFD needs to have all nodes joined cluster before performing a continuation of admin operation. --- ** [tickets:#1988] AMF: Admin operation continuation does not work with short cluster init timeout** **Status:** assigned **Milestone:** 5.1.RC1 **Created:** Wed Aug 31, 2016 12:04 AM UTC by Minh Hon Chau **Last Updated:** Wed Aug 31, 2016 12:04 AM UTC **Owner:** Minh Hon Chau In scenario of admin continuation after headless, if saAmfClusterStartupTimeout configures short value, then the admin continuation will initiate when saAmfClusterStartupTimeout expires but the SU is still in OUT OF SERVICE. The eventual result is failure of admin operation after headless. --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- ___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #1988 AMF: Admin operation continuation does not work with short cluster init timeout
--- ** [tickets:#1988] AMF: Admin operation continuation does not work with short cluster init timeout** **Status:** assigned **Milestone:** 5.1.RC1 **Created:** Wed Aug 31, 2016 12:04 AM UTC by Minh Hon Chau **Last Updated:** Wed Aug 31, 2016 12:04 AM UTC **Owner:** Minh Hon Chau In scenario of admin continuation after headless, if saAmfClusterStartupTimeout configures short value, then the admin continuation will initiate when saAmfClusterStartupTimeout expires but the SU is still in OUT OF SERVICE. The eventual result is failure of admin operation after headless. --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- ___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets