---

** [tickets:#3035] amfd: csi-remove responses can get lost during controller 
switchover**

**Status:** accepted
**Milestone:** 5.19.06
**Created:** Mon May 06, 2019 02:21 PM UTC by Alex Jones
**Last Updated:** Mon May 06, 2019 02:21 PM UTC
**Owner:** Alex Jones


Seeing csi-remove responses get lost in the following setup:

1. N+M redundancy model (5 active, 1 standby)
2. 6 SUs in N+M model
3. Each SU has 5 pi components
4. SU failover set, and FailfastOnTerminationFailure set for all nodes
2. active controller also has active SI for N+M model
3. standby N+M SI is on a payload
4. one component on active controller in N+M SU receives TERM signal
5. this causes all components in that SU to get cleanup scripts called
6. another component in this SU fails to cleanup, and goes to TERM_FAILED state
7. this causes failfast (this is also the active controller)
8. CSI-remove requests have been sent to the payload, and payload has 
responded, but the responses don't make it back to the active controller 
because it has rebooted
9. New active controller receives them, but it drops them because of wrong 
message id

We probably shouldn't do SU failover if reboot has been initiated.


---

Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is 
subscribed to https://sourceforge.net/p/opensaf/tickets/

To unsubscribe from further messages, a project admin can change settings at 
https://sourceforge.net/p/opensaf/admin/tickets/options.  Or, if this is a 
mailing list, you can unsubscribe from the mailing list.
_______________________________________________
Opensaf-tickets mailing list
Opensaf-tickets@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets

Reply via email to