---
** [tickets:#3035] amfd: csi-remove responses can get lost during controller
switchover**
**Status:** accepted
**Milestone:** 5.19.06
**Created:** Mon May 06, 2019 02:21 PM UTC by Alex Jones
**Last Updated:** Mon May 06, 2019 02:21 PM UTC
**Owner:** Alex Jones
Seeing csi-remove responses get lost in the following setup:
1. N+M redundancy model (5 active, 1 standby)
2. 6 SUs in N+M model
3. Each SU has 5 pi components
4. SU failover set, and FailfastOnTerminationFailure set for all nodes
2. active controller also has active SI for N+M model
3. standby N+M SI is on a payload
4. one component on active controller in N+M SU receives TERM signal
5. this causes all components in that SU to get cleanup scripts called
6. another component in this SU fails to cleanup, and goes to TERM_FAILED state
7. this causes failfast (this is also the active controller)
8. CSI-remove requests have been sent to the payload, and payload has
responded, but the responses don't make it back to the active controller
because it has rebooted
9. New active controller receives them, but it drops them because of wrong
message id
We probably shouldn't do SU failover if reboot has been initiated.
---
Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is
subscribed to https://sourceforge.net/p/opensaf/tickets/
To unsubscribe from further messages, a project admin can change settings at
https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a
mailing list, you can unsubscribe from the mailing list.
_______________________________________________
Opensaf-tickets mailing list
Opensaf-tickets@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets