- **status**: review --> fixed
- **Comment**:
commit 43028cfcc1e7b68e5edf82d16b4caa3cf8523c52 (HEAD -> develop,
origin/develop)
Author: Alex Jones <ajo...@rbbn.com>
Date: Fri May 10 12:48:37 2019 -0400
---
** [tickets:#3035] amfd: csi-remove responses can get lost during controller
switchover**
**Status:** fixed
**Milestone:** 5.19.06
**Created:** Mon May 06, 2019 02:21 PM UTC by Alex Jones
**Last Updated:** Fri May 10, 2019 04:50 PM UTC
**Owner:** Alex Jones
Seeing csi-remove responses get lost in the following setup:
1. N+M redundancy model (5 active, 1 standby)
2. 6 SUs in N+M model
3. Each SU has 5 pi components
4. SU failover set, and FailfastOnTerminationFailure set for all nodes
2. active controller also has active SI for N+M model
3. standby N+M SI is on a payload
4. one component on active controller in N+M SU receives TERM signal
5. this causes all components in that SU to get cleanup scripts called
6. another component in this SU fails to cleanup, and goes to TERM_FAILED state
7. this causes failfast (this is also the active controller)
8. CSI-remove requests have been sent to the payload, and payload has
responded, but the responses don't make it back to the active controller
because it has rebooted
9. New active controller receives them, but it drops them because of wrong
message id
We probably shouldn't do SU failover if reboot has been initiated.
---
Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is
subscribed to https://sourceforge.net/p/opensaf/tickets/
To unsubscribe from further messages, a project admin can change settings at
https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a
mailing list, you can unsubscribe from the mailing list.
_______________________________________________
Opensaf-tickets mailing list
Opensaf-tickets@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets