Hi Team,
I’m running an OpenStack cluster deployed using Kolla-Ansible, and I’m
using OpenvSwitch (OVS) as the neutron_plugin_agent. I want to upgrade to
OpenStack with zero-downtime.
During an upgrade of the OVS components (openvswitch-db and
openvswitch-vswitchd), I observe that the flows are lost. These flows do
not get recreated until the neutron-openvswitch-agent service is manually
restarted, which causes a noticeable disruption in network connectivity.
As a workaround, I’ve tried the following sequence:
* docker exec openvswitch_vswitchd ovs-appctl -T 5 -t ovs-vswitchd exit
docker restart openvswitch_db docker start openvswitch_vswitchd*
With this approach, the downtime is reduced to approximately 10 seconds,
and the flows are restored without restarting the Neutron agent.
However, I’m looking for a way to perform the upgrade *with zero downtime*,
or at least without having to restart the neutron-openvswitch-agent service.
During the issue, I noticed the following recurring log messages in
neutron-openvswitch-agent:
* 2025-06-25 09:12:34.977 7 ERROR neutron.agent.common.ovsdb_monitor
[...] Interface monitor is not active ... 2025-06-25 09:12:44.980 7
ERROR neutron.agent.common.ovsdb_monitor [...] Interface monitor is not
active*
These messages appear every few seconds until the agent is restarted. Full
log snippet:
* 2025-06-25 09:12:34.976 INFO ovs_neutron_agent [...] Agent rpc_loop -
iteration:17887 started 2025-06-25 09:12:34.977 ERROR ovsdb_monitor
[...] Interface monitor is not active 2025-06-25 09:12:34.977 INFO
ovs_neutron_agent [...] Agent rpc_loop - iteration:17887 completed.
Processed ports statistics: {'regular': {'added': 0, 'updated': 0,
'removed': 0}}. Elapsed:0.001 ... 2025-06-25 09:12:46.981 INFO
ovs_neutron_agent [...] Agent rpc_loop - iteration:17893 - starting
polling. Elapsed:0.001 2025-06-25 09:12:46.982 INFO ovs_neutron_agent
[...] Agent rpc_loop - iteration:17893 - port information retrieved.
Elapsed:0.002*
Has anyone encountered a similar issue or found a reliable strategy for
upgrading OVS in a containerized Kolla environment without flow loss?
Thanks in advance for your support and suggestions.
Best regards,
Ali
_______________________________________________
discuss mailing list
[email protected]
https://mail.openvswitch.org/mailman/listinfo/ovs-discuss