On 2/4/25 5:19 PM, Felix Huettner wrote:
> On Tue, Feb 04, 2025 at 11:23:08AM +0100, Dumitru Ceara wrote:
>> On 2/4/25 11:04 AM, Felix Huettner wrote:
>>> On Tue, Feb 04, 2025 at 10:35:25AM +0100, Felix Huettner via dev wrote:
>>>> On Mon, Feb 03, 2025 at 02:33:07PM +0100, Dumitru Ceara wrote:
>>>>> On 1/29/25 12:15 PM, Felix Huettner via dev wrote:
>>>>>> We now learn all routes inside the vrfs we also advertise routes on.
>>>>>> The routes are then placed in the southbound database for processing by
>>>>>> northd.
>>>>>>
>>>>>> Routes are only selected if matching the following rules:
>>>>>> 1. must not be a route advertised by us
>>>>>> 2. must not be a local connected route (as we want to not learn transfer
>>>>>> networks)
>>>>>> 3. the prefix must not be a link local address
>>>>>>
>>>>>> However we can not reliably determine over which link we learned the
>>>>>> route in case we have two LRPs of the same LR on the same chassis.
>>>>>> For now we just assume the routes on both links are identical.
>>>>>> Future commits will refine this.
>>>>>>
>>>>>> Signed-off-by: Felix Huettner <[email protected]>
>>>>>> ---
>>>>>
>>>>> Hi Felix,
>>>>>
>>>>> I have a few more (mostly minor) comments on this version.
>>>>
>>>> Hi Dumitru,
>>>>
>>>> thanks for the review.
>>>> The smaller things are addressed in the next version.
>>>>
>>>>>
>>>>>> v3->v4:
>>>>>> - addressed review comments.
>>>>>> v2->v3:
>>>>>> * Set monitor conditions on sb Learned_Route table.
>>>>>> * Do not learn routes if Learned_Route table does not exist (upgrades).
>>>>>>
>>>>>> controller/ovn-controller.c | 64 +++++++++++--
>>>>>> controller/route-exchange-netlink.c | 38 +++++++-
>>>>>> controller/route-exchange-netlink.h | 15 ++-
>>>>>> controller/route-exchange.c | 138 +++++++++++++++++++++++++++-
>>>>>> controller/route-exchange.h | 3 +
>>>>>> lib/ovn-util.c | 10 ++
>>>>>> lib/ovn-util.h | 1 +
>>>>>> tests/system-ovn.at | 21 +++++
>>>>>> 8 files changed, 277 insertions(+), 13 deletions(-)
>>>>>>
>>>>>> diff --git a/controller/ovn-controller.c b/controller/ovn-controller.c
>>>>>> index 1eb8d39d1..5b31f6fd2 100644
>>>>>> --- a/controller/ovn-controller.c
>>>>>> +++ b/controller/ovn-controller.c
>>>>>> @@ -233,7 +233,7 @@ update_sb_monitors(struct ovsdb_idl *ovnsb_idl,
>>>>>> *
>>>>>> * Monitor Template_Var for local chassis.
>>>>>> *
>>>>>> - * Monitor Advertised_Route for local datapaths.
>>>>>> + * Monitor Advertised/Learned_Route for local datapaths.
>>>>>> *
>>>>>> * We always monitor patch ports because they allow us to see the
>>>>>> linkages
>>>>>> * between related logical datapaths. That way, when we know that
>>>>>> we have
>>>>>> @@ -252,6 +252,7 @@ update_sb_monitors(struct ovsdb_idl *ovnsb_idl,
>>>>>> struct ovsdb_idl_condition chprv = OVSDB_IDL_CONDITION_INIT(&chprv);
>>>>>> struct ovsdb_idl_condition tv = OVSDB_IDL_CONDITION_INIT(&tv);
>>>>>> struct ovsdb_idl_condition ar = OVSDB_IDL_CONDITION_INIT(&ar);
>>>>>> + struct ovsdb_idl_condition lr = OVSDB_IDL_CONDITION_INIT(&lr);
>>>>>>
>>>>>> /* Always monitor all logical datapath groups. Otherwise, DPG
>>>>>> updates may
>>>>>> * be received *after* the lflows using it are seen by
>>>>>> ovn-controller.
>>>>>> @@ -277,6 +278,7 @@ update_sb_monitors(struct ovsdb_idl *ovnsb_idl,
>>>>>> ovsdb_idl_condition_add_clause_true(&chprv);
>>>>>> ovsdb_idl_condition_add_clause_true(&tv);
>>>>>> ovsdb_idl_condition_add_clause_true(&ar);
>>>>>> + ovsdb_idl_condition_add_clause_true(&lr);
>>>>>
>>>>> Same comment as for advertised routes in the previous patch. We might
>>>>> be able to move this under if (!chassis) {...}.
>>>
>>> Hi Dumitru,
>>>
>>> actually we can not do this (but i just noticed that).
>>>
>>> Assume we get a new LRP on a chassis of a running ovn-controller. That
>>> LRP belongs to a LR that has route-exchange set. The VRF belonging to
>>> that LR has already been created on the chassis and contains routes that
>>> ovn-controller should learn.
>>>
>>> In this case ovn-controller will in one iteration (at least if i
>>> understood it correctly):
>>> 1. claim the port
>>> 2. add the LR to local_datapaths
>>> 3. try to learn routes from the VRF
>>> 4. update monitoring conditions
>>>
>>> If we do only monitor learned routes for all local_datapaths then at the
>>> point where we learn the routes we did not yet call update_sb_monitors.
>>> So we would try to add a entry to Learned_Route that already exists
>>> there.
>>>
>>> In my understanding the options are:
>>> 1. monitor all Learned_Route entries
>>> 2. only try to learn routes after the monitoring condition has been
>>> updated.
>>>
>>> If you would prefer option 2, i would need some hint how to know if we
>>> have a monitoring condition set.
>>
>> We could decide to switch from monitoring all Learned_Routes to
>> monitoring a subset based on daemon_started_recently(). We do something
>> similar and delay deleting patch ports as long as ovn-controller has
>> "recently started" in the hope that we won't have to re-add them soon.
>>
>> Could that work?
>
> Hi Dumitru,
>
> i think that would only solve the issue on the startup of
> ovn-controller.
>
> But even later during runtime the same thing could happen if we get a
> new local_datapath and it already has its vrf filled with learnable
> routes.
> In this case i think we would update the monitoring condition after we
> would try to insert the routes to the southbound.
>
> So if we would want to use something similar to daemon_started_recently
> then i guess we would need that for each local datapath. Where we only
> start learning routes for this datapath once the monitoring condition
> has had sufficiently long time to update.
>
> But that still seems to be less safe than just monitoring everything.
>
I agree, it sounds complicated. Let's monitor everything for now but
let's add a TODO.rst item and an "xxx: " comment for this. I'm worried
that if the SB table has a lot of records we waste bandwidth/memory/cpu
for (mostly) nothing.
Thanks,
Dumitru
> What do you think?
>
> Thanks a lot,
> Felix
>
>>
>> Regards,
>> Dumitru
>>
>>>
>>> Thanks a lot,
>>> Felix
>>>
>>>>>
>>>>>> goto out;
>>>>>> }
>>>>>>
>>>>>> @@ -365,7 +367,6 @@ update_sb_monitors(struct ovsdb_idl *ovnsb_idl,
>>>>>> sbrec_dns_add_clause_datapaths(&dns, OVSDB_F_INCLUDES,
>>>>>> &uuid, 1);
>>>>>> sbrec_ip_multicast_add_clause_datapath(&ip_mcast,
>>>>>> OVSDB_F_EQ,
>>>>>> uuid);
>>>>>> - sbrec_advertised_route_add_clause_datapath(&ar, OVSDB_F_EQ,
>>>>>> uuid);
>>>>>> }
>>>>>>
>>>>>> /* Datapath groups are immutable, which means a new group
>>>>>> record is
>>>>>> @@ -379,6 +380,14 @@ update_sb_monitors(struct ovsdb_idl *ovnsb_idl,
>>>>>> sbrec_logical_flow_add_clause_logical_dp_group(&lf, OVSDB_F_NE,
>>>>>> NULL);
>>>>>> }
>>>>>>
>>>>>> + /* When the ports are getting bound to the chassis e.g incase of
>>>>>> + * restart, at that moment we don't have the local datapaths, to
>>>>>> avoid
>>>>>> + * removing the existing advertised routes from the vrf or removing
>>>>>> + * learned routes to the SB, we set condition to monitor all.
>>>>>> + */
>>>>>> + ovsdb_idl_condition_add_clause_true(&ar);
>>>>>> + ovsdb_idl_condition_add_clause_true(&lr);
>>>>>> +
>>>>>> out:;
>>>>>> unsigned int cond_seqnos[] = {
>>>>>> sb_table_set_req_mon_condition(ovnsb_idl, port_binding, &pb),
>>>>>> @@ -394,6 +403,7 @@ out:;
>>>>>> sb_table_set_req_mon_condition(ovnsb_idl, chassis_private,
>>>>>> &chprv),
>>>>>> sb_table_set_opt_mon_condition(ovnsb_idl, chassis_template_var,
>>>>>> &tv),
>>>>>> sb_table_set_opt_mon_condition(ovnsb_idl, advertised_route,
>>>>>> &ar),
>>>>>> + sb_table_set_opt_mon_condition(ovnsb_idl, learned_route, &lr),
>>>>>> };
>>>>>>
>>>>>> unsigned int expected_cond_seqno = 0;
>>>>>> @@ -414,6 +424,7 @@ out:;
>>>>>> ovsdb_idl_condition_destroy(&chprv);
>>>>>> ovsdb_idl_condition_destroy(&tv);
>>>>>> ovsdb_idl_condition_destroy(&ar);
>>>>>> + ovsdb_idl_condition_destroy(&lr);
>>>>>> return expected_cond_seqno;
>>>>>> }
>>>>>>
>>>>>> @@ -880,7 +891,8 @@ ctrl_register_ovs_idl(struct ovsdb_idl *ovs_idl)
>>>>>> SB_NODE(meter, "meter") \
>>>>>> SB_NODE(static_mac_binding, "static_mac_binding") \
>>>>>> SB_NODE(chassis_template_var, "chassis_template_var") \
>>>>>> - SB_NODE(advertised_route, "advertised_route")
>>>>>> + SB_NODE(advertised_route, "advertised_route") \
>>>>>> + SB_NODE(learned_route, "learned_route")
>>>>>>
>>>>>> enum sb_engine_node {
>>>>>> #define SB_NODE(NAME, NAME_STR) SB_##NAME,
>>>>>> @@ -5001,13 +5013,40 @@ route_sb_advertised_route_data_handler(struct
>>>>>> engine_node *node, void *data)
>>>>>> return true;
>>>>>> }
>>>>>>
>>>>>> +struct ed_type_route_exchange {
>>>>>> + /* We need the idl to check if a table exists. */
>>>>>> + struct ovsdb_idl *sb_idl;
>>>>>> +};
>>>>>> +
>>>>>> static void
>>>>>> -en_route_exchange_run(struct engine_node *node, void *data OVS_UNUSED)
>>>>>> +en_route_exchange_run(struct engine_node *node, void *data)
>>>>>> {
>>>>>> + struct ed_type_route_exchange *re = data;
>>>>>> +
>>>>>> + struct ovsdb_idl_index *sbrec_learned_route_by_datapath =
>>>>>> + engine_ovsdb_node_get_index(
>>>>>> + engine_get_input("SB_learned_route", node),
>>>>>> + "datapath");
>>>>>> +
>>>>>> + struct ovsdb_idl_index *sbrec_port_binding_by_name =
>>>>>> + engine_ovsdb_node_get_index(
>>>>>> + engine_get_input("SB_port_binding", node),
>>>>>> + "name");
>>>>>> +
>>>>>> struct ed_type_route *route_data =
>>>>>> engine_get_input_data("route", node);
>>>>>>
>>>>>> + /* There can not actually be any routes to advertise unless we also
>>>>>> have
>>>>>> + * the Learned_Route table, since they where introduced in the same
>>>>>> + * release. */
>>>>>> + if (!sbrec_server_has_learned_route_table(re->sb_idl)) {
>>>>>> + return;
>>>>>> + }
>>>>>> +
>>>>>> struct route_exchange_ctx_in r_ctx_in = {
>>>>>> + .ovnsb_idl_txn = engine_get_context()->ovnsb_idl_txn,
>>>>>> + .sbrec_learned_route_by_datapath =
>>>>>> sbrec_learned_route_by_datapath,
>>>>>> + .sbrec_port_binding_by_name = sbrec_port_binding_by_name,
>>>>>> .announce_routes = &route_data->announce_routes,
>>>>>> };
>>>>>>
>>>>>> @@ -5022,9 +5061,11 @@ en_route_exchange_run(struct engine_node *node,
>>>>>> void *data OVS_UNUSED)
>>>>>>
>>>>>> static void *
>>>>>> en_route_exchange_init(struct engine_node *node OVS_UNUSED,
>>>>>> - struct engine_arg *arg OVS_UNUSED)
>>>>>> + struct engine_arg *arg)
>>>>>> {
>>>>>> - return NULL;
>>>>>> + struct ed_type_route_exchange *re = xzalloc(sizeof(*re));
>>>>>> + re->sb_idl = arg->sb_idl;
>>>>>> + return re;
>>>>>> }
>>>>>>
>>>>>> static void
>>>>>> @@ -5239,6 +5280,9 @@ main(int argc, char *argv[])
>>>>>> struct ovsdb_idl_index *sbrec_chassis_template_var_index_by_chassis
>>>>>> = ovsdb_idl_index_create1(ovnsb_idl_loop.idl,
>>>>>>
>>>>>> &sbrec_chassis_template_var_col_chassis);
>>>>>> + struct ovsdb_idl_index *sbrec_learned_route_index_by_datapath
>>>>>> + = ovsdb_idl_index_create1(ovnsb_idl_loop.idl,
>>>>>> + &sbrec_learned_route_col_datapath);
>>>>>>
>>>>>> ovsdb_idl_track_add_all(ovnsb_idl_loop.idl);
>>>>>> ovsdb_idl_omit_alert(ovnsb_idl_loop.idl,
>>>>>> @@ -5265,6 +5309,8 @@ main(int argc, char *argv[])
>>>>>> &sbrec_ha_chassis_group_col_external_ids);
>>>>>> ovsdb_idl_omit(ovnsb_idl_loop.idl,
>>>>>> &sbrec_advertised_route_col_external_ids);
>>>>>> + ovsdb_idl_omit(ovnsb_idl_loop.idl,
>>>>>> + &sbrec_learned_route_col_external_ids);
>>>>>>
>>>>>> /* We don't want to monitor Connection table at all. So omit all the
>>>>>> * columns. */
>>>>>> @@ -5358,6 +5404,10 @@ main(int argc, char *argv[])
>>>>>> route_sb_advertised_route_data_handler);
>>>>>>
>>>>>> engine_add_input(&en_route_exchange, &en_route, NULL);
>>>>>> + engine_add_input(&en_route_exchange, &en_sb_learned_route,
>>>>>> + engine_noop_handler);
>>>>>> + engine_add_input(&en_route_exchange, &en_sb_port_binding,
>>>>>> + engine_noop_handler);
>>>>>>
>>>>>> engine_add_input(&en_addr_sets, &en_sb_address_set,
>>>>>> addr_sets_sb_address_set_handler);
>>>>>> @@ -5576,6 +5626,8 @@ main(int argc, char *argv[])
>>>>>> sbrec_static_mac_binding_by_datapath);
>>>>>> engine_ovsdb_node_add_index(&en_sb_chassis_template_var, "chassis",
>>>>>>
>>>>>> sbrec_chassis_template_var_index_by_chassis);
>>>>>> + engine_ovsdb_node_add_index(&en_sb_learned_route, "datapath",
>>>>>> + sbrec_learned_route_index_by_datapath);
>>>>>> engine_ovsdb_node_add_index(&en_ovs_flow_sample_collector_set, "id",
>>>>>> ovsrec_flow_sample_collector_set_by_id);
>>>>>> engine_ovsdb_node_add_index(&en_ovs_port, "qos",
>>>>>> ovsrec_port_by_qos);
>>>>>> diff --git a/controller/route-exchange-netlink.c
>>>>>> b/controller/route-exchange-netlink.c
>>>>>> index 4ba21ecaa..74741a3fd 100644
>>>>>> --- a/controller/route-exchange-netlink.c
>>>>>> +++ b/controller/route-exchange-netlink.c
>>>>>> @@ -196,8 +196,19 @@ re_nl_delete_route(uint32_t table_id, const struct
>>>>>> in6_addr *dst,
>>>>>> return modify_route(RTM_DELROUTE, 0, table_id, dst, plen);
>>>>>> }
>>>>>>
>>>>>> +void
>>>>>> +re_nl_learned_routes_destroy(struct ovs_list *learned_routes)
>>>>>> +{
>>>>>> + struct re_nl_received_route_node *rr;
>>>>>> + LIST_FOR_EACH_POP (rr, list_node, learned_routes) {
>>>>>> + free(rr);
>>>>>> + }
>>>>>> +}
>>>>>> +
>>>>>> struct route_msg_handle_data {
>>>>>> struct hmapx *routes_to_advertise;
>>>>>> + struct ovs_list *learned_routes;
>>>>>> + const struct sbrec_datapath_binding *db;
>>>>>
>>>>> Nit: this would become reverse xmas tree if we move the 'db' field at
>>>>> the top. It also kind of makes sense because it's per datapath binding.
>>>>>
>>>>>> };
>>>>>>
>>>>>> static void
>>>>>> @@ -208,8 +219,25 @@ handle_route_msg(const struct route_table_msg *msg,
>>>>>> void *data)
>>>>>> struct advertise_route_entry *ar;
>>>>>> int err;
>>>>>>
>>>>>> - /* This route is not from us, we should not touch it. */
>>>>>> + /* This route is not from us, so we learn it. */
>>>>>> if (rd->rtm_protocol != RTPROT_OVN) {
>>>>>> + if (prefix_is_link_local(&rd->rta_dst, rd->rtm_dst_len)) {
>>>>>> + return;
>>>>>> + }
>>>>>> + struct route_data_nexthop *nexthop;
>>>>>> + LIST_FOR_EACH (nexthop, nexthop_node, &rd->nexthops) {
>>>>>> + if (ipv6_is_zero(&nexthop->addr)) {
>>>>>> + /* This is most likely an address on the local link.
>>>>>> + * As we just want to learn remote routes we do not
>>>>>> need it.*/
>>>>>> + continue;
>>>>>> + }
>>>>>> + struct re_nl_received_route_node *rr = xzalloc(sizeof *rr);
>>>>>
>>>>> Nit: xmalloc() is good enough.
>>>>>
>>>>>> + ovs_list_push_back(handle_data->learned_routes,
>>>>>> &rr->list_node);
>>>>>
>>>>> Nit: I'd push this to the list after it is fully initialized.
>>>>>
>>>>>> + rr->db = handle_data->db;
>>>>>> + rr->addr = rd->rta_dst;
>>>>>> + rr->plen = rd->rtm_dst_len;
>>>>>> + rr->nexthop = nexthop->addr;
>>>>>> + }
>>>>>> return;
>>>>>> }
>>>>>>
>>>>>> @@ -236,7 +264,9 @@ handle_route_msg(const struct route_table_msg *msg,
>>>>>> void *data)
>>>>>> }
>>>>>>
>>>>>> void
>>>>>> -re_nl_sync_routes(uint32_t table_id, const struct hmap *routes)
>>>>>> +re_nl_sync_routes(uint32_t table_id, const struct hmap *routes,
>>>>>> + struct ovs_list *learned_routes,
>>>>>> + const struct sbrec_datapath_binding *db)
>>>>>> {
>>>>>> struct hmapx routes_to_advertise =
>>>>>> HMAPX_INITIALIZER(&routes_to_advertise);
>>>>>> struct advertise_route_entry *ar;
>>>>>> @@ -249,11 +279,13 @@ re_nl_sync_routes(uint32_t table_id, const struct
>>>>>> hmap *routes)
>>>>>> * in the system. */
>>>>>> struct route_msg_handle_data data = {
>>>>>> .routes_to_advertise = &routes_to_advertise,
>>>>>> + .learned_routes = learned_routes,
>>>>>> + .db = db,
>>>>>> };
>>>>>> route_table_dump_one_table(table_id, handle_route_msg,
>>>>>> &data);
>>>>>>
>>>>>> - /* Add any remaining routes in the host_routes hmap to the system
>>>>>> routing
>>>>>> + /* Add any remaining routes in the routes hmap to the system routing
>>>>>> * table. */
>>>>>> struct hmapx_node *hn;
>>>>>> HMAPX_FOR_EACH (hn, &routes_to_advertise) {
>>>>>> diff --git a/controller/route-exchange-netlink.h
>>>>>> b/controller/route-exchange-netlink.h
>>>>>> index 93b593ad2..bc77504ae 100644
>>>>>> --- a/controller/route-exchange-netlink.h
>>>>>> +++ b/controller/route-exchange-netlink.h
>>>>>> @@ -19,6 +19,8 @@
>>>>>> #define ROUTE_EXCHANGE_NETLINK_H 1
>>>>>>
>>>>>> #include <stdint.h>
>>>>>> +#include "openvswitch/list.h"
>>>>>> +#include <netinet/in.h>
>>>>>>
>>>>>> /* This value is arbitrary but currently unused.
>>>>>> * See
>>>>>> https://github.com/iproute2/iproute2/blob/main/etc/iproute2/rt_protos */
>>>>>> @@ -27,6 +29,14 @@
>>>>>> struct in6_addr;
>>>>>> struct hmap;
>>>>>>
>>>>>> +struct re_nl_received_route_node {
>>>>>> + struct ovs_list list_node;
>>>>>> + const struct sbrec_datapath_binding *db;
>>>>>
>>>>> Nit: I think it might look "slightly better" if we move this field one
>>>>> line above.
>>>>>
>>>>>> + struct in6_addr addr;
>>>>>
>>>>> Nit: maybe 'prefix' is more descriptive?
>>>>>
>>>>>> + unsigned int plen;
>>>>>> + struct in6_addr nexthop;
>>>>>> +};
>>>>>> +
>>>>>> int re_nl_create_vrf(const char *ifname, uint32_t table_id);
>>>>>> int re_nl_delete_vrf(const char *ifname);
>>>>>>
>>>>>> @@ -37,6 +47,9 @@ int re_nl_delete_route(uint32_t table_id, const struct
>>>>>> in6_addr *dst,
>>>>>>
>>>>>> void re_nl_dump(uint32_t table_id);
>>>>>>
>>>>>> -void re_nl_sync_routes(uint32_t table_id, const struct hmap *routes);
>>>>>> +void re_nl_learned_routes_destroy(struct ovs_list *learned_routes);
>>>>>> +void re_nl_sync_routes(uint32_t table_id, const struct hmap *routes,
>>>>>> + struct ovs_list *learned_routes,
>>>>>> + const struct sbrec_datapath_binding *db);
>>>>>>
>>>>>> #endif /* route-exchange-netlink.h */
>>>>>> diff --git a/controller/route-exchange.c b/controller/route-exchange.c
>>>>>> index 0942780e2..a163968a7 100644
>>>>>> --- a/controller/route-exchange.c
>>>>>> +++ b/controller/route-exchange.c
>>>>>> @@ -21,6 +21,7 @@
>>>>>> #include <net/if.h>
>>>>>>
>>>>>> #include "openvswitch/vlog.h"
>>>>>> +#include "openvswitch/list.h"
>>>>>>
>>>>>> #include "lib/ovn-sb-idl.h"
>>>>>>
>>>>>> @@ -37,6 +38,127 @@ static struct vlog_rate_limit rl =
>>>>>> VLOG_RATE_LIMIT_INIT(5, 20);
>>>>>>
>>>>>> static struct sset _maintained_vrfs =
>>>>>> SSET_INITIALIZER(&_maintained_vrfs);
>>>>>>
>>>>>> +struct route_entry {
>>>>>> + struct hmap_node hmap_node;
>>>>>> +
>>>>>> + const struct sbrec_learned_route *sb_route;
>>>>>> +};
>>>>>> +
>>>>>> +static struct route_entry *
>>>>>
>>>>> We never use the return value, we might as well make this "void".
>>>>>
>>>>>> +route_alloc_entry(struct hmap *routes,
>>>>>
>>>>> route_insert_entry() or route_add_entry() would be more accurate. In
>>>>> sb_sync_learned_routes() we end a loop iteration with:
>>>>>
>>>>> route_e = route_alloc_entry(&sync_routes, sb_route);
>>>>> }
>>>>>
>>>>> Which made me wonder if we leak memory. We don't because
>>>>> route_alloc_entry() inserts into the routes map too.
>>>>>
>>>>>> + const struct sbrec_learned_route *sb_route)
>>>>>> +{
>>>>>> + struct route_entry *route_e = xzalloc(sizeof *route_e);
>>>>>
>>>>> Nit: xmalloc() is fine here.
>>>>>
>>>>>> +
>>>>>> + route_e->sb_route = sb_route;
>>>>>
>>>>> Nit: I'd move the newline from the line above here.
>>>>>
>>>>>> + uint32_t hash = uuid_hash(&sb_route->datapath->header_.uuid);
>>>>>> + hash = hash_string(sb_route->logical_port->logical_port, hash);
>>>>>> + hash = hash_string(sb_route->ip_prefix, hash);
>>>>>> + hmap_insert(routes, &route_e->hmap_node, hash);
>>>>>> +
>>>>>> + return route_e;
>>>>>> +}
>>>>>> +
>>>>>> +static struct route_entry *
>>>>>> +route_lookup(struct hmap *route_map,
>>>>>> + const struct sbrec_datapath_binding *sb_db,
>>>>>> + const struct sbrec_port_binding *logical_port,
>>>>>> + const char *ip_prefix, const char *nexthop)
>>>>>> +{
>>>>>> + struct route_entry *route_e;
>>>>>> + uint32_t hash;
>>>>>> +
>>>>>> + hash = uuid_hash(&sb_db->header_.uuid);
>>>>>> + hash = hash_string(logical_port->logical_port, hash);
>>>>>> + hash = hash_string(ip_prefix, hash);
>>>>>> + HMAP_FOR_EACH_WITH_HASH (route_e, hmap_node, hash, route_map) {
>>>>>> + if (route_e->sb_route->datapath != sb_db) {
>>>>>> + continue;
>>>>>> + }
>>>>>> +
>>>>>> + if (route_e->sb_route->logical_port != logical_port) {
>>>>>> + continue;
>>>>>> + }
>>>>>> +
>>>>>> + if (strcmp(route_e->sb_route->ip_prefix, ip_prefix)) {
>>>>>> + continue;
>>>>>> + }
>>>>>> +
>>>>>> + if (strcmp(route_e->sb_route->nexthop, nexthop)) {
>>>>>> + continue;
>>>>>> + }
>>>>>> +
>>>>>> + return route_e;
>>>>>> + }
>>>>>> +
>>>>>> + return NULL;
>>>>>> +}
>>>>>> +
>>>>>> +static void
>>>>>> +sb_sync_learned_routes(const struct ovs_list *learned_routes,
>>>>>> + const struct sbrec_datapath_binding *datapath,
>>>>>> + const struct sset *bound_ports,
>>>>>> + struct ovsdb_idl_txn *ovnsb_idl_txn,
>>>>>> + struct ovsdb_idl_index
>>>>>> *sbrec_port_binding_by_name,
>>>>>> + struct ovsdb_idl_index
>>>>>> *sbrec_learned_route_by_datapath)
>>>>>> +{
>>>>>> + struct hmap sync_routes = HMAP_INITIALIZER(&sync_routes);
>>>>>> + struct route_entry *route_e;
>>>>>
>>>>> This can be moved inside the second loop below.
>>>>
>>>> It is still used at HMAP_FOR_EACH_POP, so i would leave it here.
>>>>
>>>> The rest will be in the next version.
>>>>
>>>> Thanks a lot,
>>>> Felix
>>>>
>>>>
>>>>>
>>>>>> + const struct sbrec_learned_route *sb_route;
>>>>>> +
>>>>>> + struct sbrec_learned_route *filter =
>>>>>> +
>>>>>> sbrec_learned_route_index_init_row(sbrec_learned_route_by_datapath);
>>>>>> + sbrec_learned_route_index_set_datapath(filter, datapath);
>>>>>> + SBREC_LEARNED_ROUTE_FOR_EACH_EQUAL (sb_route, filter,
>>>>>> +
>>>>>> sbrec_learned_route_by_datapath) {
>>>>>> + /* If the port is not local we don't care about it.
>>>>>> + * Some other ovn-controller will handle it. */
>>>>>> + if (!sset_contains(bound_ports,
>>>>>> + sb_route->logical_port->logical_port)) {
>>>>>> + continue;
>>>>>> + }
>>>>>> + route_e = route_alloc_entry(&sync_routes, sb_route);
>>>>>
>>>>> Unused 'route_e'. As commented above, this is an actual "insert" in the
>>>>> hmap.
>>>>>
>>>>>> + }
>>>>>> + sbrec_learned_route_index_destroy_row(filter);
>>>>>> +
>>>>>> + struct re_nl_received_route_node *learned_route;
>>>>>> + LIST_FOR_EACH (learned_route, list_node, learned_routes) {
>>>>>> + char *ip_prefix = normalize_v46_prefix(&learned_route->addr,
>>>>>> + learned_route->plen);
>>>>>> + char *nexthop = normalize_v46(&learned_route->nexthop);
>>>>>> +
>>>>>> + const char *logical_port_name;
>>>>>> + SSET_FOR_EACH (logical_port_name, bound_ports) {
>>>>>> + const struct sbrec_port_binding *logical_port =
>>>>>> + lport_lookup_by_name(sbrec_port_binding_by_name,
>>>>>> + logical_port_name);
>>>>>> + if (!logical_port) {
>>>>>> + continue;
>>>>>> + }
>>>>>> + route_e = route_lookup(&sync_routes, datapath,
>>>>>> + logical_port, ip_prefix, nexthop);
>>>>>> + if (route_e) {
>>>>>> + hmap_remove(&sync_routes, &route_e->hmap_node);
>>>>>> + free(route_e);
>>>>>> + } else {
>>>>>> + sb_route = sbrec_learned_route_insert(ovnsb_idl_txn);
>>>>>> + sbrec_learned_route_set_datapath(sb_route, datapath);
>>>>>> + sbrec_learned_route_set_logical_port(sb_route,
>>>>>> logical_port);
>>>>>> + sbrec_learned_route_set_ip_prefix(sb_route, ip_prefix);
>>>>>> + sbrec_learned_route_set_nexthop(sb_route, nexthop);
>>>>>> + }
>>>>>> + }
>>>>>> + free(ip_prefix);
>>>>>> + free(nexthop);
>>>>>> + }
>>>>>> +
>>>>>> + HMAP_FOR_EACH_POP (route_e, hmap_node, &sync_routes) {
>>>>>> + sbrec_learned_route_delete(route_e->sb_route);
>>>>>> + free(route_e);
>>>>>> + }
>>>>>> + hmap_destroy(&sync_routes);
>>>>>> +}
>>>>>> +
>>>>>> void
>>>>>> route_exchange_run(struct route_exchange_ctx_in *r_ctx_in,
>>>>>> struct route_exchange_ctx_out *r_ctx_out OVS_UNUSED)
>>>>>> @@ -46,8 +168,6 @@ route_exchange_run(struct route_exchange_ctx_in
>>>>>> *r_ctx_in,
>>>>>>
>>>>>> const struct advertise_datapath_entry *ad;
>>>>>> HMAP_FOR_EACH (ad, node, r_ctx_in->announce_routes) {
>>>>>> - struct hmap received_routes
>>>>>> - = HMAP_INITIALIZER(&received_routes);
>>>>>> uint32_t table_id = ad->db->tunnel_key;
>>>>>> char vrf_name[IFNAMSIZ + 1];
>>>>>> snprintf(vrf_name, sizeof vrf_name, "ovnvrf%"PRIi32, table_id);
>>>>>> @@ -72,9 +192,21 @@ route_exchange_run(struct route_exchange_ctx_in
>>>>>> *r_ctx_in,
>>>>>> sset_find_and_delete(&old_maintained_vrfs, vrf_name);
>>>>>> }
>>>>>>
>>>>>> - re_nl_sync_routes(ad->db->tunnel_key, &ad->routes);
>>>>>> + struct ovs_list received_routes = OVS_LIST_INITIALIZER(
>>>>>> + &received_routes);
>>>>>> +
>>>>>> + re_nl_sync_routes(ad->db->tunnel_key, &ad->routes,
>>>>>> + &received_routes, ad->db);
>>>>>> +
>>>>>> + sb_sync_learned_routes(&received_routes, ad->db,
>>>>>> + &ad->bound_ports,
>>>>>> r_ctx_in->ovnsb_idl_txn,
>>>>>> + r_ctx_in->sbrec_port_binding_by_name,
>>>>>> +
>>>>>> r_ctx_in->sbrec_learned_route_by_datapath);
>>>>>> +
>>>>>> + re_nl_learned_routes_destroy(&received_routes);
>>>>>> }
>>>>>>
>>>>>> +
>>>>>
>>>>> Nit: unrelated newline.
>>>>>
>>>>>> /* Remove VRFs previously maintained by us not found in the above
>>>>>> loop. */
>>>>>> const char *vrf_name;
>>>>>> SSET_FOR_EACH_SAFE (vrf_name, &old_maintained_vrfs) {
>>>>>> diff --git a/controller/route-exchange.h b/controller/route-exchange.h
>>>>>> index 65520242b..d23bb37a2 100644
>>>>>> --- a/controller/route-exchange.h
>>>>>> +++ b/controller/route-exchange.h
>>>>>> @@ -19,6 +19,9 @@
>>>>>> #define ROUTE_EXCHANGE_H 1
>>>>>>
>>>>>> struct route_exchange_ctx_in {
>>>>>> + struct ovsdb_idl_txn *ovnsb_idl_txn;
>>>>>> + struct ovsdb_idl_index *sbrec_port_binding_by_name;
>>>>>> + struct ovsdb_idl_index *sbrec_learned_route_by_datapath;
>>>>>> /* Contains struct advertise_datapath_entry */
>>>>>> const struct hmap *announce_routes;
>>>>>> };
>>>>>> diff --git a/lib/ovn-util.c b/lib/ovn-util.c
>>>>>> index ed847517a..507847280 100644
>>>>>> --- a/lib/ovn-util.c
>>>>>> +++ b/lib/ovn-util.c
>>>>>> @@ -822,6 +822,16 @@ normalize_v46_prefix(const struct in6_addr *prefix,
>>>>>> unsigned int plen)
>>>>>> }
>>>>>> }
>>>>>>
>>>>>> +char *
>>>>>> +normalize_v46(const struct in6_addr *prefix)
>>>>>> +{
>>>>>> + if (IN6_IS_ADDR_V4MAPPED(prefix)) {
>>>>>> + return normalize_ipv4_prefix(in6_addr_get_mapped_ipv4(prefix),
>>>>>> 32);
>>>>>> + } else {
>>>>>> + return normalize_ipv6_prefix(prefix, 128);
>>>>>> + }
>>>>>> +}
>>>>>> +
>>>>>> char *
>>>>>> str_tolower(const char *orig)
>>>>>> {
>>>>>> diff --git a/lib/ovn-util.h b/lib/ovn-util.h
>>>>>> index 31c2c68df..8d8fd989b 100644
>>>>>> --- a/lib/ovn-util.h
>>>>>> +++ b/lib/ovn-util.h
>>>>>> @@ -207,6 +207,7 @@ bool ip46_parse(const char *ip_str, struct in6_addr
>>>>>> *ip);
>>>>>> char *normalize_ipv4_prefix(ovs_be32 ipv4, unsigned int plen);
>>>>>> char *normalize_ipv6_prefix(const struct in6_addr *ipv6, unsigned int
>>>>>> plen);
>>>>>> char *normalize_v46_prefix(const struct in6_addr *prefix, unsigned int
>>>>>> plen);
>>>>>> +char *normalize_v46(const struct in6_addr *prefix);
>>>>>>
>>>>>> /* Returns a lowercase copy of orig.
>>>>>> * Caller must free the returned string.
>>>>>> diff --git a/tests/system-ovn.at b/tests/system-ovn.at
>>>>>> index 760c97a5d..dc99d4c57 100644
>>>>>> --- a/tests/system-ovn.at
>>>>>> +++ b/tests/system-ovn.at
>>>>>> @@ -15048,6 +15048,16 @@ blackhole 192.0.2.3 proto 84
>>>>>> blackhole 192.0.2.10 proto 84
>>>>>> blackhole 198.51.100.0/24 proto 84])
>>>>>>
>>>>>> +# Now we test route learning.
>>>>>> +check_row_count Learned_Route 0
>>>>>> +check ip route add 233.252.0.0/24 via 192.168.10.10 dev lo onlink vrf
>>>>>> ovnvrf1337
>>>>>> +# For now we trigger a recompute as route watching is not yet
>>>>>> implemented.
>>>>>> +check ovn-appctl -t ovn-controller inc-engine/recompute
>>>>>> +check ovn-nbctl --wait=hv sync
>>>>>> +check_row_count Learned_Route 1
>>>>>> +lp=$(fetch_column port_binding _uuid logical_port=internet-phys)
>>>>>> +check_row_count Learned_Route 1 logical_port=$lp
>>>>>> ip_prefix=233.252.0.0/24 nexthop=192.168.10.10
>>>>>> +
>>>>>> OVS_APP_EXIT_AND_WAIT([ovn-controller])
>>>>>>
>>>>>> as ovn-sb
>>>>>> @@ -15209,6 +15219,7 @@ check ovn-nbctl lr-nat-add pr1 dnat_and_snat
>>>>>> 192.0.2.10 10.0.0.2
>>>>>> check ovn-nbctl lsp-add p2 vif2 \
>>>>>> -- lsp-set-addresses vif2 "00:00:ff:ff:ff:02 198.51.100.10"
>>>>>> check ovn-nbctl lr-route-add internet 198.51.100.0/24 192.0.2.3
>>>>>> + .ovnsb_idl = re->sb_idl,
>>>>>>
>>>>>> # Configure external connectivity.
>>>>>> check ovs-vsctl set Open_vSwitch .
>>>>>> external-ids:ovn-bridge-mappings=phynet:br-ext
>>>>>> @@ -15251,6 +15262,16 @@ blackhole 192.0.2.3 proto 84
>>>>>> blackhole 192.0.2.10 proto 84
>>>>>> blackhole 198.51.100.0/24 proto 84])
>>>>>>
>>>>>> +# Now we test route learning.
>>>>>> +check_row_count Learned_Route 0
>>>>>> +check ip route add 233.252.0.0/24 via 192.168.10.10 dev lo onlink vrf
>>>>>> ovnvrf1337
>>>>>> +# For now we trigger a recompute as route watching is not yet
>>>>>> implemented.
>>>>>> +check ovn-appctl -t ovn-controller inc-engine/recompute
>>>>>> +check ovn-nbctl --wait=hv sync
>>>>>> +check_row_count Learned_Route 2
>>>>>> +lp=$(fetch_column port_binding _uuid logical_port=internet-phys)
>>>>>> +check_row_count Learned_Route 1 logical_port=$lp
>>>>>> ip_prefix=233.252.0.0/24 nexthop=192.168.10.10
>>>>>> +
>>>>>> as ovn-sb
>>>>>> OVS_APP_EXIT_AND_WAIT([ovsdb-server])
>>>>>>
>>>>>
>>>>> Thanks,
>>>>> Dumitru
>>>>>
>>>> _______________________________________________
>>>> dev mailing list
>>>> [email protected]
>>>> https://mail.openvswitch.org/mailman/listinfo/ovs-dev
>>>
>>
>> _______________________________________________
>> dev mailing list
>> [email protected]
>> https://mail.openvswitch.org/mailman/listinfo/ovs-dev
>
_______________________________________________
dev mailing list
[email protected]
https://mail.openvswitch.org/mailman/listinfo/ovs-dev