On 7/9/25 11:35 PM, Han Zhou wrote: > On Wed, Jul 9, 2025 at 11:38 AM Dumitru Ceara <dce...@redhat.com> wrote: >> >> On 7/9/25 8:01 PM, Han Zhou wrote: >>> On Wed, Dec 11, 2024 at 5:35 AM Dumitru Ceara <dce...@redhat.com> wrote: >>>> >>>> On 12/10/24 12:35 PM, Ales Musil wrote: >>>>> On Mon, Nov 11, 2024 at 11:43 AM Dumitru Ceara <dce...@redhat.com> >>> wrote: >>>>> >>>>>> The actions that retrieve the original tuple destination IPs and > ports >>>>>> were used in the following way in ovn-northd: >>>>>> >>>>>> REG_ORIG_DIP_IPV4 " = ct_nw_dst() >>>>>> REG_ORIG_DIP_IPV6 " = ct_ip6_dst() >>>>>> REG_ORIG_TP_DPORT " = ct_tp_dst() >>>>>> >>>>>> Where: >>>>>> REG_ORIG_DIP_IPV4 is reg1 >>>>>> REG_ORIG_DIP_IPV6 is xxreg1 >>>>>> REG_ORIG_TP_DPORT is reg2[0..15] >>>>>> >>>>>> While the actions use a different intermediate register: >>>>>> ct_nw_dst() was reg4<-0; reg4<-ct_nw_dst >>>>>> ct_ip6_dst() was xxreg0<-0; xxreg0<-ct_ip6_dst >>>>>> ct_tp_dst() was reg8[0..15]<-0; reg8[0..15]<-ct_tp_dst >>>>>> >>>>>> We can reduce the set of registers we use in the OVN pipeline by >>>>>> changing the action definition to use the same registers ovn-northd >>> uses >>>>>> as final destination for the ct_*_dst() values. This will generate > the >>>>>> following openflow rules: >>>>>> >>>>>> REG_ORIG_DIP_IPV4 " = ct_nw_dst() >>>>>> reg1<-0; reg1<-ct_nw_dst; reg1<-reg1 >>>>>> REG_ORIG_DIP_IPV6 " = ct_ip6_dst() >>>>>> xxreg1<-0; xxreg1<-ct_ip6_dst; xxreg1<-xxreg1 >>>>>> REG_ORIG_TP_DPORT " = ct_tp_dst() >>>>>> reg2[0..15]<-0; reg2[0..15]<-ct_tp_dst; reg2[0..15]<-reg2[0..15] >>>>>> >>>>>> As Ilya Maximets points out, overlapping source and destination are >>>>>> well defined for move actions: >>>>>> >>>>>> >>> > https://opennetworking.org/wp-content/uploads/2014/10/openflow-switch-v1.5.0.noipr.pdf >>>>>> >>>>>> This action must behaves properly when src_oxm_id overlaps with >>>>>> dst_oxm_id, that is, it behaves as if src_oxm_id were copied out >>>>>> to a temporary buffer, then the temporary buffer copied to >>>>>> dst_oxm_id, if this is not possible the switch must reject the >>>>>> Copy Field action with a Bad Set Type error message. >>>>>> >>>>>> OpenvSwitch doesn't reject such actions. >>>>>> >>>>>> Fixes: 0f806cf08c36 ("Fix load balanced hairpin traffic for > fragmented >>>>>> packets.") >>>>>> Signed-off-by: Dumitru Ceara <dce...@redhat.com> >>>>>> --- >>>>>> include/ovn/logical-fields.h | 7 ++++--- >>>>>> tests/ovn.at | 8 ++++---- >>>>>> 2 files changed, 8 insertions(+), 7 deletions(-) >>>>>> >>>>>> diff --git a/include/ovn/logical-fields.h >>> b/include/ovn/logical-fields.h >>>>>> index d563e044cb..70c6b93c41 100644 >>>>>> --- a/include/ovn/logical-fields.h >>>>>> +++ b/include/ovn/logical-fields.h >>>>>> @@ -60,9 +60,10 @@ enum ovn_controller_event { >>>>>> #define MFF_LOG_LB_AFF_MATCH_LR_IP6_ADDR MFF_XXREG1 >>>>>> #define MFF_LOG_LB_AFF_MATCH_PORT MFF_REG8 >>>>>> >>>>>> -#define MFF_LOG_CT_ORIG_NW_DST_ADDR MFF_REG4 >>>>>> -#define MFF_LOG_CT_ORIG_IP6_DST_ADDR MFF_XXREG0 >>>>>> -#define MFF_LOG_CT_ORIG_TP_DST_PORT MFF_REG8 >>>>>> +#define MFF_LOG_CT_ORIG_NW_DST_ADDR MFF_REG1 /* >>>>>> REG_ORIG_DIP_IPV4 */ >>>>>> +#define MFF_LOG_CT_ORIG_IP6_DST_ADDR MFF_XXREG1 /* >>>>>> REG_ORIG_DIP_IPV6 */ >>>>>> +#define MFF_LOG_CT_ORIG_TP_DST_PORT MFF_REG2 /* >>>>>> REG_ORIG_TP_DPORT >>>>>> + * (bits >>> 0..15). */ >>>>>> >>>>>> void ovn_init_symtab(struct shash *symtab); >>>>>> >>>>>> diff --git a/tests/ovn.at b/tests/ovn.at >>>>>> index 15b78f4c37..03fd2090fc 100644 >>>>>> --- a/tests/ovn.at >>>>>> +++ b/tests/ovn.at >>>>>> @@ -2420,10 +2420,10 @@ mac_cache_use; >>>>>> >>>>>> # ct_nw_dst() >>>>>> reg1 = ct_nw_dst(); >>>>>> - encodes as >>>>>> >>> > set_field:0->reg4,resubmit(,OFTABLE_CT_ORIG_NW_DST_LOAD),move:NXM_NX_REG4[[]]->NXM_NX_XXREG0[[64..95]] >>>>>> + encodes as >>>>>> >>> > set_field:0->reg1,resubmit(,OFTABLE_CT_ORIG_NW_DST_LOAD),move:NXM_NX_REG1[[]]->NXM_NX_XXREG0[[64..95]] >>>>>> >>>>>> xreg1[[3..34]] = ct_nw_dst(); >>>>>> - encodes as >>>>>> >>> > set_field:0->reg4,resubmit(,OFTABLE_CT_ORIG_NW_DST_LOAD),move:NXM_NX_REG4[[]]->NXM_NX_XXREG0[[3..34]] >>>>>> + encodes as >>>>>> >>> > set_field:0->reg1,resubmit(,OFTABLE_CT_ORIG_NW_DST_LOAD),move:NXM_NX_REG1[[]]->NXM_NX_XXREG0[[3..34]] >>>>>> >>>>>> reg1[[3..34]] = ct_nw_dst(); >>>>>> Cannot select bits 3 to 34 of 32-bit field reg1. >>>>>> @@ -2442,7 +2442,7 @@ ct_nw_dst(); >>>>>> >>>>>> # ct_ip6_dst() >>>>>> xxreg1 = ct_ip6_dst(); >>>>>> - encodes as >>>>>> >>> > set_field:0/0xffffffffffffffff->xxreg0,set_field:0/0xffffffffffffffff0000000000000000->xxreg0,resubmit(,OFTABLE_CT_ORIG_IP6_DST_LOAD),move:NXM_NX_XXREG0[[]]->NXM_NX_XXREG1[[]] >>>>>> + encodes as >>>>>> >>> > set_field:0/0xffffffffffffffff->xxreg1,set_field:0/0xffffffffffffffff0000000000000000->xxreg1,resubmit(,OFTABLE_CT_ORIG_IP6_DST_LOAD),move:NXM_NX_XXREG1[[]]->NXM_NX_XXREG1[[]] >>>>>> >>>>>> reg1 = ct_ip6_dst(); >>>>>> Cannot use 32-bit field reg1[[0..31]] where 128-bit field is >>> required. >>>>>> @@ -2455,7 +2455,7 @@ ct_ip6_dst(); >>>>>> >>>>>> # ct_tp_dst() >>>>>> reg1[[0..15]] = ct_tp_dst(); >>>>>> - encodes as >>>>>> >>> > set_field:0/0xffff->reg8,resubmit(,OFTABLE_CT_ORIG_TP_DST_LOAD),move:NXM_NX_REG8[[0..15]]->NXM_NX_XXREG0[[64..79]] >>>>>> + encodes as >>>>>> >>> > set_field:0/0xffff->reg2,resubmit(,OFTABLE_CT_ORIG_TP_DST_LOAD),move:NXM_NX_REG2[[0..15]]->NXM_NX_XXREG0[[64..79]] >>>>>> >>>>>> reg1 = ct_tp_dst(); >>>>>> Cannot use 32-bit field reg1[[0..31]] where 16-bit field is >>> required. >>>>>> -- >>>>>> 2.46.2 >>>>>> >>>>>> _______________________________________________ >>>>>> dev mailing list >>>>>> d...@openvswitch.org >>>>>> https://mail.openvswitch.org/mailman/listinfo/ovs-dev >>>>>> >>>>>> >>>>> Looks good to me, thanks. >>>>> >>>>> Acked-by: Ales Musil <amu...@redhat.com> >>>>> >>>> >>>> Thanks, Ales! Applied to main and backported to 24.09 and 24.03. >>>> >>>> Regards, >>>> Dumitru >>>> >>> >> >> Hi Han, >> >>> Hi Dumitru, Ales, I am trying to figure out the current register usage > and >>> saw this old commit. I am not sure I understand the motivation of this >>> patch. Is there a problem of using different registers for intermediate >> >> There's no problem per se but we were using two registers instead of >> one. The intermediate register was overwritten and then its new value >> immediately copied to the final destination register. >> >> The goal of the patch was to free up registers for other things in we >> might need to do in the pipeline. >> >>> usage? Although this patch tried to use the same register for > intermediate >>> and final destination, a later commit from Ales 91988089c5 ("northd: >>> Consolidate register usage in logical flows.") already broke it again. >>> Shall we fix that again? >>> >> >> I guess we probably should. Maybe this time we should change the way >> the ct_*_dst() actions are encoded? I'm not sure how to prevent this >> from happening again in the future though. >> >
Hi Han, > I think the right way to encode actions that require intermediate registers > is to save the original content of the register on the stack beforehand and > restore it afterwards. This way the intermediate registers won't really > occupy the register space, and we don't need to worry about conflict > usage/overwriting. Otherwise, it is very hard to maintain and not even easy > to ensure the register is not overwritten accidentally if the same action > is called at different stages. > I will go ahead with this approach if it makes sense. What do you think? > Sure, I was thinking of something similar initially too but I'm not sure how we can easily implement it. I might be missing something though. For example: reg4 = ct_nw_dst(); Is actually parsed as the following sequence of logical actions today: // ct_nw_dst() LOAD(reg1, 0) RESUBMIT(table=81) // This might write reg1. LOAD(reg4, reg1) So the problem I saw was that we're losing the information we had in reg1. To fix it, IIUC, you're suggesting to use the stack, I assume something like: // ct_nw_dst() PUSH(reg1) LOAD(reg1, 0) RESUBMIT(table=81) // This might write reg1. LOAD(reg4, reg1) POP(reg1) But the problem is that we're mixing the encoding of two logical actions (the push/pop are for the ct_nw_dst() call while the load(reg4, ..) is for the assignment). I guess, the correct way is to have a proper calling convention for all function-like actions like ct_nw_dst(), get_fdb(), etc. And ensure we always return the value in a predefined way, maybe always push it on the stack? Regards, Dumitru > Regards, > Han > >>> Thanks, >>> Han >>> >> >> Regards, >> Dumitru >> >>> >>>> >>>> _______________________________________________ >>>> dev mailing list >>>> d...@openvswitch.org >>>> https://mail.openvswitch.org/mailman/listinfo/ovs-dev >>> >> > _______________________________________________ dev mailing list d...@openvswitch.org https://mail.openvswitch.org/mailman/listinfo/ovs-dev