Hi Jorge,

Stéphane, Satya and I have gone over this draft again and propose the following 
regarding the Lowest-IP vs. Highest-IP debate.

Option 1
Pick one, e.g. draft explicitly states highest-weight wins. Removes all 
ambiguity.

Option 2:
Support Lowest-weight and Highest-weight both, and it must be signaled.
Signaling is done using DF-Algorithm types:

  *   DF-Type=2 : Preference-DF, Highest-weight
  *   DF-Type=3 : Preference DF, Lowest-weight
. Example/precedent for this approach:  
draft-ietf-bess-evpn-per-mcast-flow-df-election
  claims 2 DF-Types for each S,G and *,G DF election (ignoring that the draft 
is in serious need of revision)
. 7432-fallback if non-agreement (type2<>type3)
. forward-compatible with existing implementations: DF-Type 2 with default 
Highest-wt.

Option 3:
Support Lowest-weight and Highest-weight both, and it must be signaled.
Signaling is done using a Flag in the DF-Elect extcomm

  *   DF-Type=2 : Preference-DF
  *   DF-Elect ExtComm:  specify a Flag: 0=Highest, 1=Lowest.
. 7432-fallback if non-agreement (flag) is more complicated on an EC
. doesn’ match other WG documents’ approach
. forward-compatible with existing implementations: DF-Type 2 with Fl=0.
. minimal support would be alarming based on flag mismatch
. must use ‘Reserved’ and NOT capability bitmap because that would require 
analysis against all other DF Algo modes.


Speaking for myself, I would add that  ‘D’ bit capability is a generic flag 
across DF Type Algos.  Evaluation against all existing DF-types must be done 
but... I would remove the ‘all other DF-types SHOULD ignore’.
Many other DF types may want to leverage and I think it is a good addition as a 
generic capability..


Regards,
Luc André

Luc André Burdet |  Cisco  |  laburdet.i...@gmail.com  |  Tel: +1 613 254 4814


From: Luc André Burdet <laburdet.i...@gmail.com>
Date: Wednesday, November 18, 2020 at 14:05
To: "Rabadan, Jorge (Nokia - US/Mountain View)" <jorge.raba...@nokia.com>, 
"slitkows.i...@gmail.com" <slitkows.i...@gmail.com>
Cc: "draft-ietf-bess-evpn-pref...@ietf.org" 
<draft-ietf-bess-evpn-pref...@ietf.org>, "bess-cha...@ietf.org" 
<bess-cha...@ietf.org>, "bess@ietf.org" <bess@ietf.org>
Subject: Re: [bess] Shepherd's Review of draft-ietf-bess-evpn-pref-df

Hi Stéphane, Jorge,

On the issue of tie-breaking, I would agree with Stéphane.

It is too easy for peering PEs to be misconfigured / in disagreement as to Hi 
or Lo and this should be signaled if at least in an error reporting capacity.

I would propose to go even further. This DF Election algorithm compares fields 
in a given order:

  *   Field 1: Preference Weight
  *   Field 2: IP Address Tie-break if Field1 is a tie, and hope all PEs use 
same Hi or Lo.

The algorithm could be made very generic in terms of tie break and provide for 
a cascading order or skipping a field entirely:

  *   Field 1: Preference Weight

  *   Optional Tie-break Method X: IP Address Field, Highest-IP
  *   Optional Tie-break Method Y: IP Address Field, Lowest-IP
  *   Optional Tie-break Method Z: Local-Policy ( see section 4.1(g) )

We could potentially mandate at least one MUST be provided and signaled incl 
order of resolving tie-break?
Basically my point is that IP is unique but also an arbitrary Field to use, 
many more could be better suited for tie-break dép. use-case.  There are 
practical examples where local policies will affect weight but perhaps that 
local-policy itself serves as a tie-break.
One could split the Reserved field of section 3 into 2 nibbles: Tie-break-1, 
Tie-break-2 for mismatch detection, and this draft defines "well-known values" 
IP-Hi=0 and IP-Lo=1 (leaving all others to implementation-specific values 
(non-assigned) and ONLY for error detection.

Luc André Burdet | laburdet.i...@gmail.com<mailto:laburdet.i...@gmail.com> | +1 
613 254 48 14
.:|:.:|:. Cisco Systems Canada Inc.


On Tue, Sep 1, 2020 at 1:25 AM Rabadan, Jorge (Nokia - US/Mountain View) 
<jorge.raba...@nokia.com<mailto:jorge.raba...@nokia.com>> wrote:
Hi Stephane,

Please see in-line with [jorge2].
Thank you!
Jorge

From: slitkows.i...@gmail.com<mailto:slitkows.i...@gmail.com> 
<slitkows.i...@gmail.com<mailto:slitkows.i...@gmail.com>>
Date: Monday, August 31, 2020 at 11:49 AM
To: Rabadan, Jorge (Nokia - US/Mountain View) 
<jorge.raba...@nokia.com<mailto:jorge.raba...@nokia.com>>, 
draft-ietf-bess-evpn-pref...@ietf.org<mailto:draft-ietf-bess-evpn-pref...@ietf.org>
 
<draft-ietf-bess-evpn-pref...@ietf.org<mailto:draft-ietf-bess-evpn-pref...@ietf.org>>,
 bess-cha...@ietf.org<mailto:bess-cha...@ietf.org> 
<bess-cha...@ietf.org<mailto:bess-cha...@ietf.org>>, 
bess@ietf.org<mailto:bess@ietf.org> <bess@ietf.org<mailto:bess@ietf.org>>
Subject: RE: Shepherd's Review of draft-ietf-bess-evpn-pref-df
Hi Jorge,

Please find additional feedback inline.
(Stripping things we agree on)

Stephane


From: Rabadan, Jorge (Nokia - US/Mountain View) 
<jorge.raba...@nokia.com<mailto:jorge.raba...@nokia.com>>
Sent: vendredi 19 juin 2020 20:37
To: slitkows.i...@gmail.com<mailto:slitkows.i...@gmail.com>; 
draft-ietf-bess-evpn-pref...@ietf.org<mailto:draft-ietf-bess-evpn-pref...@ietf.org>;
 bess-cha...@ietf.org<mailto:bess-cha...@ietf.org>; 
bess@ietf.org<mailto:bess@ietf.org>
Subject: Re: Shepherd's Review of draft-ietf-bess-evpn-pref-df

Hi Stephane,

Thanks for the review and my apologies for the delay.
We just posted a new revision.

As usual, very good points. Please see in-line.

Thx
Jorge

From: "slitkows.i...@gmail.com<mailto:slitkows.i...@gmail.com>" 
<slitkows.i...@gmail.com<mailto:slitkows.i...@gmail.com>>
Date: Wednesday, February 26, 2020 at 3:20 PM
To: 
"draft-ietf-bess-evpn-pref...@ietf.org<mailto:draft-ietf-bess-evpn-pref...@ietf.org>"
 
<draft-ietf-bess-evpn-pref...@ietf.org<mailto:draft-ietf-bess-evpn-pref...@ietf.org>>,
 'BESS' <bess@ietf.org<mailto:bess@ietf.org>>
Cc: "bess-cha...@ietf.org<mailto:bess-cha...@ietf.org>" 
<bess-cha...@ietf.org<mailto:bess-cha...@ietf.org>>
Subject: Shepherd's Review of draft-ietf-bess-evpn-pref-df
Resent-From: <alias-boun...@ietf.org<mailto:alias-boun...@ietf.org>>
Resent-To: <jorge.raba...@nokia.com<mailto:jorge.raba...@nokia.com>>, 
<senthil.sathap...@nokia.com<mailto:senthil.sathap...@nokia.com>>, 
<p...@juniper.net<mailto:p...@juniper.net>>, 
<w...@juniper.net<mailto:w...@juniper.net>>, 
<jdr...@juniper.net<mailto:jdr...@juniper.net>>, 
<saja...@cisco.com<mailto:saja...@cisco.com>>, 
<satya...@cisco.com<mailto:satya...@cisco.com>>
Resent-Date: Wednesday, February 26, 2020 at 3:20 PM



Section 4.1:

        “ Note that, by default, the Highest-Preference is chosen for each
       ES or vES, however the ES configuration can be changed to the
       Lowest-Preference algorithm as long as this option is consistent
       in all the PEs in the ES.
I don’t think it is a good idea to open this modification. People can play with 
preference values to achieve the required behavior while always keeping high 
pref.
We have no way to ensure consistency, except if we advertise the behavior as 
part of the exct. Consistency of DF election is key and needs to be ensured as 
much as we can.
[Jorge] the idea is have the highest preference as default (maybe use normative 
language?), which means that it will work fine. Opening to lowest is to give 
more flexibility, knowing that if the user has to change the config from the 
default, they will do it in all the PEs of the ES.

[SLI] I don’t think this is a good idea, consistency is really important, if 
you absolutely want to do both lowest and highest, you can allocate a new DF 
Alg for lowest so we ensure that everybody uses the same algorithm. But I don’t 
think this is necessary, having highest preference is enough. I don’t remember 
any feature using a preference that can do both highest and lowest, it is 
usually one or the other.


[Jorge2] ok, we can leave just the highest-preference in section 4.1. We’ll fix 
it in the next revision. Note that in 4.2 we still need to have highest and 
lowest pref per ethernet tag range to make sure there is load balancing.








“When PE3's vES2 comes back up, PE3 will start a boot-timer (if

       booting up) or hold-timer (if the port or EVC recovers).  That

       timer will allow some time for PE3 to receive the ES routes from

       PE1 and PE2. »



Are you changing the way the DF election works ? Usually, PE advertises its 
route and then wait to receive other routes.

[Jorge] those timers are on top of the FSM defined in RFC8584, e.g. we need to 
give some extra time before the ES goes up and we advertise the ES route, if 
the ES is configured with the DP capability. This is because the advertised 
preference and DP values may not be the same as the configured ones, and the 
‘in-use’ values will depend on the other ES routes in the ES. If we advertise 
the ES route immediately after the ES is up, we may not have received the other 
ES routes and we don’t know what “in-use” values to advertise in order to avoid 
preemption in the ES. I added some text on point 5 (section 4.3).





[SLI] As we are updating the FSM, could we have new state and events defined to 
update the FSM similarly as we have in RFC8584 ?
[jorge2] not sure if we should update the FSM, the reason being that those hold 
timers are generic for any redundancy mechanism, to avoid attracting traffic 
from the access before the core IGP/BGP are up and have all the required routes 
available. Some implementations use fixed timers, others configurable timers, 
others watch when the IGP/BGP come up and leave an additional delta. I thought 
the current text is generic enough to allow all those implementations.





What happens if all PEs on the ES are failing at the same time or the ES 
booting up on all the PEs at the same time ? you have no way to hear what is 
the pref from the remote.

[Jorge] The non-revertive capability makes sense when there is at least one PE 
alive in the ES and we don’t want to preempt it so that there is no traffic 
impact. If all the PEs fail, there is traffic impact anyway, so there is really 
no non-revertive behavior, but an initialization in all the PEs.



[SLI] Let’s say that you have a CE attached to 3 PEs (same ESI). PE1 pref 100, 
PE2 pref 200, PE3 pref 300. PE1 to 3 are configured with DP set. PE4 is a 
remote PE.

T0:CE fails and boots up.

T1:PE1 to PE3 starts their HOLD_TIMER

T2: PE1 HOLD_TIMER expires,  advertises ES route and starts DF_TIMER

T3: PE4 discovers ES from PE1 and starts DF_TIMER
[jorge2] I assume you mean PE3 here.

T4: PE2 HOLD_TIMER expires,  advertises ES route and starts DF_TIMER, does it 
advertises with Pref100 and DP=0 as it knows from PE1 route even if PE1 is not 
DF yet ?
[jorge2] this is an example in which all PEs are ‘initializing’ at the same 
time, so there is traffic impact anyway because the CE reboots. So it would be 
more like an initialization example. Nevertheless, PE2 in this case should 
advertise the ES route with in-use pref 100 and DP=0.





Brgds,

Stephane

_______________________________________________
BESS mailing list
BESS@ietf.org<mailto:BESS@ietf.org>
https://www.ietf.org/mailman/listinfo/bess
_______________________________________________
BESS mailing list
BESS@ietf.org
https://www.ietf.org/mailman/listinfo/bess

Reply via email to