Aijun Wang
China Telecom

> On Nov 22, 2021, at 18:44, Tony Przygienda <tonysi...@gmail.com> wrote:
> 
> 
> Just to prop a voice of support to Tony Li's arguments which I have nothing 
> further to add really. He basically elucidated ;-) with flourishes what I 
> wrote in my short, terse email I think. 
> 
> As he says "you either make easy choices and get an operationally unstable 
> environment at scale/large disturbances or you make hard architectural 
> choices & scale much better over time".

[WAJ] Easy choices doesn’t definitely get operationally unstable environment(we 
have explained in previous mails). And would you give some examples to 
illustrate “hard architecture but scale much better over time?”

> Examples of that are abound in systems design but it's unfortunately often 
> RFC1295 (4). As a sidenote: IETF has no intention or mechanism to stop people 
> doing unscalable, poorly designed things with their specs

[WAJ] like RIFT or flood reflector solutions in IGP?

> and that was ok as long people did not try to push them onto the whole world 
> without listening to folks who took the scar tissue to get us the IETF 
> working specs we have today which seems to have become fashionable in last 
> couple of years. 
> 
> And fast flooding is a red herring here IMO, it does nothing but accelerate 
> the control loop, if the control loop is positively stable, it's good, if the 
> control loops are oscillating or start to negatively amplify accelerating 
> things just melts everything faster. 

[WAJ] There is no control loop oscillation. We track only the down event.
> 
> Ultimately, having followed this "discussion" my opinion is still that if 
> authors would like to abuse IGP as "domain wide broadcast" for liveliness 
> notification the "events" draft is far less fragile and convoluted but should 
> be kept in a service instance as basically "event based BFD substitute" to 
> not start to cause head-blocking on IGP resources.

[WAJ] Seems you are not listening other person’s discussion. Please refer to 
Acee’s comments at 
https://mailarchive.ietf.org/arch/msg/lsr/zzaPetGzjHsu49KcyBrixQlo5G8/

> And AFAIR Robert observed it's still not a very good indication compared to 
> BFD, a good solution would be e.g. in PE case a (hierarchical) MP2MP BFD PMSI 
> (assuming UDP healthy = TCP healthy which is however far less an assumption 
> than "flooding feels transport is OK"). 

[WAJ] Current solutions are not aimed only the BGP overlay service, but also 
the tunnel services. For tunnel services how you build MP2MP PMSI?

> 
> -- tony 
> 
>> On Mon, Nov 22, 2021 at 7:55 AM Tony Li <tony...@tony.li> wrote:
>> 
>> Les,
>> 
>> 
>>> The problem is that restricting the prefix length does nothing to limit the 
>>> number of advertisements that get flooded.  In a high-scale situation, when 
>>> there is a mass failure, it would lead to a flooding spike. That’s exactly 
>>> not the time to stress the system.
>>>  
>>> [LES:] As I have stated previously, I share your concern about the behavior 
>>> during massive events – and some care has to be taken to prevent making a 
>>> bad situation worse.
>>> That said, the WG (including you and I)  is taking on enhancements to 
>>> support much faster flooding – on the order of hundreds (perhaps thousands) 
>>> of LSPs/second. We believe this can be done safely (though proof has not 
>>> yet been established).
>> 
>> 
>> And the point of doing that was to help improve IGP convergence time…
>> 
>> 
>>> So, if you believe (as your active participation suggests) that IGPs can 
>>> support faster flooding – why do you believe they cannot support liveness 
>>> notification at a similar scale?
>> 
>> 
>> … not waste our time by inflating the LSDB by the same amount that we sped 
>> up flooding.
>> 
>> Also, I don’t see how faster flooding has ANYTHING to do with it. Adding 
>> negative liveness information is primary a scale issue.
>> 
>>  
>>> I get that you consider such notifications as architecturally undesirable – 
>>> we can agree to disagree on that point.
>>> But I don’t get why you think the IGP’s ability to handle large scale 
>>> events is a showstopper in this case.
>> 
>> 
>> I am opposed to anything that adds to the scale of the LSDB. Doubly so if it 
>> does so during failures, when the IGP is already under stress. As you well 
>> know, making an IGP stable during normal operations is one thing. Ensuring 
>> that it is stable during worst case topological changes is quite another. 
>> Adding scale during a mass failure is pessimal timing.
>> 
>> T
>> 
>> 
> _______________________________________________
> Lsr mailing list
> Lsr@ietf.org
> https://www.ietf.org/mailman/listinfo/lsr
_______________________________________________
Lsr mailing list
Lsr@ietf.org
https://www.ietf.org/mailman/listinfo/lsr

Reply via email to