In support of what Tony has said, I think any comparison between what RIFT is 
doing and what is proposed in this draft is inappropriate.

RIFT is able to determine what destinations exist in the network but are not 
reachable via a certain subset of the topology – and then generate negative 
advertisements appropriately. There is also full determinism in knowing when 
the negative advertisement should be removed.

draft-wang-lsr-prefix-unreachable-announcement by contrast tries to provide an 
advertisement for a destination that no longer exists. This leads to the lack 
of determinism which necessitates arbitrary timers and creates problems for 
nodes who connect to the network after the disappearance of the destination.

Not comparable at all IMO…

   Les

From: Lsr <lsr-boun...@ietf.org> On Behalf Of Tony Przygienda
Sent: Friday, September 04, 2020 11:12 AM
To: Aijun Wang <wangai...@tsinghua.org.cn>
Cc: Gyan Mishra <hayabusa...@gmail.com>; Robert Raszuk <rob...@raszuk.net>; 
Huzhibo <huzh...@huawei.com>; Aijun Wang <wang...@chinatelecom.cn>; Peter 
Psenak <ppsenak=40cisco....@dmarc.ietf.org>; lsr <lsr@ietf.org>; Acee Lindem 
(acee) <a...@cisco.com>; Xiaoyaqun <xiaoya...@huawei.com>
Subject: Re: [Lsr] New Version Notification for 
draft-wang-lsr-prefix-unreachable-annoucement-03.txt

I read the draft since the longish thread triggered my interest. As Peter said 
very thin ice walking with magic soft-state-timers for (to me) entirely unclear 
benefit and lots of interesting questions completely omitted like e.g. what 
will happen if a mix of old and new routers are in the network.

RIFT works completely differently BTW (and I don't think we _also_ noticed the 
problem, AFAIK RIFT is the first protocol that defined the concept of at least 
negative disaggregation to deal with black-hole avoidance in presence of 
summaries). RIFT defines precisely how negative disaggregation state is 
transitively propagated (if necessary) and next-hop resolved via recursive 
inheritance to provide black-hole and loop free routing in case of links 
failures on IP fabrics. No soft-timers or undescribed magic there. However, to 
do what RIFT is doing a sense of direction on the graph is needed (this is 
relatively easy on semi-lattice RIFT supports but would precondition uniform 
topological sorting on generic graphs, probably ending up in RPL type of 
solutions [which still need a direction indicator on arc to work and would take 
out a lot of links out of the topology possibly {but Pascal is better to judge 
that}]).

-- tony

On Mon, Aug 24, 2020 at 11:11 AM Aijun Wang 
<wangai...@tsinghua.org.cn<mailto:wangai...@tsinghua.org.cn>> wrote:
Hi, Gyan:

Sorry for replying you so late.
You are right about the summary address behavior, but the summary address is 
configured by manually, and if one of the specific prefix within this summary 
range is down, the black hole(route to this specific prefix) will be formed.  
PUA mechanism just want to amend this.
Glad to know Rift has also noticed such issues.  In OSPF/ISIS, such problem 
needs also be solved.

If you are interested this topic, welcome to join us to the solution.


Best Regards

Aijun Wang
China Telecom

From: lsr-boun...@ietf.org<mailto:lsr-boun...@ietf.org> 
[mailto:lsr-boun...@ietf.org<mailto:lsr-boun...@ietf.org>] On Behalf Of Gyan 
Mishra
Sent: Thursday, August 6, 2020 4:44 PM
To: Aijun Wang <wangai...@tsinghua.org.cn<mailto:wangai...@tsinghua.org.cn>>
Cc: Peter Psenak 
<ppsenak=40cisco....@dmarc.ietf.org<mailto:40cisco....@dmarc.ietf.org>>; Robert 
Raszuk <rob...@raszuk.net<mailto:rob...@raszuk.net>>; Huzhibo 
<huzh...@huawei.com<mailto:huzh...@huawei.com>>; Aijun Wang 
<wang...@chinatelecom.cn<mailto:wang...@chinatelecom.cn>>; lsr 
<lsr@ietf.org<mailto:lsr@ietf.org>>; Acee Lindem (acee) 
<a...@cisco.com<mailto:a...@cisco.com>>; Xiaoyaqun 
<xiaoya...@huawei.com<mailto:xiaoya...@huawei.com>>
Subject: Re: [Lsr] New Version Notification for 
draft-wang-lsr-prefix-unreachable-annoucement-03.txt

Hi Aijun and authors

I am catching up with this thread after reading through this draft.

I understand the concept that we are trying to send a PUA advertisement which 
sounds similar to Rift Negative Disaggregation prefix now with a  null setting 
when a longer match component prefix that is part of a summary range is down to 
detect prefix down conditions with longer match component prefixes that are 
part of a summary.

Basically how summarization works with both OSPF and ISIS is that at minimum a 
single longer match within the defined IA summary range must exist for the IA 
summary to be sent.  So the summary is sent conditionally similar to a BGP 
conditional advertisement or even a ospf default originate which requires a 
default in the RIB to exist before the advertisement is sent.  A good example 
of non conditional analogy with BGP if you have a null0 static for a summary 
for an exact match BGP advertisement the prefix is always advertised no matter 
what even if no component prefixes exist.  This can result in black hole 
routing. BGP has conditional feature to conditionally advertisement based on 
existence of a route advertisement of downstream neighbor in the BGP RIB.  So 
with ospf and Isis the summary is in fact conditional similar to a BGP 
conditional advertisement and in the example given in the draft in section 3.1 
when node T2 is down and pt2 becomes unreachable and let’s say that prefix is 
1.1.1.1/32<http://1.1.1.1/32> and the summary is 1.1.1.0/30<http://1.1.1.0/30> 
and no other component prefix exists within the summary range the prefix will 
not get adversed.  So there will not be any black hole.

The summary represents all prefixes within the range that would be suppressed 
with the summary when advertised into the backbone area.  However only at a 
minimum one prefix must exist in the range for the summary to be generated.  
That is done by design as the summary represents all prefixes within the range. 
 So let’s say there are a 100 prefixes and let’s say a few devices are down and 
so now only 5 prefixes exist within the range.  By design it is OK for router 
to generate the summary for the 5 prefixes it is representing and that will not 
cause any routing loop or black hole.


I am trying to understand wage gap exists and what we are trying to solve 
related to summarization in the context of IPv6.  Both IPv4 and IPV6 
summarization operates the similarly as far as the requirement of at minimum a 
single component route within the summary range must exist  as a condition to 
be present in the RIB before the summary can be advertised.

Kind Regards

Gyan

On Tue, Aug 4, 2020 at 11:25 PM Aijun Wang 
<wangai...@tsinghua.org.cn<mailto:wangai...@tsinghua.org.cn>> wrote:
Hi, Robert:

From: lsr-boun...@ietf.org<mailto:lsr-boun...@ietf.org> 
[mailto:lsr-boun...@ietf.org<mailto:lsr-boun...@ietf.org>] On Behalf Of Robert 
Raszuk
Sent: Friday, July 31, 2020 6:21 PM
To: Aijun Wang <wang...@chinatelecom.cn<mailto:wang...@chinatelecom.cn>>
Cc: Peter Psenak 
<ppsenak=40cisco....@dmarc.ietf.org<mailto:40cisco....@dmarc.ietf.org>>; 
Huzhibo <huzh...@huawei.com<mailto:huzh...@huawei.com>>; Aijun Wang 
<wangai...@tsinghua.org.cn<mailto:wangai...@tsinghua.org.cn>>; lsr 
<lsr@ietf.org<mailto:lsr@ietf.org>>; Acee Lindem (acee) 
<a...@cisco.com<mailto:a...@cisco.com>>; Xiaoyaqun 
<xiaoya...@huawei.com<mailto:xiaoya...@huawei.com>>
Subject: Re: [Lsr] New Version Notification for 
draft-wang-lsr-prefix-unreachable-annoucement-03.txt

[WAJ] Such information is for underlay link state and should be flooded via 
IGP? The ambiguity arises from IGP summary behavior and should be solved by 
itself?

Well if we look at this problem from a distance while on surface it seems like 
an IGP issue (not to mention some which use BGP as IGP :) IMO it is only 
hurting when you have some service overlay on top utilizing the IGP.
[WAJ] There are situations that the PUA mechanism apply when no service overlay 
running over IGP.  Scenarios described in 
draft-wang-lsr-prefix-unreachable-annoucement-03#section-3<https://datatracker.ietf.org/doc/html/draft-wang-lsr-prefix-unreachable-annoucement-03#section-3>
 are not tightly coupled with the overlay service.

So typically today if I am running any service with BGP I do count on BGP to 
remove routes which are no longer reachable. IGP just tells me how to get to 
the next hop, which direction to go and not if the endpoint (service CPE or PE 
connected to given CE) is up or down.

So today smart BGP implementations in good network design can use RD based 
withdraws to very fast (milliseconds) remove the affected service routes. When 
I said should we do it in BGP I meant to ask WG if this is good enough to 
quickly remove service routes. If not maybe we should send such affected next 
hop in BGP to even faster invalidate all routes with such next hop as failing 
PE.

Bottom line if you think the problem is IGP then I think Acee's comments apply.
[WAJ] Which comment is not addressed yet?

Last - See today you are bringing the case to signal transition to DOWN ... but 
for some people and applications it may be not enough. In fact UP/DOWN they can 
get via BGP. But if you have two ABRs and one will due to topology changes in 
its area suddenly will be forced to reach atomic destination covered by summary 
over much higher metric path that for applications running above may be much 
more severe case and not acceptable one too.
[WAJ] Or else, the application traffic will be broken.

And BGP will not remove service routes nor modify best path in any way as 
summary is masking the real metric to some next hops. So while in the network 
you may have alternate better native transit paths with a lot of free capacity 
if you only switch to a different bgp next hop (not talking about any TE at 
all) you are stuck offering much worse service to your customers.
[WAJ] if there are other links to reach the affected prefix via the ABR, then 
this ABR will not send the PUA information.

Those cases are starting to be solved by performance routing both at the 
service itself or at BGP nh levels. Should IGP assist here ... I am not sure.
[WAJ] when node become down, it can only depend on other nodes within the same 
IGP to send such unreachability information. IGP can certainly help here ☺


Many thx,
R.
_______________________________________________
Lsr mailing list
Lsr@ietf.org<mailto:Lsr@ietf.org>
https://www.ietf.org/mailman/listinfo/lsr
--

[Image removed by sender.]<http://www.verizon.com/>

Gyan Mishra

Network Solutions Architect

M 301 502-1347
13101 Columbia Pike
Silver Spring, MD

_______________________________________________
Lsr mailing list
Lsr@ietf.org<mailto:Lsr@ietf.org>
https://www.ietf.org/mailman/listinfo/lsr
_______________________________________________
Lsr mailing list
Lsr@ietf.org
https://www.ietf.org/mailman/listinfo/lsr

Reply via email to