Re: [Lsr] Thoughts about PUAs - are we not over-engineering?

Peter Psenak Thu, 16 Jun 2022 03:04:21 -0700

Hi Gunter,

please see inline (##PP):


On 16/06/2022 10:09, Van De Velde, Gunter (Nokia - BE/Antwerp) wrote:

Hi Gyan, Daniel, Peter, All,

Thanks for sharing your insights and I agree mostly with your feedback
I agree and understand that summarization is needed to reduce the sizeof the LSDB. I also agree summarization good design practice, especiallywith IPv6 and SRv6 in mind. There never has been doubt about that.
I am not sure I agree that UAP/UPA is ‘optimal-design’. Maybe it is thebest we can do, however I have a healthy worry we could be sufferingtunnel vision and that proposed solution may not be good enough.
We should not be blind and believe that advertising UPA/PUA does notcome without a cost. The architectural PUA/UPA usage complexity cost maynot be worth the effort (none of the integration of using a PUA/UPAevent triggers come for free). Do we really believe that PUA/UPA solveall the SID reachability problems for all IGP network design and SRuse-cases elegantly? Maybe some use-case design constraints andassumptions should be documented to clarify architecturally wherePUA/UPA is most beneficial for operators? Just stating “outside scope ofthe draft” seems unfair to operators interested in PUA/UPAs


##PP

we are trying to solve a particular problem of remote PE going down innetwork where summarization is used. I believe that is stated clearly inthe UPA draft.

Let me give two examples where PUA/UPA benefit is unclear:

(1) Multiple-ABRs
I was wondering for example if a ingress router receives a PUA signalingthat a given locator becomes unreachable, does that actually reallysignals that the SID ‘really’ is unreachable for a router?
For example (simple design to illustrate the corner-case):

ingressPE#1---area#1---ABR#1---area---ABR#2---area#3---egressPE#2

      |                                                      |

      |                                                      |

      +--------area#1---ABR#3---area---ABR#4---area#3--------+

What if ABR#4 would loose connectivity to egressPE#2 and ABR#2 does not?
In that case ABR#4 will originate a UPA/PUA and ABR#2 does not originatea PUA/UPA.
How is ingressPE#1 supposed to handle this situation? The only thingingressPE#1 see is that suddenly there is a PUA/UPA but reachability maynot have changed at all and remains perfectly reacheable.


##PP
we are not trying to solve the area partitioning problem with UPA.

Clearly, if you summarize on both ABRs and your area partitions, youconnectivity is broken, as you have no control on which ABR the trafficwill use to enter the partitioned area. If you hit the one that has noconnectivity to the egress PE, your traffic will be dropped.

With UPA, at least the service traffic can be switched to an alternateegress PE, if there is one, preserving the connectivity for the serviceprefixes.

(2) with sr-policy or SRv6 SRTE
What if we have an inter-area/domain/level SRTE or sr-policy andsuddenly there is a PUA/UPA for one of the SIDs in the sid-list of the path.
will this impact the srte or sr-policy in any way? Will transit routersdo anything with the UPA/PUA and drop packets. Will transit routerstrigger fast-restoration?


##PP

we are not specifying any of that. If the implementation decide to useUPA on transit routers for some application, we do not prohibit it.

Can PCEs/controllers use the SID for crafting paths? Will allSRTE/sr-policy using the locator be pruned or re-signaled?
Will ingress router do something with the PUA information? ShouldPUA/UPA draft give guidelines around this?


##PP

UPA draft only describes the ISIS asignalling part, not the externalapplication handling of the UPA. That would not be appropriate in IGP draft.


thanks,
Peter


Be well,

G/

If there is an SRTE or sr-policy using a given SID somewhere in the SIDlist… and suddenly


*From:*Gyan Mishra <hayabusa...@gmail.com>
*Sent:* Thursday, June 16, 2022 6:12 AM
*To:* Voyer, Daniel <daniel.voyer=40bell...@dmarc.ietf.org>

*Cc:* Van De Velde, Gunter (Nokia - BE/Antwerp)<gunter.van_de_ve...@nokia.com>;draft-ppsenak-lsr-igp-ureach-prefix-annou...@ietf.org;draft-wang-lsr-prefix-unreachable-annoucement<draft-wang-lsr-prefix-unreachable-annoucem...@ietf.org>; lsr@ietf.org

*Subject:* Re: [Lsr] Thoughts about PUAs - are we not over-engineering?

Summarization has always been a best practice for network scalabilitythereby reducing the size of the RIB and LSDB.

So in this case as Dan pointed out, the summary route is an abstractionof the area and so if a component prefix of the summary becameunreachable we need a way to signal that the PE next hop is no longerreachable to help optimize convergence.

We are just trying to make summarization work better then it does todayso we don’t have to rely on domain wide flooding of host routes.


Thanks

Gyan

On Wed, Jun 15, 2022 at 4:42 PM Voyer, Daniel<daniel.voyer=40bell...@dmarc.ietf.org<mailto:40bell...@dmarc.ietf.org>> wrote:


    Hi Gunter,

    Thanks for your comments,

    The idea, here, with summarization is to "reduce" the LSDB quite a
    lots and make a given backbone much more scalable / flexible and
    allow to simplify NNI's within that given backbones considerably.
    Summarization is "needed" for better scale and, in the context of
    IPv6, will help in preventing blowing up the IGP.  With the size of
    an IPv6 prefix range (ex. /64) allocated per domain - summarization
    will help to contain the LSDB to that domain.

    What we are "highlighting" in
    draft-ppsenak-lsr-igp-ureach-prefix-announce-00, is an easy way to
    overcome the fact that PEs are hidden behind a summary route and
    need a fast way to notify other PEs when they become unreachable.

    I don't see "over-engineering" here, I see "optimal-engineering"
    instead.

    Thanks
    Dan

    On 2022-06-14, 4:59 AM, "Van De Velde, Gunter (Nokia - BE/Antwerp)"
    <gunter.van_de_ve...@nokia.com
    <mailto:gunter.van_de_ve...@nokia.com>> wrote:

         Hi All,

         When reading both proposals about PUA's:
         * draft-ppsenak-lsr-igp-ureach-prefix-announce-00
         * draft-wang-lsr-prefix-unreachable-annoucement-09

         The identified problem space seems a correct observation, and
    indeed summaries hide remote area network instabilities. It is one
    of the perceived benefits of using summaries. The place in the
    network where this hiding takes the most impact upon convergence is
    at service nodes (PE's for L3/L2/transport) where due to the
    summarization its difficult to detect that the transport tunnel
    end-point suddenly becomes unreachable. My concern however is if it
    really is a problem that is worthy for LSR WG to solve.

         To me the "draft
    draft-wang-lsr-prefix-unreachable-annoucement-09" is not a preferred
    solution due to the expectation that all nodes in an area must be
    upgraded to support the IGP capability. From this operational
    perspective the draft
    "draft-ppsenak-lsr-igp-ureach-prefix-announce-00" is more elegant,
    as only the A(S)BR's and particular PEs must be upgraded to support
    PUA's. I do have concerns about the number of PUA advertisements in
    hierarchically summarized networks (/24 (site) -> /20 (region) ->
    /16 (core)). More specific, in the /16 backbone area, how many of
    these PUAs will be floating around creating LSP LSDB update churns?
    How to control the potentially exponential number of observed PUAs
    from floating everywhere? (will this lead to OSPF type NSSA areas
    where areas will be purged from these PUAs for scaling stability?)

         Long story short, should we not take a step back and re-think
    this identified problem space? Is the proposed solution space not
    more evil as the problem space? We do summarization because it
    brings stability and reduce the number of link state updates within
    an area. And now with PUA we re-introduce additional link state
    updates (PUAs), we blow up the LSDB with information opaque to SPF
    best-path calculation. In addition there is suggestion of new
    state-machinery to track the igp reachability of 'protected'
    prefixes and there is maybe desire to contain or filter updates
    cross inter-area boundaries. And finally, how will we represent and
    track PUA in the RTM?

         What is wrong with simply not doing summaries and forget about
    these PUAs to pinch holes in the summary prefixes? this worked very
    well during last two decennia. Are we not over-engineering with PUAs?

         G/

------------------------------------------------------------------------------

         External Email: Please use caution when opening links and
    attachments / Courriel externe: Soyez prudent avec les liens et
    documents joints


    _______________________________________________
    Lsr mailing list
    Lsr@ietf.org <mailto:Lsr@ietf.org>
    https://www.ietf.org/mailman/listinfo/lsr
    <https://www.ietf.org/mailman/listinfo/lsr>

--

<http://www.verizon.com/>

*Gyan Mishra*

/Network Solutions Architect /

/Email gyan.s.mis...@verizon.com <mailto:gyan.s.mis...@verizon.com>/

/M 301 502-1347/


_______________________________________________
Lsr mailing list
Lsr@ietf.org
https://www.ietf.org/mailman/listinfo/lsr

Re: [Lsr] Thoughts about PUAs - are we not over-engineering?

Reply via email to