Re: RtgDir review: draft-ietf-rtgwg-bgp-pic-00

Ahmed Bashandy (bashandy) Mon, 01 Aug 2016 17:09:54 -0700

Thanks a lot for the detailed review. I have uploaded version 02containing corrections for the typos indicated.


I am also CCing Jeff and Rob (the shepherd)


Ahmed

On 7/28/2016 1:34 AM, [email protected] wrote:

Re: RtgDir review: draft-ietf-rtgwg-bgp-pic-00

Hi Ahmed,

Many thanks for the updated version and your below answers.

Looks good to me.

Please find below 2 minor typos

Thanks.

Bruno

:s/unreacreachable/unreachable

:s/hierarchal/hierarchical
*From:*rtgwg [mailto:[email protected]] *On Behalf Of *AhmedBashandy (bashandy)
*Sent:* Tuesday, June 21, 2016 6:36 PM
*To:* [email protected]
*Cc:* [email protected]; [email protected]
*Subject:* Re: RtgDir review: draft-ietf-rtgwg-bgp-pic-00

Hi,

Thanks a lot for the detailed review.
I submitted version 01. I CCed Jeff and Rob Shakir, who volunteeredto be a shepherd (Thanks a lot):):)
I have restructured the document to address your comment. Besides Iwent over all the "Minor issues" as well as "nits" and corrected them,except 1-2 comments which I explained why I did not make themodifications.
Version 01 is modified and now it contains an "overview" section.Hence all sections have been shifted by one. So for example, thecomment about section 2.3.3 now applies to section 3.2
Please see my reply inline  starting with "#Ahmed"

Thanks

Ahmed
On 4/20/2016 6:53 AM, [email protected]<mailto:[email protected]> wrote:
    Hello,

    I have been selected as the Routing Directorate reviewer for this
    draft. The Routing Directorate seeks to review all routing or
    routing-related drafts as they pass through IETF last call and
    IESG review, and sometimes on special request. The purpose of the
    review is to provide assistance to the Routing ADs. For more
    information about the Routing Directorate, please see 
    http://trac.tools.ietf.org/area/rtg/trac/wiki/RtgDir

    Although these comments are primarily for the use of the Routing
    ADs, it would be helpful if you could consider them along with any
    other IETF Last Call comments that you receive, and strive to
    resolve them through discussion or by updating the draft.

    Document: draft-ietf-rtgwg-bgp-pic-00
    Reviewer: Bruno Decraene
    IETF LC End Date: “QA review” pre WG LC
    Intended Status: Informational

    *Summary:*

    I have some minor concerns about this document that I think should
    be resolved before publication.

    *Comments:*

    - Document is interesting. Document is relatively clear but
    sometime it feels like there is a little room for some
    reformulation/edition to improve fluidity. In particular, the
    learning curve is a bit steep at the beginning of the doc as most
    of the concepts are introduced in 3 pages (pages 4-6) in the form
    of a list of terminology and a pseudo code. I would find useful to
    have an overview section just after the introduction, with a high
    level view of the solution with a limited number of new terms.

    - The text feels like authoritative, while probably many terms are
    implementation specific. A priori, I would not expect all
    implementation of BGP PIC to use the same terms, and possibly not
    the same data structure. May be the text could be generalized to
    cover multiple implementations; or modified to describe a
    generalized concept (i.e. data-structure designed to share as much
    data as possible between elements, at the cost of additional
    indirections); or the document could state that it describes a
    specific implementation with implementations specific terminology,
    data structure, and specifics. Or a combination of both (e.g.
    adding a section being both generalized and describing the
    concept, and then the existing sections after stating that they
    are specific to one implementation).
#Ahmed: I have restructured the document to make much more general.Please take a look. All feedback is most welcomed.
*Minor Issues:*
- I find figure 2 very useful to understand the data-structure. Iwould move it sooner in the doc, somewhere before §2.2. (with itssubsequent text below) e.g. a new §2.2 "FIB data-structure"
It would need to be generalized i.e. example non-specific. I couldthink of:
IP Leaf: Pathlist:       IP Leaf:                Pathlist:

-------- ---------       -------                 --------
BGP NLRI ---> BGP NH1 ----> IGP IP1 (BGP NH1) ---> IGP NH1, I1 --->Adjacency1
BGP NHi --...                         IGP NHi, Ii  --..

|                                    |

|                                         |

|                                         |

v                                         v

OutLabel Array:                           OutLabel Array:

--------------                            --------------

          L (NLRI, NH1)                             L (IP1, NH1)

L (NLRI, NHi)                             L (IP1, NHi)

- Figure 1 could be enhanced with IGP-NH1, IGP-NH2, I1 and I2.

#Ahmed: Corrected
- Example 3 does not use the same naming convention than examples 1and 2, this make it harder to follow for a priori no reason. e.g. VPNlabels are named VPN-L11 in examples 1 and 2, but are namedVPN-PE21(P1) in exmaple 3; transport labels are named LDP-L12 inexmaples 1 and 2, but LASBR11(PE22) and L11 in figure 3.
#Ahmed: I renamed the VPN labels to conform to the convention used inExamples 1 and 2. Also All IGP labels were renamed to IGP-Lij to makethem applicable to both LDP and Segment routing.For ASBR labels, we need to differentiate between ASBR labels and IGPlabels. Hence the labels advertised by ASBRs are named differently.
- §2.3.3

"The local labels of the next hops".

 - All labels are locally assigned. So what do you mean by "local"
- "next-hop" sometimes refers to IGP/connected next-hop (a priori thecase here) and sometimes to BGP next-hop. I find it hard to follow. Irather use a different name (e.g; connected next-hop vs BGP next-hop)
#Ahmed: Corrected.

- §3
"the hashing at the BGP level yields path 0 while the hashing at theIGP level yields path 1. In that case, the packet will be sent out ofinterface I1 with the label stack "LDP-L12,VPN-L21".
Does not seem to match my understanding. For "LDP-L12,VPN-L21" I wouldassume BGP used path index 1 and IGP used path index 0.
#Ahmed: Corrected.

IMHO:
OLD: "Hence ASBR22 swaps "LASBR22(PE22)" with the LDP/SR label ofPE22, pushes the label of the next-hop towards PE22 in domain 2, andsends the packet along the shortest path towards PE22."
NEW: "Hence ASBR22 swaps "LASBR22(PE22)" with the LDP/SR label forPE22 advertised by the next-hop towards PE22 in domain 2, and sendsthe packet along the shortest path towards PE22."
(in all cases "swaps" then "pushes" would increase the label stack by1, which is not the case.)
#Ahmed: Corrected

§4.1

"the useable paths in the loadinfo"
loadinfo is a proprietary FIB datastructure which has not beenintroduced/defined. You need to either remove that term (if possible)or define it somewhere.
#Ahmed: Corrected. "loadinfo" is replaced with "pathlist", which isthe intended term
"Hence traffic restoration occurs within the time frame of IGPconvergence,"
agree.
..."and, for local link failure, within the timeframe of localdetection. Thus it is possible to achieve sub-50 msec convergence asdescribed in [10] for local link failure"
IMO, this is restricted to specific cases. e.g. external (eBGP) linkfailure, ECMP case, possibly IP FRR. So possibly
OLD: for local link failure, within the timeframe of local detection.Thus it is possible to achieve sub-50 msec convergence as described in[10] for local link failure
NEW: for local link failure, assuming a backup path has beenprecomputed, within the timeframe of local detection (e.g. 50ms).Example of solutions precomputing a backup path are IP FRR [LFA],[RLFA], [MRT], [TI-LFA] or eBGP path having a backup path [10].
#Ahmed: Corrected

§4
I would find useful to indicate, for each type of failure, the numberof data-structure that need to be updated.
#Ahmed: In general it is not always possible to know the number ofupdated data structures due to a topology change because it depends onthe topology itself. For example, in Section 5.1 (The section aboutBGP-PIC core), it is not possible to outline the datastructures thatare modified for a remote link failure. However for local linkfailure, we added a statement in second paragraph of Section 5.1indicating that only pathlists with paths using the local link need tobe updated. We also indicated in the 3rd paragraph that datastructuresare modified starting from the IGP leaves without walking back to BGPpathlists or data structures
---

§4.2.2

"To avoid loops, ePE2 MUST treat any core facing path as a backup

      path, otherwise ePE2 may redirect traffic arriving from the core

      back to ePE1 causing a loop."
Looks a bit under-described to me. Could you please elaborate a bit?In particular:
- if 2 PE (PE1, PE2) are connected in U to 2 P (P1, P2)(P1-PE1-PE2-P2), PE1 being nominal and PE2 only used in backup, in thenominal situation, if the core network sends the trafic to PE1 via PE2(used as a P/transit), how does PE2 know that it must send thistraffic to PE1? (rather than CE2)
- this behavior looks like an additional specific feature. How doewePE1 knows that ePE2 have this feature?
---
#Ahmed: The statement is removed. As mentioned in Section 6.1 how toavoid loops in case of CE node failure is a different topic and needsto be addressed separately
§4.3

"  Hence if the platform supports the "unflattened" forwarding chain,

   then a single pathlist needs to be updated while if the platform

   supports a shallower forwarding chain, then two pathlists need to be

   updated."
IINM "single" and "two" pathlist applies to the specific example. Inthis last sentence/summary, I'd prefer a more general statement. Apriori, without digging too much in this most complex use case, itseems like :s/single/o(1) :s/two/o(PE) . The former looks close(single vs o(1)) but IMHO there is a significant difference between 2and o(PE) (i.e. 100s)
#Ahmed: Agreed. The last paragraph is generalized and modified toindicate possible outcomes of flattenning forwarding chains
---

§5.1
Good paragraph. It's quite clear that the convergence time does notdepend on the number of BGP prefixes, which is good. For the benefitof the reader, it would be even more interesting if, for each type offailure, the text could indicate on what it depends. e.g. o(1),o(connected interfaces), o(PE), o(PEnominal*PEbackup)....
#Ahmed: Each subsection indicates the convergence time dependency. Forexample, in Section 6.1.1, the last paragraph says that the dependenceis on the IGP convergence time only
--

§7
"No additional security risk is introduced by using the mechanismsproposed in this document"
In general, with such a sentence, it's difficult to evaluate whetherthe authors have very quickly evaluated the risk or if this evaluationhas been performed in details. So in general, some more text detailingwhich aspects have been evaluated is interesting for the reader (yetpainful for the authors).
As the document describe an internal box behavior, this is difficultto evaluate and discuss. But from a bad experience, I fear that theremay be an impact. Indeed, with such structure, the FIBstructure/memory is typically different between BGP prefixes and IGPprefixes. In general, the implementation is designed to support the"right" numbers of both. But assuming an accident or an attack, thenumbers may not be "right". e.g. one upon a time, someone hasredistributed the BGP table into the IGP. In this case, the totalnumber of IP prefixes in the FIB is exactly the same. But as the datastructure used in the FIB was different between BGP and IGP prefixes,the FIB ran out of memory and the line card crashed (well actuallyonly the IP FIB, so IS-IS hello packet were still correctly sent andforwarded. As a result, traffic was permanently black holed)
#Ahmed: I added a statement indicating that the behavior is internalto the router.On another front, although the scenario that you mentioned is relatedto reliability rather than security, it indicates that BGP-PICactually improves the reliability of FIB because it greatly reducesthe use of hardware and software memory
---

§ 9

OLD: that allows achieving prefix independent convergence

NEW: that allows achieving BGP prefixes independent convergence

(it's still depend on the number of IGP prefixes and/or BGP pathlist)

#Ahmed: Corrected

*Nits:*

Abstract

"via more than one path."
In this 1rst sentence, it's not clear what path really means. (e.g cfthe terminology section where you have more than one). I guess thatyou mean "BGP path". (as there are also typically multiple IGP path toreach each BGP Next Hop)
#Ahmed: modified to next-hop to indicate BGP next-hops

"The objective is achieved through organizing the forwarding chains"
"chain" does not self self explicit to me. what about :s/chains/datastructure"
#Ahmed: Agreed, changed to "data structures"

"complete transparency"

what do you mean? transparency to what / from who?

#Ahmed: Removed the word "transparency

§1

OLD: to allow for more than one path for a given prefix

NEW: to allow for BGP to advertise more than one path for a given prefix

#Ahmed: Modified as suggested
OLD: Another more common and widely deployed scenario is L3VPN withmulti-homed VPN sites
NEW: Another more common and widely deployed scenario is L3VPN withmulti-homed VPN sites with unique Route Distinguisher.
#Ahmed: Modified as suggested

---

§1.2

"Pathlist: It is an array of paths"
"OutLabel-Array: The OutLabel-Array is a list of one or more outgoinglabels "
So a list is defined as an array and the array is defined as a list :-).

What about using the same term, e.g. a list?

#Ahmed: Corrected. Both are now pathlist and OutLabel-List:)

--

The OutLabel-Array is a list of one or more

      outgoing labels and/or label actions where each label or label

      action has 1-to-1 correspondence to a path in the pathlist. It

      is possible that the number of entries in the OutLabel-array is

      different from the number of paths in the pathlist and the ith

      Outlabel-Array entry is associated with the path whose path-

      index is "i".
- I don't see how one can have a 1-to-1 correspondance if the numberof elements is not the same.
#Ahmed Corrected.

- Last sentence could be splitted in 2.

--
Since the term ingres PE is defined, you could also detine the termegress PE. Possibly in the same sentence.
OLD: "Ingress PE, "iPE": It is a BGP speaker that learns about a

      prefix through another IBGP peer and chooses that IBGP peer as

      the next-hop for the prefix

NEW:      "Ingress PE, "iPE": It is a BGP speaker that learns about a
prefix through a IBGP peer and chooses an egress PE as thenext-hop for the prefix.
#Ahmed: Modified as suggested
As a side note, the previous definition assume that there were noRoute Relfector (the iBGP peer is the BGP Next Hop)
#Ahmed: Correct. I did not want to unnecessarily complicate thediscussion with information
--

§2.3
Figure 1 represents a VPN network with 3 PE and a CE. In this context,"VPN-P1" sounds a bit like a P router. What about :s/VPN-P1/VPN-IP1 ?Same comment for IGP-P1.
#Ahmed: Agreed. All modified as suggested

--

§2.3.2

OLD: ePE2 constructs the forwarding chain depicted in Figure 1

NEW: ePE2 constructs the forwarding chain depicted in Figure 3

#Ahmed: Corrected

OLD: VPL-L11

NEW: VPN-L11

#Ahmed: Corrected

§2.3.3

OLD: can reach ASBR1

NEW: can reach ASBR11

#Ahmed: Corrected

OLD: The label for advertised by ASBR11 to iPE

NEW: The label advertised by ASBR11 to iPE

OLD: The labels for advertised by ASBR12 to iPE

NEW: The labels advertised by ASBR12 to iPE

#Ahmed: Corrected

OLD: The labels for advertised to iPE by ASBR11 using BGP-LU

NEW: The labels advertised  by ASBR11 to iPE using BGP-LU

#Ahmed: Corrected

---

§3
OLD: Let's applying the above forwarding steps to the exampledescribed in Figure 1 Section 2.3.1.
OLD: Let's applying the above forwarding steps to the exampledescribed in Figure 2 Section 2.3.1.
#Ahmed: Corrected
(somewhat guesssing. But in all cases, there is no figure 1 in section2.3.1)
---

§4.1

IMO
OLD: As soon as the IGP convergence is effective for the BGP nhopentry, the new forwarding state is immediately available to alldependent BGP prefixes.
NEW: As soon as the IGP convergence is effective for a BGP next-hopentry, the new forwarding state is immediately available to alldependent BGP prefixes.
more generally

:s/nhop/next-hop

#Ahmed: Replaced nhop with next-hop in the entire document

---

§4.3

:s/PE222/PE22

#Ahmed: Corrected

Best regards,

Bruno

**

_________________________________________________________________________________________________________________________
Ce message et ses pieces jointes peuvent contenir des informations confidentielles ou privilegiees et ne doivent donc
pas etre diffuses, exploites ou copies sans autorisation. Si vous avez recu ce 
message par erreur, veuillez le signaler
a l'expediteur et le detruire ainsi que les pieces jointes. Les messages 
electroniques etant susceptibles d'alteration,
Orange decline toute responsabilite si ce message a ete altere, deforme ou 
falsifie. Merci.
This message and its attachments may contain confidential or privileged information that may be protected by law;
they should not be distributed, used or copied without authorisation.
If you have received this email in error, please notify the sender and delete 
this message and its attachments.
As emails may be altered, Orange is not liable for messages that have been 
modified, changed or falsified.
Thank you.

_________________________________________________________________________________________________________________________

Ce message et ses pieces jointes peuvent contenir des informations 
confidentielles ou privilegiees et ne doivent donc
pas etre diffuses, exploites ou copies sans autorisation. Si vous avez recu ce 
message par erreur, veuillez le signaler
a l'expediteur et le detruire ainsi que les pieces jointes. Les messages 
electroniques etant susceptibles d'alteration,
Orange decline toute responsabilite si ce message a ete altere, deforme ou 
falsifie. Merci.

This message and its attachments may contain confidential or privileged 
information that may be protected by law;
they should not be distributed, used or copied without authorisation.
If you have received this email in error, please notify the sender and delete 
this message and its attachments.
As emails may be altered, Orange is not liable for messages that have been 
modified, changed or falsified.
Thank you.

_______________________________________________
rtgwg mailing list
[email protected]
https://www.ietf.org/mailman/listinfo/rtgwg

Re: RtgDir review: draft-ietf-rtgwg-bgp-pic-00

Reply via email to