Re: [lisp] Benjamin Kaduk's Discuss on draft-ietf-lisp-rfc6830bis-20: (with DISCUSS and COMMENT)

Fabio Maino Fri, 28 Sep 2018 15:46:15 -0700

On 9/28/18 3:41 PM, Joel M. Halpern wrote:

Thank you Benjamin.  This response helps me understand the situation.
I have sent a note to the WG about making LISP-SEC MTI.  That kind of change needs WG support.

I second that. The email was indeed very helpful, and I think we can use it (together with Eric's notes) as a guide to move forward.


Thanks,
Fabio

Yours,
Joel

On 9/28/18 6:03 PM, Benjamin Kaduk wrote:
Hi Joel,


On Wed, Sep 26, 2018 at 11:53:02PM -0400, Joel M. Halpern wrote:
Is there text we can add about the scoping that will change your discuss
into a series of useful comments?
I had attempted to structure my Discuss points so that they would either be
useful comments as is, or rendered moot by a reduced scope.  I guess I can
try to clarify those below.  (To be clear, reducing the scope is only going
to move from "has potentially existentially bad problems" to "has
substantial issues that likely require reengineering to resolve".)
If so, some indication of how you would like that phrased would help us
address these.
I think Ekr's ballot position on 6833bis has a good summary of the
architecture assumptions that the reduced scope allows us to make.
In order to have the document be able to plausibly make those claims, it
looks like we'd need to at least:
(1) update the Abstract/Introduction to clarify that the EID namespace is
     only defined within a single administrative domain.
(2) (optionally, if it makes sense) mention in the introduction that this
     administrative domain can include transport over other networks in the
     same way that a VPN would function[*], without requiring cooperation
     from or interaction with the other networks' administrators
(3) remove the "global" text from the EID-to-RLOC Database and Map-Cache
     definitions
(4) update the EID-Prefix definition to talk about the local site or
     administrative domain's "address allocation authority"
(5) take a look at the EID definition to consider whether references to "on
     the public Internet" are still valid, and the text about assignment
     in a hierarchical manner should be revised for the new scope as well.
     Likewise for EID-internal structure that is "not visible to the global
     routing system"

(I stopped skimming and looking for problematic text around section 6)
[*] Ideally this would be done without using the term "VPN" itself, since
I'd like to get a movement going to restrict "VPN" to include
confidentiality (i.e., privacy) protection.  "virtual network" or "overlay
network" may or may not be good candidate replacement terms.
If not, we seem to have a larger problem.
Well, we appear to have five ADs that are supporting making LISP-SEC a
normative reference and thus MTI; I don't know if that scale of change
meets your threshold for a "larger problem".
Yours,
Joel

On 9/26/18 11:44 PM, Benjamin Kaduk wrote:
Benjamin Kaduk has entered the following ballot position for
draft-ietf-lisp-rfc6830bis-20: Discuss

When responding, please keep the subject line intact and reply to all
email addresses included in the To and CC lines. (Feel free to cut this
introductory paragraph, however.)
Please refer to https://www.ietf.org/iesg/statement/discuss-criteria.html
for more information about IESG DISCUSS and COMMENT positions.


The document, along with other ballot positions, can be found here:
https://datatracker.ietf.org/doc/draft-ietf-lisp-rfc6830bis/



----------------------------------------------------------------------
DISCUSS:
----------------------------------------------------------------------

I have grave concerns about the suitability of LISP as a whole, in its
present form, for advancement to the Standards-Track.  While some of my
concerns are not specific to this document, as the core protocol
(data-plane) spec, it seems an appropriate place to attach them to.
I am told, out of band, that the intended deployment model is no longer to
cover the entire Internet (cf. the MISSREF-state
draft-ietf-lisp-introduction's "with LISP, the edge of the Internet and the
core can be logically separated and interconnected by LISP-capable
routers", etc.), and that full Internet-scale operation is no longer a
goal.  However, since that does not seem to be reflected in the current
batch of documents up for IESG review, I am forced to ballot on them
"as-is", namely as targeting global Internet deployment.  The requirements
placed on the mapping system are so stringent as to be arguably
unachievable at Internet-scale, though that arguably has more of an
interaction with the control-plane than the data-plane. It's still in
scope here, though, as part of the overall description of the protocol
flow.
(rendered moot by scope reduction)
There are almost innumerable downgrade attacks possible, and
the control-plane and data-plane security mechanisms are not normative
dependencies of the current corpus of documents, and as such are not up for
consideration as mitigating the security concerns with the core documents.
The downgrade attacks will probably require some further analysis; LISP-SEC
would protect a lot of the header bits but I think there may be some other
data flows to be looked at.
Section 3 defines the EID-to-RLOC Database:

     EID-to-RLOC Database:   The EID-to-RLOC Database is a global
        distributed database that contains all known EID-Prefix-to-RLOC
        mappings.  Each potential ETR typically contains a small piece of
        the database: the EID-to-RLOC mappings for the EID-Prefixes
        "behind" the router.  These map to one of the router's own
        globally visible IP addresses.  Note that there MAY be transient
        conditions when the EID-Prefix for the site and Locator-Set for
        each EID-Prefix may not be the same on all ETRs.  This has no
        negative implications, since a partial set of Locators can be
        used.

No compelling architecture for a trustworthy global distributed database
has been presented that I've seen so far, and LISP relies heavily on the
mapping system's database for its functionality.  I am concerned that so
many requirements are placed on the mapping system as to be in effect
unimplementable, in which case it would seem that the architecture as a
whole (that is, for a global Internet-scale system) is not fit for purpose.
(rendered moot by scope reduction)
Section 4.1's Step (6) only mentions parsing "to check for format
validity".  I think it is appropriate to mention (and refer to) source
authentication checks as well, since bad Map-Reply data can allow all sorts
of attacks to occur.
(not affected by scope reduction)
There are some fairly subtle ordering requirements between the order of
entries in Map-Reply messages and the Locator-Status-Bits in data-plane
traffic (so that the semantics of the status bits are meaningful),
which are only given a minimal treatment in the control-plane document.  The
need for synchronization in interpreting these bits should be mentioned
more prominently in the data-plane document as well.
(not affected by scope reduction)
The usage of the Instance ID does not seem to be adequately covered; from
what I've been able to pick up so far it seems that both source and
destination participants must agree on the meaning of an Instance ID, and
the source and destination EIDs must be in the same Instance.  This does
not seem like it is compatible with Internet scale, especially if there are
only 24 usable bits of Instance ID.
(not affected by scope reduction)
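
For reference, the arithmetic behind the 24-bit limit can be sketched as
follows.  This is only an illustration, assuming the header layout in which
a set I-bit splits one 32-bit word into a 24-bit Instance ID and 8
Locator-Status-Bits; the function and constant names are hypothetical and
not taken from any of the drafts.

```python
# Assumed layout: [ 24-bit Instance ID | 8-bit Locator-Status-Bits ]
INSTANCE_ID_BITS = 24
LSB_BITS_WITH_IID = 8
MAX_INSTANCES = 1 << INSTANCE_ID_BITS  # number of distinct Instance IDs


def pack_iid_word(instance_id: int, lsbs: int) -> int:
    """Pack an Instance ID and the reduced LSB field into one 32-bit word."""
    if not 0 <= instance_id < MAX_INSTANCES:
        raise ValueError("Instance ID does not fit in 24 bits")
    if not 0 <= lsbs < (1 << LSB_BITS_WITH_IID):
        raise ValueError("LSB field does not fit in 8 bits")
    return (instance_id << LSB_BITS_WITH_IID) | lsbs


def unpack_iid_word(word: int) -> tuple[int, int]:
    """Split the 32-bit word back into (instance_id, lsbs)."""
    return word >> LSB_BITS_WITH_IID, word & ((1 << LSB_BITS_WITH_IID) - 1)
```

Under that assumption there are 16,777,216 possible Instance IDs in total,
and every pair of communicating sites has to agree on what each one means.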
There seem to be a lot of intra-site synchronization requirements, notably
with respect to Map-Version consistency, the contents and ordering of
locator sets for EIDs in the site, etc.; the actual hard requirements for
synchronization within a site should be clearly called out, ideally in a
single location.
(not affected by scope reduction, since ETRs are affected and not just
Map-Servers)
The security considerations attempt to defer substantially to the
threat-analysis in RFC 7835, which does not really seem like a complete
threat analysis and does not provide analysis as to what requirements are
placed on the boundaries between the different components of LISP (data
plane, control plane, mapping system, various extensions, etc.).  The
secdir reviewer had some good thoughts in this space.
(not affected by scope reduction)
The security considerations throughout the LISP documents place a heavy
focus on the risk of over-claiming for routing EID-prefixes.  This is a
real concern, to be clear, but it should not overshadow the risk of an
attacker who is able to move traffic around at will, strip security
protections, cause denial of service, alter data-plane payloads, etc.
Similarly, this document's security considerations call out denial of
service as a risk from Map-Cache insertion/spoofing, but the risk from an
attacker being able to read and modify the traffic, perhaps even without
detection, seems a much greater threat to me.
(not affected by scope reduction)
I am not convinced that this protocol meets the current IETF requirements
for the security properties of Standards-Track Protocols without at least
LISP-SEC as a mandatory-to-implement component, and possibly additional or
stronger requirements.  (I did not do a full analysis of the system in the
presence of those security mechanisms, since that is not what is being
presented for review.)
(noting that LISP-SEC needs to be MTI and analysis performed under the new
assumptions)
Having an EID that is associated to user-correlatable devices has severe
privacy considerations, but I could not find this mentioned anywhere in all
of the LISP documents I've read so far.
(not affected by scope reduction)

-Benjamin
----------------------------------------------------------------------
COMMENT:
----------------------------------------------------------------------
I apologize for the somewhat scattered nature of these comments; there are
a lot of them and I was focusing my time more on trying to understand the
broader system, and the intended security posture, so they did not get as
much clean-up as I would have liked.  (Most of my review was performed on the
-18, though I have tried to update to the -20 as relevant.)

The instance ID provides for organizational correlation, another privacy
exposure.

Is there anything different between an "EID-to-RLOC Map-Request" and just a
"Map-Request"?  (Same question for "Map-Reply", too.)

There's a lot of stuff that seems to work best if there is symmetric
bidirectional traffic, with inline signalling of map version and
reachability changes, though clearly everything is designed to also work
with asymmetric connectivity or unidirectional traffic.  It would be nice
to have a high-level summary in or near the introduction about what kinds
of behavior/performance differences are expected for bidirectional vs.
unidirectional traffic.

Section 2
That's not the 8174 boilerplate; it's more than just adding a cite to the
2119 boilerplate.

Section 3
nit: "An address family that pertains to the Data-Plane." is a sentence
fragment.

     Ingress Tunnel Router (ITR):   An ITR is a router that resides in a
        [...]
        mapping lookup in the destination address field.  Note that this
        destination RLOC MAY be an intermediate, proxy device that has
        better knowledge of the EID-to-RLOC mapping closer to the

This doesn't seem like a 2119 MAY is necessary, but rather a statement of
fact that may not be known to the encapsulating ITR.

        Specifically, when a service provider prepends a LISP header for
        Traffic Engineering purposes, the router that does this is also
        regarded as an ITR.  The outer RLOC the ISP ITR uses can be based
        on the outer destination address (the originating ITR's supplied
        RLOC) or the inner destination address (the originating host's
        supplied EID).

I'm confused here, perhaps in multiple ways.  Are there now *two* LISP
headers on the packet?  Is the "outer RLOC the ISP ITR uses" the source
RLOC or the destination RLOC?

     Negative Mapping Entry:   A negative mapping entry, also known as a
        negative cache entry, is an EID-to-RLOC entry where an EID-Prefix
        is advertised or stored with no RLOCs.  That is, the Locator-Set
        for the EID-to-RLOC entry is empty or has an encoded Locator count
        of 0.

Is "empty" a distinct representation from "locator count of zero"?

Perhaps something of an aside, but the check described for
Route-Returnability is a somewhat weak check, and in some cases could still
be spoofed.  (I don't expect this to surprise anyone, of course, but
perhaps some more qualifiers could be added to the text.)

Section 4

     An additional LISP header MAY be prepended to packets by a TE-ITR
     when re-routing of the path for a packet is desired.  A potential
     use-case for this would be an ISP router that needs to perform
     Traffic Engineering for packets flowing through its network.  In such
     a situation, termed "Recursive Tunneling", an ISP transit acts as an
     additional ITR, and the RLOC it uses for the new prepended header
     would be either a TE-ETR within the ISP (along an intra-ISP traffic
     engineered path) or a TE-ETR within another ISP (an inter-ISP traffic
     engineered path, where an agreement to build such a path exists).

"the RLOC it uses for the new prepended header", again, this is as the
destination RLOC (vs. source RLOC)?

Section 4.1

     o  Map-Replies are sent on the underlying routing system topology
        using the [I-D.ietf-lisp-rfc6833bis] Control-Plane protocol.
Just to check my understanding: is the "underlying routing system topology"
the same as the "underlay"?

Is step (3) just describing more of what step (2) says is "not described in
this example"?

Section 5.3

The word "nonce" is normally used for something used exactly once.
E.g., with some AEAD algorithms, if the same "nonce" input is used for
different encryptions, the entire security of the system is compromised.
It would be better to refer to this field with a different term, given
that "the same nonce can be used for a period of time when encapsulating to
the same ETR".  "Uniquifier" or "random value" might be reasonable choices.

Why is there no discussion of the Map-Version or Instance-ID fields
in this section?

When doing ETR/PETR decapsulation:
     o  The inner-header 'Time to Live' field (or 'Hop Limit' field, in the
        case of IPv6) SHOULD be copied from the outer-header 'Time to Live'
        field, when the Time to Live value of the outer header is less than
        the Time to Live value of the inner header.  Failing to perform
        this check can cause the Time to Live of the inner header
        to increment across encapsulation/decapsulation cycles.  This
        check is also performed when doing initial encapsulation, when a
        packet comes to an ITR or PITR destined for a LISP site.

Er, what is "this check" that is also performed for initial encapsulation?
How are there multiple TTL values to compare?
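
Read literally, the decapsulation rule quoted above amounts to keeping the
smaller of the two values.  A minimal sketch of that reading, with a
hypothetical function name; it does not settle what the corresponding check
at encapsulation time would compare:

```python
def inner_ttl_after_decap(inner_ttl: int, outer_ttl: int) -> int:
    """Copy the outer TTL inward only when it is the smaller of the two.

    Illustrative reading of the quoted decapsulation rule, not text from
    the draft.
    """
    if outer_ttl < inner_ttl:
        return outer_ttl
    return inner_ttl
```

For example, an inner TTL of 64 with an outer TTL of 60 yields 60, while an
inner TTL of 60 with an outer TTL of 64 leaves the inner value unchanged.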

     o  The inner-header 'Differentiated Services Code Point' (DSCP) field
        (or the 'Traffic Class' field, in the case of IPv6) SHOULD be
        copied from the outer-header DSCP field ('Traffic Class' field, in
        the case of IPv6) to the inner-header.

nit: the first "inner-header" seems like an editing remnant?

Section 7.1
How is this stateless if it involves knowledge about the routers between
the ITR and all possible ETRs (i.e., a set that could change over time)?
Section 8

This 32-bit vs 24-bit thing is pretty hokey for a standards-track
specification (yes, I know that LISP-DDT is not standards track at the
moment).

Section 9
     Alternatively, RLOC information MAY be gleaned from received tunneled

What is this an alternative to?  The list of four options above?

     packets or EID-to-RLOC Map-Request messages.  A "gleaned" Map-Cache
     entry, one learned from the source RLOC of a received encapsulated
     packet, is only stored and used for a few seconds, pending
     verification.  Verification is performed by sending a Map-Request to
     the source EID (the inner-header IP source address) of the received
     encapsulated packet.

The source EID is some random end system, right?  So this relies on some
magic in the ETR to detect that there's a Map-Request and reply directly
instead of passing it on to the EID that won't know what to do with it?

Talking about the "R-bit" of the Map-Reply is detail from 6833bis and
might benefit from an explicit section reference to the other document.
Section 10

What is the "CE" of "CE-based ITRs"?  Presumably Customer Edge, but it
is not marked as well-known at
https://www.rfc-editor.org/materials/abbrev.expansion.txt so expansion is
probably in order.

Again, when we are talking about the internal structure of the Map-Reply, a
detailed section reference to 6833bis is useful.

Modifying LSBs seems like a fine DoS attack vector for an on-path attacker.

     value of 1.  Locator-Status-Bits are associated with a Locator-Set per
     EID-Prefix.  Therefore, when a Locator becomes unreachable, the
     Locator-Status-Bit that corresponds to that Locator's position in the
     list returned by the last Map-Reply will be set to zero for that
     particular EID-Prefix

Doesn't this imply a stateful relationship between the ordering of
Map-Replies and data-plane traffic?
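
To make the concern concrete, here is a small sketch of what goes wrong when
the two sides disagree on Locator ordering.  It assumes that bit i of the
Locator-Status-Bits refers to the i-th Locator in the list from the last
Map-Reply; the helper names and the RLOC labels are hypothetical.

```python
def encode_lsbs(locator_order, reachable):
    """Set bit i when the i-th Locator in locator_order is reachable."""
    bits = 0
    for index, locator in enumerate(locator_order):
        if locator in reachable:
            bits |= 1 << index
    return bits


def decode_lsbs(bits, locator_order):
    """Return the Locators that the receiver believes are reachable."""
    return [loc for i, loc in enumerate(locator_order) if bits & (1 << i)]


# The sender numbers its Locators in one order ...
sender_order = ["rloc-a", "rloc-b", "rloc-c"]
bits = encode_lsbs(sender_order, {"rloc-a", "rloc-c"})

# ... a receiver holding the same ordering recovers the intended set,
same_view = decode_lsbs(bits, sender_order)

# ... but a receiver holding a differently ordered list does not.
stale_view = decode_lsbs(bits, ["rloc-b", "rloc-a", "rloc-c"])
```

Here same_view is ["rloc-a", "rloc-c"] as intended, while stale_view comes
out as ["rloc-b", "rloc-c"], so the receiver would treat an unreachable
Locator as up and a reachable one as down.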

Section 10.1
     Note that "ITR" and "ETR" are relative terms here.  Both devices MUST
     be implementing both ITR and ETR functionality for the echo nonce
     mechanism to operate.

Perhaps they could be given actual names so as to disambiguate which steps
are performed with ITR vs. ETR role?

     The echo-nonce algorithm is bilateral.  That is, if one side sets the
     E-bit and the other side is not enabled for echo-noncing, then the
     echoing of the nonce does not occur and the requesting side may
     erroneously consider the Locator unreachable.  An ITR SHOULD only set
     the E-bit in an encapsulated data packet when it knows the ETR is
     enabled for echo-noncing.  This is conveyed by the E-bit in the
     RLOC-probe Map-Reply message.

Why is this even optional?  If it was mandatory to use, then there would
not be a question.  But at least clarify that the "this" that is conveyed
is whether the peer supports the echo-nonce algorithm.  (Also, subject to
downgrade.)

Section 13
     When a Locator record is removed from a Locator-Set, ITRs that have
     the mapping cached will not use the removed Locator because the xTRs
     will set the Locator-Status-Bit to 0.  So, even if the Locator is in
     the list, it will not be used.  For new mapping requests, the xTRs
     can set the Locator AFI to 0 (indicating an unspecified address), as
     well as setting the corresponding Locator-Status-Bit to 0.  This
     forces ITRs with old or new mappings to avoid using the removed
     Locator.

The behavior described here seems like it would be better described as "when
a Locator is taken out of service" than "removed from a Locator-Set", since
if it is not in the set at all, it has no index, and no LSB or AFI to set.
Should actually depopulating it like this be forbidden?
I guess the Map Versioning is supposed to help with this, but we need to
nail down the semantics more and/or give a clearer reference to it.

Section 13.1
     An ITR, when it encapsulates packets to ETRs, can convey its own
     Map-Version Number.  This is known as the Source Map-Version Number.

I suggest replacing "its own Map-Version Number" with something like "the
Map-Version number for the LISP site of which it is a part".  Writing this
causes me to note that the semantics of the Map-Version are unclear, here --
what is it scoped to?  An EID-Prefix?  An RLOC?  Oh, you say that in the next
paragraph (EID-Prefix).

     A Map-Version Number can be included in Map-Register messages as
     well.  This is a good way for the Map-Server to assure that all ETRs
     for a site registering to it will be synchronized according to
     Map-Version Number.

Huh?  I must be confused how this works.  (Also, wouldn't this be better in
the control plane document which covers Map-Register?)

Section 15
     o  When a tunnel-encapsulated packet is received by an ETR, the outer
        destination address may not be the address of the router.  This
        makes it challenging for the control plane to get packets from the
        hardware.  This may be mitigated by creating special Forwarding
        Information Base (FIB) entries for the EID-Prefixes of EIDs served
        by the ETR (those for which the router provides an RLOC
        translation).  These FIB entries are marked with a flag indicating
        that Control-Plane processing SHOULD be performed.

I assume this is just my lack of background showing, but I'm confused how
it makes sense to mark these for control-plane processing.  Isn't the
control plane much slower, and we're not putting all of the LISP data-plane
traffic onto the slow path?

Section 18
     o  Data-Plane gleaning for creating map-cache entries has been made
        optional.  If any ITR implementations depend or assume the remote
        ETR is gleaning should not do so.

nit: this is ungrammatical; "they should not" or "Any ITR implementations
that depend on or assume that" would fix it.

Section 19.1

Presumably IANA also updated the reference column to point to this
document?
_______________________________________________
lisp mailing list
[email protected]
https://www.ietf.org/mailman/listinfo/lisp


