Hi Dave, hi Jonathan,

I like it! Outside of my area of expertise, sure, but still short, sweet and 
focussed. As I commented on github, it might halp if you could make a point how 
this could actually help core-routers to soften their load, as it is those 
devices that need to learn to use SCE for the proposed endp-point methods to 
get anywhere in real life.

Best Regards
        Sebastian


> On Mar 10, 2019, at 19:04, Dave Taht <dave.t...@gmail.com> wrote:
> 
> I would love to have some fresh eyeballs on a new IETF draft for the
> TSVWG we intend to submit tonight.
> 
> I've attached the html for easy to read purposes, but I would prefer
> that folk referred back to the github repository for the most current
> version, which is here:
> 
> https://github.com/dtaht/bufferbloat-rfcs/blob/master/sce/draft-morton-taht-SCE.txt
> 
> and in open source tradition, discuss here, or file bugs, and submit
> pull requests to the gitub.
> 
> The first draft (of 3 or more pending), is creating the SCE codepoint
> and defining the state machine, is pretty short, and we think the
> basic concept solves a zillion problems with ECN in one stroke. It's
> easy to implement (one line of code in codel), backward compatible
> with all the RFCs, and somewhat incompatible with the stalled out TCP
> Prague/dualpi effort in the IETF.
> 
> We have several other drafts in progress which I increasingly doubt
> we'll finish today, but I think only this one is required to get an
> audience in the tsvwg at the coming IETF meeting.
> 
> If ya have any comments and spare time today, I'd like to get the
> first draft in tonight, and the filing deadline for final drafts is
> sometime tomorrow. It may help for context to review some of the other
> work in the github repo.
> 
> THX!
> 
> -- 
> 
> Dave Täht
> CTO, TekLibre, LLC
> http://www.teklibre.com
> Tel: 1-831-205-9740
Title: The Some Congestion Experienced ECN Codepoint
Transport Working Group J. Morton
Internet-Draft Bufferbloat.net
Updates: 3168, 8311 (if approved) D. Täht
Intended status: Standards Track TekLibre
Expires: September 11, 2019 March 10, 2019

The Some Congestion Experienced ECN Codepoint
draft-morton-taht-SCE-00

Abstract

This memo reclassifies ECT(1) to be an early notification of congestion on ECT(0) marked packets, which can be used by AQM algorithms and transports as an earlier signal of congestion than CE. It is a simple, transparent, and backward compatible upgrade to existing IETF-approved AQMs, [RFC3138], and nearly all congestion control algorithms.

Status of This Memo

This Internet-Draft is submitted in full conformance with the provisions of BCP 78 and BCP 79.

Internet-Drafts are working documents of the Internet Engineering Task Force (IETF). Note that other groups may also distribute working documents as Internet-Drafts. The list of current Internet-Drafts is at https://datatracker.ietf.org/drafts/current/.

Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress."

This Internet-Draft will expire on September 11, 2019.

Copyright Notice

Copyright (c) 2019 IETF Trust and the persons identified as the document authors. All rights reserved.

This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (https://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Code Components extracted from this document must include Simplified BSD License text as described in Section 4.e of the Trust Legal Provisions and are provided without warranty as described in the Simplified BSD License.


Table of Contents

1. Terminology

The keywords MUST, MUST NOT, REQUIRED, SHALL, SHALL NOT, SHOULD, SHOULD NOT, RECOMMENDED, MAY, and OPTIONAL, when they appear in this document, are to be interpreted as described in [RFC2119].

2. Introduction

This memo reclassifies ECT(1) to be an early notification of congestion on ECT(0) marked packets, which can be used by AQM algorithms and transports as an earlier signal of congestion than CE ("Congestion Experienced").

This memo limits its scope to the redefinition of the ECT(1) codepoint as SCE, "Some Congestion Experienced", with few brief illustrations of how it may be used.

3. Background

[RFC3168] defines the lower two bits of the (former) TOS byte in the IPv4/6 header as the ECN field. This may take four values: Not-ECT, ECT(0), ECT(1) or CE.

Binary Keyword References

------------------------------------------------------------

 00     Not-ECT (Not ECN-Capable Transport)     [RFC 3168]
 01     ECT(1) (ECN-Capable Transport(1))       [RFC 3168]
 10     ECT(0) (ECN-Capable Transport(0))       [RFC 3168]
 11     CE (Congestion Experienced)             [RFC 3168]

Research has shown that the ECT(1) codepoint goes essentially unused, with the "Nonce Sum" extension to ECN having not been implemented in practice and subsequently obsoleted by some RFCBLA. Additionally, known [RFC3168]-compliant senders do not emit ECT(1), and compliant middleboxes do not alter the field to ECT(1), while compliant receivers all interpret ECT(1) identically to ECT(0). These are useful properties which represent an opportunity for improvement.

Experience gained with 7 years of [RFC8290] deployment in the field suggests that it remains difficult to maintain the desired 100% link utilisation, whilst simultaneously strictly minimising induced delay due to excess queue depth - irrespective of whether ECN is in use. This leads to a reluctance amongst hardware vendors to implement the most effective AQM schemes because their headline benchmarks are throughput-based.

The underlying cause is the very sharp "multiplicative decrease" reaction required of transport protocols to congestion signalling (whether that be packet loss or CE marks), which tends to leave the congestion window significantly smaller than the ideal BDP when triggered at only slightly above the ideal value. The availability of this sharp response is required to assure network stability (AIMD principle), but there is presently no standardised and backwards-compatible means of providing a less drastic signal.

4. Some Congestion Experienced

As consensus has arisen that some form of ECN signaling should be an earlier signal than drop, this Internet Draft changes the meaning of ECT(1) to be SCE, meaning "Some Congestion Experienced". The above ECN-field codepoint table then becomes:

Binary Keyword References

------------------------------------------------------------

 00     Not-ECT (Not ECN-Capable Transport)     [@RFC3168]
 01     SCE (Some Congestion Experienced)       [This Internet-draft]
 10     ECT (ECN-Capable Transport)             [@RFC3168]
 11     CE (Congestion Experienced)             [@RFC3168]

This permits middleboxes implementing AQM to signal incipient congestion, below the threshold required to justify setting CE, by converting some proportion of ECT codepoints to SCE ("SCE marking"). Existing [RFC3168] compliant receivers MUST transparently ignore this new signal, and existing middleboxes MUST still be able to convert SCE to CE as they presently do for ECT, thus ensuring backwards compatibility.

Additionally, SCE-aware middleboxes MUST still produce CE markings as at present, retaining current behavior with [RFC3168] ECN endpoints.

Permitted ECN codepoint packet transitions by middleboxes are:

   	Not-ECT ->   Not-ECT or DROP
   	ECT     ->   ECT or SCE or CE or DROP
   	SCE     ->   SCE or CE or DROP
   	CE      ->   CE or DROP

In other words, for ECN-aware flows, the ECN marking of an individual packet MAY be increased by a middlebox to signal congestion, but MUST NOT be decreased, and packets MUST NOT be altered to appear to be ECN-aware if they were not originally, nor vice versa. Note however that SCE is numerically less than ECT, but semantically greater, and the latter definition applies for this rule.

New SCE-aware receivers and transport protocols must continue to apply the [RFC3168] interpretation of the CE codepoint, that is, to signal the sender to back off send rate to the same extent as if a packet loss were detected. This maintains compatibility with existing middleboxes, senders and receivers.

New SCE-aware receivers and transport protocols should interpret the SCE codepoint as an indication of mild congestion, with the relative incidence of ECT and SCE codepoints received indicating the relative severity of such congestion, and respond accordingly by applying send rates intermediate between those resulting from a continuous sequence of ECT codepoints, and those resulting from a CE codepoint.

Details of how to implement SCE awareness at the transport layer will be left to additional Internet Drafts yet to be submitted.

To maximise the benefit of SCE, middleboxes MUST be capable of producing SCE markings earlier than they presently produce CE markings.

5. Examples of use

5.1. Cubic

Consider a TCP transport implementing CUBIC congestion control. This presently exhibits exponential cwnd growth during slow-start, polynomial cwnd growth in steady-state, and multiplicative decrease upon detecting a single CE marking or packet loss in one RTT cycle.

With SCE awareness, it might exit slow-start upon detecting a single SCE marking, switch from polynomial to Reno-linear cwnd growth when the SCE:ECT ratio exceeds 1:2, halt cwnd growth entirely when it exceeds 1:1, and implement a Reno-linear decline when it exceeds 2:1, in addition to retaining the sharp 40% decrease on detecting CE. In ideal circumstances, this would result in the cwnd stabilising at a level which produces between 50% and 66% SCE marking at some bottleneck on the path.

5.2. TCP receiver side handling

SCE can potentially be handled entirely by the receiver and be entirely independent of any of the dozens of [RFC3168] compliant congestion control algorithms.

A SCE TCP aware receiver MAY (based on a setsockopt option) choose to interpret SCE markings as CE and send ECE notifications back to the sender based on the ratio of markings, and/or variations in TCP timestamps.

A TCP receiver desiring low latency SHOULD respond with ECE signals earlier, one desiring higher bandwidth, later.

Alternatively, a SCE aware reciever can attempt to infer what congestion control is being used on the sender side of the connection.

5.3. Other

New transports under development such as QUIC SHOULD implement a multi-bit and finer grained signal back to the sender based on SCE.

6. Related Work

[RFC8087] [RFC7567] [RFC7928] [RFC8290] [RFC8289] [RFC8033] [RFC8034]

7. IANA Considerations

There are no IANA considerations.

8. Security Considerations

There are no security considerations.

9. Acknowledgements

Much thanks to the members of the "cake" bufferbloat.net mailing list, the ietf AQM mailing list, and tsvwg.

10. References

10.1. Normative References

[RFC2119] Bradner, S., "Key words for use in RFCs to Indicate Requirement Levels", BCP 14, RFC 2119, DOI 10.17487/RFC2119, March 1997.
[RFC3138] Meyer, D., "Extended Assignments in 233/8", RFC 3138, DOI 10.17487/RFC3138, June 2001.

10.2. Informative References

[RFC3168] Ramakrishnan, K., Floyd, S. and D. Black, "The Addition of Explicit Congestion Notification (ECN) to IP", RFC 3168, DOI 10.17487/RFC3168, September 2001.
[RFC7567] Baker, F. and G. Fairhurst, "IETF Recommendations Regarding Active Queue Management", BCP 197, RFC 7567, DOI 10.17487/RFC7567, July 2015.
[RFC7928] Kuhn, N., Natarajan, P., Khademi, N. and D. Ros, "Characterization Guidelines for Active Queue Management (AQM)", RFC 7928, DOI 10.17487/RFC7928, July 2016.
[RFC8033] Pan, R., Natarajan, P., Baker, F. and G. White, "Proportional Integral Controller Enhanced (PIE): A Lightweight Control Scheme to Address the Bufferbloat Problem", RFC 8033, DOI 10.17487/RFC8033, February 2017.
[RFC8034] White, G. and R. Pan, "Active Queue Management (AQM) Based on Proportional Integral Controller Enhanced PIE) for Data-Over-Cable Service Interface Specifications (DOCSIS) Cable Modems", RFC 8034, DOI 10.17487/RFC8034, February 2017.
[RFC8087] Fairhurst, G. and M. Welzl, "The Benefits of Using Explicit Congestion Notification (ECN)", RFC 8087, DOI 10.17487/RFC8087, March 2017.
[RFC8289] Nichols, K., Jacobson, V., McGregor, A. and J. Iyengar, "Controlled Delay Active Queue Management", RFC 8289, DOI 10.17487/RFC8289, January 2018.
[RFC8290] Hoeiland-Joergensen, T., McKenney, P., Taht, D., Gettys, J. and E. Dumazet, "The Flow Queue CoDel Packet Scheduler and Active Queue Management Algorithm", RFC 8290, DOI 10.17487/RFC8290, January 2018.

Authors' Addresses

Jonathon Morton Bufferbloat.net Kökkönranta 21 PITKÄJÄRVI, 31520 FINLAND Phone: +358 44 927 2377 EMail: chromati...@gmail.com
David M. Täht TekLibre 20600 Aldercroft Heights Rd Los Gatos, Ca 95033 USA Phone: +18312059740 EMail: d...@taht.net
> _______________________________________________
> Cake mailing list
> Cake@lists.bufferbloat.net
> https://lists.bufferbloat.net/listinfo/cake

_______________________________________________
Cake mailing list
Cake@lists.bufferbloat.net
https://lists.bufferbloat.net/listinfo/cake

Reply via email to