Hi Yaakov, Thanks for the clarifications, that helps a lot to position correctly your draft. As You have wrote: "Determinism is not the main goal of this draft" what was not clear from the text.
I may be somewhat mis-leaded by the name You have used for the concept. TSN is widely used in IEEE as "Time-Sensitive Networking" and its main target is deterministic communication. You may consider to use a different abbreviation to avoid that confusion. E.g., "srtsc" (segment routing time sensitive communication) or anything else You like. Many thanks Bala'zs From: Yaakov Stein <yaako...@rad.com> Sent: Monday, March 8, 2021 8:27 AM To: Balázs Varga A <balazs.a.va...@ericsson.com>; det...@ietf.org; spring@ietf.org; p...@ietf.org Subject: RE: new draft on segment routing approach to TSN Bala'zs, Sorry that I never got back to you on this. It is much easier to shoot back replies to shorter emails :) > 1, What is the node behavior if deadtime expires? Nothing special is done if a local deadline is exceeded. There is still hope that the time can be made up. If too many packets fail to conform to their overall budget, then maybe something has changed and reoptimization (or realizing that the application budget is not feasible) is required. > 2, Do I understand it correctly that the described solution is practically a > new per-hop-behavior? Well, new to here but hardly "new" (similar things were done in ATM...) The new per-router behavior requires a new data structure in the router instead of a FIFO queue, but there is no absolute requirement for other tools. As I just hinted and have mentioned in a previous email, later on we may need some OAM to see if configuration needs to be updated. I have already experimented with periodic RSVP-like synthetic packets that collect timestamps along the path. > 3, Design of a guaranteed upper bound usually requires deterministic arrival > at each node along the path. Here is where we differ (or at least put different emphasis on the word "usually". Determinism is not the main goal of this draft. Meeting challenging delay budgets is. Sometimes absolute determinacy is required (e.g., in some forms of "teleprotection" in some electricity distribution networks) in which case either TDM/Qbv/calendaring can be used, or this method coupled with a differential delay compensation buffer. I agree that PDV increases from hop to hop to an extent, but when the delay budget is really challenging you simply can't allow that many hops. In mixed 5G xHaul cases in which I have been involved there are no more than 4 hops (usually fewer). If you wish something more deterministic then look at the Andrews-Zhang strategy. They gate at the ingress router and EDF at all the following ones. They show that (statistically) they ride "green waves" and thus the PDV doesn't increase significantly (see their graphs). This comes at the expense of a long cycle time for the initial gate, which means the budget can't be too tight. > 4, Wire-speed timestamping and calculation I come from the PTP world where hardware timestamping with submicro resolution is common. Since the timestamp is on the rising edge of the first bit, i.e., before the classifier, it is carried out and placed in the packet metadata for every packet whether or not it is PTP. I always shed tears over throwing out this information. The RSVP-like method just mentioned simply copies it into the packet. The SRTSN packets use this information in the scheduler. I have consulted my ASIC people about the use of priority heaps and insert-queues and they ensure me that they can put something like that into our 100G chip. > 5, Time gated Queuing [802.1Qbv] I didn't say that in principle you can't build a Qbv system that does great things. It is flexible enough to do anything you want. The problem is the scalability. I have played around with optimizing it and haven't found a good "on line" algorithm, that is, a way of adding a new flow to an existing optimized design. And even in the "off line" case (clean slate with all flows known) slight changes and even bad clocks in the user equipment ruins things. But the main idea of this draft is that even given a good on-line algorithm for Qbv you would need to distribute the changes in the all the schedules to all the routers. What happens in the meantime? What happens when one router is already updated but another isn't? With the stack method only a single ingress router needs to be updated and consistency is ensured! > 6, Deadtime calculation is also complex if impact of other DetNet flows must > be considered. Of course! The offset vector needs to be optimized. I have several such methods and am running simulations of various strategies. As stated in the ID the specific calculation was for illustration purposes - the stack mechanism is much more generic. > 7, Impact of some DetNet functions I have not yet gotten to the point where I have considered this. > 8, The ever-green topic ( and not the green-wave :--))) ): label stack size. Yes, I didn't want to bring this up yet, but have been forced to. I have to admit that I am not a fan of SRv6 for just this reason. But, the SRTSN stack can be really small! SR stack entries need only be ceil ( log2 (number of routers) ) in size (requires a little local information base in the router) assuming once the stack is popped you use the original DA. If you are not willing to use this additional information you still only need sufficient suffixes, which is still small. The deadline entries can be miniscule since the wraparound constraint is derived from the maximum flight time of a packet in the network. In my designs I have gotten away with 16-32 bits per entry, which means 4-8 hop stacks only require the room of a single IPv6 address. Hope this answers your questions and kicks off a fruitful discussion. Y(J)S From: Balázs Varga A <balazs.a.va...@ericsson.com<mailto:balazs.a.va...@ericsson.com>> Sent: 08/03/2021 08:35 To: Yaakov Stein <yaako...@rad.com<mailto:yaako...@rad.com>>; det...@ietf.org<mailto:det...@ietf.org>; spring@ietf.org<mailto:spring@ietf.org>; p...@ietf.org<mailto:p...@ietf.org> Subject: RE: new draft on segment routing approach to TSN CAUTION: External sender. Do not click links or open attachments unless you know the content is safe. Hi Yaakov, Any view on these comments? Thanks Bala'zs From: detnet <detnet-boun...@ietf.org<mailto:detnet-boun...@ietf.org>> On Behalf Of Balázs Varga A Sent: Friday, February 26, 2021 3:03 PM To: Yaakov Stein <yaako...@rad.com<mailto:yaako...@rad.com>>; det...@ietf.org<mailto:det...@ietf.org>; spring@ietf.org<mailto:spring@ietf.org>; p...@ietf.org<mailto:p...@ietf.org> Subject: Re: [Detnet] new draft on segment routing approach to TSN Hi Yaakov, many thanks for this well-written and information-rich draft. I have enjoyed to read it (and also the discussions on the lists). :--))) Conceptual (clarification) questions: 1, What is the node behavior if deadtime expires? Drop? That requires a quite good design/allocation of the latency budget. Otherwise it may result in unnecessary loss of a packet being relative late at a hop, however that might be compensated by the remaining hops. (E.g., see your Figure 2, getting 35 microsecond at R1 instead of the designed 25, but R2,R3,R4 adds only 10-10-10 extra microseconds). You may consider to add a "late flag" to the packet instead of a drop ... 2, Do I understand it correctly that the described solution is practically a new per-hop-behavior? Does it have no tools to "influence" the arrival time at the next hops? In end-to-end latency design the DetNet/TSN functions are often used to "adapt" the arrival of packets belonging to a DetNet flow/TSN stream at given network points. If "latency design" requires such a "time shift" then the described method has to be combined with other DetNet/TSN tools. 3, Design of a guaranteed upper bound usually requires deterministic arrival at each node along the path. In this solution the flow becomes less-an-less deterministic as it reaches the destination. I mean e.g., as per Figure 2, arrival time range at R1 is 2-2 usec, at R2 is 26-51 usec, at R3 is 70-120 usec and at R4 is 92-167 usec. So, the flow is getting less-and-less deterministic at each hop. That can make deterministic design of other flows passing R4 more difficult or even impossible in some scenarios. Have I missed something here? 4, Wire-speed timestamping and calculation: I would be interested in your view regarding how realistic are (1) the wire speed timestamping capability and (2) adding multiple calculated deadline to packets. Both are required for this solution. DetNet flows can have high bandwidth, not like 1588 packets. Specific comments (Section 2-3): 5, Time gated Queuing [802.1Qbv] allows multiple gates to be open simultaneously. Furthermore preemption [802.1Qbu] can be combined with Qbv. So I cannot fully agree with your evaluation of time-aware scheduling. (Section 2, "However, time-aware scheduling suffers from two major disadvantages.") Of course with a bad design one may shoot in his/her own leg, but in well-designed scenarios efficiency loss can be eliminated without expensive computation. 6, Deadtime calculation is also complex if impact of other DetNet flows must be considered. The pre-calculation of individual "local" deadlines may need to be re-calculated when flows added to the network and they are using some common links. So I cannot agree with your statement. (Section 3, "... Since the ingress router inserts the deadline stack into the packet headers, no other router needs to be aware of the requirements of the time sensitive flows. Hence admitting a new flow only requires updating the information base of the ingress router."). Adding a flow may result in updating many ingress routers' configuration. Some topics for further discussions/considerations: 7, Impact of some DetNet functions (e.g., PREOF [RFC8655]) needs further discussions. The usage of replication/elimination impacts the deadline calculation. Disjoint paths used by member flows requires different label stack and deadlines. Furthermore, these path specific labels+deadlines must be added by the replication node (PRF). So at least, a combination of PRF with B-SID and computing/adding deadlines (ingress SRTSN function) seems to be necessary in your solution. 8, The ever-green topic ( and not the green-wave :--))) ): label stack size. You need multiple labels per hop and the label stack contains them for each hop. Could result in a quite big label stack to be pushed at the ingress (nx10s of labels). I am happy to have further discussions on this interesting idea Thanks & Cheers Bala'zs From: detnet <detnet-boun...@ietf.org<mailto:detnet-boun...@ietf.org>> On Behalf Of Yaakov Stein Sent: Tuesday, February 23, 2021 2:14 PM To: det...@ietf.org<mailto:det...@ietf.org>; spring@ietf.org<mailto:spring@ietf.org>; p...@ietf.org<mailto:p...@ietf.org> Subject: [Detnet] new draft on segment routing approach to TSN All, I would like to call your attention to a new ID https://www.ietf.org/archive/id/draft-stein-srtsn-00.txt<https://protect2.fireeye.com/v1/url?k=c3902149-9c0b184c-c39061d2-86d2114eab2f-ba16ddb9b86334c8&q=1&e=25a393fd-8aca-4a3b-914c-6ce612f6740b&u=https%3A%2F%2Feur01.safelinks.protection.outlook.com%2F%3Furl%3Dhttps%253A%252F%252Fwww.ietf.org%252Farchive%252Fid%252Fdraft-stein-srtsn-00.txt%26data%3D04%257C01%257Cyaakov_s%2540rad.com%257C270d392bafcf4742c92d08d8e1fc5c56%257Cf9047108cc2c4e4897a343fad1b3bf9d%257C1%257C0%257C637507821310056080%257CUnknown%257CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%253D%257C1000%26sdata%3Do332wbi4u5fb%252BKtUW1CsB3jz9OEd32h%252BazNaEDnDHJY%253D%26reserved%3D0> which describes using a stack-based approach (similar to segment routing) to time sensitive networking. It furthermore proposes combining segment routing with this approach to TSN resulting in a unified approach to forwarding and scheduling. The draft is information at this point, since it discusses the concepts and does not yet pin down the precise formats. Apologies for simultaneously sending to 3 lists, but I am not sure which WG is the most appropriate for discussions of this topic. * DetNet is most relevant since the whole point is to control end-to-end latency of a time-sensitive flow. * Spring is also directly relevant due to the use of a stack in the header and the combined approach just mentioned. * PCE is relevant to the case of a central server jointly computing an optimal path and local deadline stack. I'll let the chairs decide where discussions should be held. Y(J)S
_______________________________________________ spring mailing list spring@ietf.org https://www.ietf.org/mailman/listinfo/spring