[DNSOP] state management related to TTL

Paul Vixie Tue, 14 Nov 2017 22:44:42 -0800

tonight's exchanges here related to "use-stale" seem discordant to me.i'd like to play the straight man for a moment and ask some indulgentperson to bring me up to speed by way of correcting my impressions.

the DNS TTL field is a state management variable. in this case the heldstate is in the form of cached RRsets, and the TTL associated with theRRset describes the period of time during which they can be reused. bythe original DNS specifications, after this reuse period, these RRsetsare to be discarded, and if the data is still needed, it is re-fetched.

in practice, TTL expiry is often not discovered until the records areabout to be reused; this avoids the cpu and memory bandwidth costs ofsweeping the cache periodically in search of expiration-ready RRsets,and avoids the additional state requirements of threading these RRsetsby TTL in addition to the standard cost of threading them by recency ofuse (to facilitate LRU based purge when the cache reaches its limit.)what this practice leads to is a "sudden concurrent need" for the RRsetat the precise moment when it is being discarded.

in order to avoid simultaneous "not having" and "great need", some RDNSservers do in fact sweep their caches or perhaps thread their RRsets byTTL expiration, in order to pre-launch a refreshment query when the TTLstill has some fraction (like 5%) or period (like one minute) remaining.this is non-ideal since we often find that we're refreshing data thatwill not be used soon or perhaps ever. work is underway by several teamsto find a "tuning set" of variables and thresholds which will betterpredict reuse in order to avoid refresh costs for non-reuse.

another method that's been deployed of avoiding simultaneous "don'thave" with "great need" is to liberally reinterpret TTL such that RRsetscan be reused beyond their explicit TTL lifetime, while their refreshqueries proceed in the background. commonly, the authority serversresponsible for answering these refresh events are down or unreachableat the time of most acute need. therefore the term "serve stale" toindicate a state management method whereby stale (beyond its TTL) datais served for some period of time, measured in minutes or hours, untilthe authority server can be reached to either refresh the RRsets orverify that they have in fact disappeared.

the danger of TTL stretching is that reuse beyond TTL may cause RRsetsthat are in fact supposed to be unreachable, to be effectivelyreachable. examples include security-related takedown of criminal DNSservers or networks, or failover strategies where end systems will nottry to reach their backup servers unless they cannot reach their primaryservers, and the unreachability of those primary servers is hidden fromthem by TTL stretching. fundamentally, an RRset and its TTL are theproperty of the zone administrator, and it's controversial for any otherparty to use this data beyond its specified use parameters.

all of this trouble comes from DNS's use of a single state variable(TTL) to represent usability lifetime, rather than two such variables,one indicating the periodicity of refresh, the other indicating theperiodicity of discard. many of us would like our data to be recheckedhourly by all caching servers who store it, but used for days or weeksif we become unreachable by some or all of those servers. using onevariable for two purposes represents an inconvenient compromise whichoften provides "no right answer" as to setting. therefore an idealizedsolution would be to provide a second variable, and where that secondvariable is present, the meaning of the existing variable (TTL) could besubtly altered to support a two-variable setting.

therefore a "serve stale" team within IETF-DNSOP was convened, to try tostandardize the methods and signal patterns necessary to extend theusability lifetime of records when their authority servers are notreachable at the time of normal TTL-based expiry. most of us recognizethat TTL's will continue to be stretched no matter what changes are orare not made to the specification, and so we expect the resulting RFC todocument current practice _without recommending it_ and to also documenta new practice _with recommendations_ as to its proper uses.

there are hangups in signaling options due to the sloppy specificationfor EDNS, about which the author of EDNS0 feels just awful, believe me.however, we are all relatively sure that EDNS can be used to encode adesire for new state management behaviour, within the limitation thatEDNS must first be signaled by the initiator before it can be answeredby a responder, and we might wish it otherwise. that's why it wasimportant to realize that if _any_ EDNS option is provided by aninitiator, then _any_ EDNS option can be provided by a responder. intheory this means we could provide state management options in aresponse without having heard any state management options in a request-- so long as some form of EDNS was in fact used in the request. it'snot yet clear that this evasive maneuver will be required, however.

the most straightforward signaling would be for an RD=0 initiator(normally a recursive DNS server) to ask some or all of its responders(normally authority servers) for permission to stretch the TTL. someresponders will not answer this signal at all, some will say no, andsome will say yes and give maximum tension values for the RRsetscontained in the answer and authority sections -- but not for theadditional section since that data might have a different authorityserver and may only be present as "glue". the new tension variable mightbe "maximum stretch interval" in which case the RRset's TTL _in thisanswer or authority section_ would be interpreted as a refresh interval.this system would allow gradual insertion of the new state managementlogic on an opportunistic basis -- motivated authority and recursiveserver operators, which would include CDN operators who must performboth services perfectly -- would be early adopters, and like ECS beforeit, the "hot" part of the community would be upgraded years earlier thanthe last outlier.

noone has proposed any new signaling between the stub and the recursive,but it's possible that a stub may want a true TTL and so we might addsignaling from the stub (as initiator) saying, don't stretch, or perhapssaying, if this is a stretched TTL, tell me so explicitly.

if this understanding isn't wrong or incomplete, then i fail to see whythere would be any drama that would prevent the construction of a draft.


--
P Vixie

_______________________________________________
DNSOP mailing list
DNSOP@ietf.org
https://www.ietf.org/mailman/listinfo/dnsop

[DNSOP] state management related to TTL

Reply via email to