So, you're really looking for entry-level, time-based invalidation, no?

I guess the simplest way to do this would be to dereference the link and see if you get a 404/410; if you do, you know it's no longer good.
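A minimal sketch of that check, assuming the client has the entry's link URL on hand (the helper names here are mine, purely illustrative, not from any spec):

```python
from urllib.request import Request, urlopen
from urllib.error import HTTPError

# Status codes that signal the linked resource is no longer good.
GONE_STATUSES = {404, 410}

def is_gone(status: int) -> bool:
    """True if the status code means the entry should be treated as expired."""
    return status in GONE_STATUSES

def entry_still_valid(url: str) -> bool:
    """Dereference the entry's link with a HEAD request; 404/410 means expired."""
    try:
        with urlopen(Request(url, method="HEAD")) as resp:
            return not is_gone(resp.status)
    except HTTPError as e:
        return not is_gone(e.code)
```

As noted, this costs one request per entry per check, so it only makes sense at small scale or with infrequent polling.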

That's not terribly efficient, but OTOH managing metadata in multiple places is tricky, and predicting the future doubly so :) Most people get expiration times really wrong. And clock sync becomes an issue as well.

I'd think that if you have reasonable control over the polling of the feed, and a solid enough state model (which might include an explicit deletion mechanism), you could have a similar effect by just removing the items from the feed when they expire, with the expectation that when they disappear from the feed, they disappear from the client. Would that work for your use case?
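To make that concrete, here is a rough sketch of the client-side state model being suggested, assuming the client remembers the set of entry IDs it saw on the previous poll (the names are illustrative, not part of any spec):

```python
def reconcile(previous_ids: set[str], current_ids: set[str]) -> tuple[set[str], set[str]]:
    """Compare two successive polls of the same feed.

    Entries present last time but absent now are treated as expired/deleted;
    entries present now but not before are new.
    """
    expired = previous_ids - current_ids
    new = current_ids - previous_ids
    return expired, new

# Example: the client saw offers a, b, c last poll; this poll has only b, c, d.
expired, new = reconcile({"a", "b", "c"}, {"b", "c", "d"})
# "a" has dropped out of the feed, so the client discards it; "d" is new.
```

The catch, as the caveat above implies, is that this only works if clients poll often enough to see every intermediate state of the feed.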


On 09/08/2005, at 9:07 PM, James M Snell wrote:


First off, let me stress that I am NOT talking about caching scenarios here... (my use of the terms "application layer" and "transport layer" was an unfortunate mistake on my part that only served to confuse my point)

Let's get away from the multiprotocol question for a bit (it never leads anywhere constructive anyway)... Let's consider an aggregator scenario. Take an entry from a feed that is supposed to expire after 10 days. The feed document is served up to the aggregator with the proper HTTP headers for expiration. The entry is extracted from the original feed and dumped into an aggregated feed. Suppose each of the entries in the aggregated feed is supposed to have its own distinct expiration. How should the aggregator communicate the appropriate expirations to the subscriber? Specifying expirations at the HTTP level does not allow me to specify expirations for individual entries within a feed.

Use case: an online retailer wishes to produce a "special offers" feed. Each offer in the feed is a distinct entity with its own terms and its own expiration: e.g. some offers are valid for a week, others for two weeks, etc. The expiration of the offer (a business-level construct) is independent of whether or not the feed is being cached (a protocol-level construct); publishing a new version of the feed (e.g. by adding a new offer to it) should have no impact on the expiration of prior offers published to the feed.
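For illustration, a per-entry expiration extension along these lines might look something like this (the `x:` namespace and the `x:expires` element are purely hypothetical, invented for this sketch; nothing here is defined by the Atom format):

```xml
<feed xmlns="http://www.w3.org/2005/Atom"
      xmlns:x="http://example.org/hypothetical-expiration">
  <title>Special Offers</title>
  <updated>2005-08-09T12:00:00Z</updated>
  <entry>
    <title>Offer A: valid one week</title>
    <updated>2005-08-09T12:00:00Z</updated>
    <!-- business-level expiration, independent of HTTP caching -->
    <x:expires>2005-08-16T12:00:00Z</x:expires>
  </entry>
  <entry>
    <title>Offer B: valid two weeks</title>
    <updated>2005-08-09T12:00:00Z</updated>
    <x:expires>2005-08-23T12:00:00Z</x:expires>
  </entry>
</feed>
```

Each entry carries its own expiration, and republishing the feed with a new offer leaves the earlier offers' expirations untouched.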

Again, I am NOT attempting to reinvent an abstract or transport-neutral caching mechanism, in the same sense that the atom:updated element is not attempting to reinvent Last-Modified, or that the via link relation is not attempting to reinvent the Via header, etc. They serve completely different purposes. The expires and max-age extensions I am proposing should NOT be used for cache control of the Atom documents in which they appear.

> I think we can declare victory here by simply a) using whatever caching mechanism is available, and b) designating a "won't change" flag.

Speaking *strictly* about cache control of Atom documents, +1. No document-level mechanisms for cache control are necessary.

- James


Mark Nottingham wrote:


HTTP isn't a transport protocol, it's a transfer protocol; i.e., the caching information (and other entity metadata) are *part of* the entity, not something that's conceptually separate.

The problem with having an "abstract" or "transport-neutral" concept of caching is that it leaves you with an awkward choice; you can either a) exactly replicate the HTTP caching model, which is difficult to do in other protocols, b) "dumb down" HTTP caching to a subset that's "neutral", or c) introduce a contradictory caching model and suffer the clashes between HTTP caching and it.

This is the same road that Web services sometimes try to go down, and it's a painful one; coming up with the grand, protocol-neutral abstraction that enables all of the protocol-specific features is hard, and IMO not necessary. Ask yourself: are there any situations where you *have* to be able to seamlessly switch between protocols, or is it just a "convenience?"

I think we can declare victory here by simply a) using whatever caching mechanism is available, and b) designating a "won't change" flag.






On 09/08/2005, at 11:53 AM, James M Snell wrote:


Henry Story wrote:


Now I am wondering if the http mechanism is perhaps all that is needed for what I want with the unchanging archives. If it is, then perhaps this could be explained in the Feed History RFC. Or are there other reasons to add an "expires" tag to the document itself?



On the application level, a feed or entry may expire or age independently of whatever caching mechanisms may be applied at the transport level. For example, imagine a source that publishes special offers in the form of Atom entries that expire at a given point in time. Now suppose that those entries are being distributed via XMPP and HTTP. It is helpful to have a transport-independent expiration/max-age mechanism whose semantics operate on the application layer rather than the transport layer.

- James






--
Mark Nottingham   Principal Technologist
Office of the CTO   BEA Systems







