Re: I-D ACTION:draft-nottingham-atompub-feed-history-00.txt

James M Snell Thu, 30 Jun 2005 13:48:43 -0700


Mark Nottingham wrote:

Hi James,

On 29/06/2005, at 10:09 AM, James M Snell wrote:
1. This appears to be addressed at solving the same problem as BobWyman's RFC3229+feed proposal [http://bobwyman.pubsub.com/main/2004/09/using_rfc3229_w.html]. Do you have any empiracle datasimilar to what Bob provides @ http://bobwyman.pubsub.com/main/2004/10/massive_bandwid.html that would indicate that your approachis a better solution to this problem? These are actually notmutually exclusive solutions, they're just different and could beused for different scenarios -- e.g. Bob's tends to make a lot ofsense for blog dashboard feeds like what we use within IBM to showall post and commenting activity within our internal blogs serverwhile your mechanism would work rather well for things like Top Tenlists, etc. I would just like to see a bit of a compare/contrast onthe two approaches.
It's orthoganal to RFC3229. The problem I'm solving is how toreconstruct the *entire* state of the logical feed, not just onepartial representation of it; although RFC3229 could be used to dothat, it would require feed authors to post the entire content oftheir feed (potentially, many megabytes). This would incur a hugeload, because any clients that don't support RFC3229 would have toGET the entire feed, leading to severe bandwidth problems.
To give a concrete example, Dave Winer would have to post one RSSfile containing every entry he's made in Scripting News for the past10+ years to use RFC3229 to meet the same goal; with this proposal,he'd just have to add a 'prev' to each archived feed (assuming he hasarchives around, which if he doesn't, I imagine he could reconstruct).

At times we do get spolied by the ability to dynamically generateresponses don't we ;-) You're obviously correct when it comes tostatically generated content - RFC3229+feed does not provide a workablesolution in that case.

2. Is the feed state mechanism a way of paging through the currentcontents of a collection or a snapshot-in-time view of a feed? Thatis...
   is it
A) Collection has a bunch of entries. Each feedrepresentation has 15 entries and the prev linkacts like a paging mechanism similar to what we seecurrently use in search results. Deletingthe first ten entries out of the collection would causeall of the entries in the feed to "shift backwards"
            in the feeds
B) Each prev link is representative of how the feed lookedat a given point in time. E.g. the feed as it would
             have appeared at a given hour of a given day
If it's A, then Bob's RFC3229+feed solution seems much moreefficient. (see #1)
If it's B, then I'm wondering why you don't just use an ETagbased approach, e.g.
      <fs:Stateful>1</fs:Stateful>
      <fs:prev>{ETag}</fs:prev>
This would allow clients to only ever have to deal with a singleURI for a feed and use conditional-gets with ETag to differentiatewhich snapshot of the feed they want to get and would likely make iteasier to remediate potential recursive reference attacks, (e.g.feed A references feed B which references feed C which is a blindredirect to Feed A).
This proposal doesn't handle deletion or other aspects of identity infeeds; I tried to introduce language like that earlier in Atomitself, but we failed to gain consensus around it.
How does an ETag help you locate a previous feed to reconstructstate? Even if it could, I'm not sure intermingling HTTP protocoldetails with application semantics; although there's nothing toprevent this theoretically, in many implementations, it might beproblematic to predict what the ETag is.

It's not so much using ETag to reconstruct state as much as it is toview access previous views of the feed. Btw, I threw this out fordiscussions sake and not because I think it's the "right" solution. I'mnot particularly in love with it myself.

3. Microsoft's RSS Lists spec uses <cf:treatAs /> to attachbehavioral semantics to a feed. This proposal uses <fs:Stateful />to attach behavioral semantics. It would be nice if we could comeup with a relatively simple and standardizable way of attachingbehavioral semantics. For example, a standardized <treatAs /> element:
   <atomex:treatAs>stateful</atomex:treatAs>
The value of the treatAs element would be a list of tokens withdefined semantics. Each token SHOULD be registered with IANA.Unknown tokens would be ignored. Incompatible tokens would beignored with first-in-the-list takes precedence semantics. Forexample:
   <atomex:treatAs>stateful list</atomex:treatAs>
Indicates that the feed should be treated as a list whose paststates can be queried using the kind of mechanism you've defined.
That seems like an awfully heavyweight solution. What does definingthe container and an IANA registry add?

The value is that I would really like to see a common and consistent wayof attaching behavioral semantics to the feed rather than eachindividual vendor / spec defining their own app and impl specificmethods. It could be done without IANA support, of course, but it'sjust annoying to see relatively similar tasks done in completelydifferent ways.


--
Mark Nottingham     http://www.mnot.net/

Re: I-D ACTION:draft-nottingham-atompub-feed-history-00.txt

Reply via email to