On Jun 20, 2005, at 11:17 PM, James M Snell wrote:

The thought here then is that feeds would not be considered atomic units and that <entry /> elements can be pulled as is out of a containing <feed /> element and passed around independently of it.

That's basically the idea, yes.

That really doesn't seem to square with some basic XML signature principles and other Atom conventions, such as the ability to omit the author element from a contained <entry /> if the containing feed has an author...

Which XML-DSig principles does this violate? It seems to me that if you come across a signed node, as long as you don't break the well-formedness of it, you can do what you like with it.

As for Atom conventions, such as the omission of key information from an <entry>---yes, this is a complication. One approach would be to make my Atom feeds "disassemblable", such that each entry found in the feed could equally stand on its own as an Atom Entry Document.
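As a sketch of what "disassemblable" might mean in practice (this is my illustration, not anything from the Atom spec; the feed content and the name "Alice" are invented), a publisher could copy feed-level metadata such as <author> down into any entry that lacks it before signing, so each entry stands alone as an Atom Entry Document:

```python
import copy
import xml.etree.ElementTree as ET

ATOM = "http://www.w3.org/2005/Atom"
ET.register_namespace("", ATOM)

feed_xml = f"""<feed xmlns="{ATOM}">
  <title>Example Feed</title>
  <author><name>Alice</name></author>
  <entry><title>First post</title></entry>
</feed>"""

feed = ET.fromstring(feed_xml)
feed_author = feed.find(f"{{{ATOM}}}author")

# Copy the feed-level <author> into every entry that does not carry
# its own, so each entry can stand alone as an Atom Entry Document.
for entry in feed.findall(f"{{{ATOM}}}entry"):
    if feed_author is not None and entry.find(f"{{{ATOM}}}author") is None:
        entry.append(copy.deepcopy(feed_author))

entry = feed.find(f"{{{ATOM}}}entry")
print(ET.tostring(entry, encoding="unicode"))
```

The entry now carries its own <author> and could be signed and republished independently of the feed.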

Then again, this may not be such a big deal. In the case of an aggregator it's an issue, but in the peers-sharing-entries case, it's conceivable that each peer already knows the <feed> level metadata and can refer to it when necessary. (Perhaps the peers periodically---but rarely!---poll the conventional Atom feed, or perhaps they exchange the entire <feed> contents among themselves if it too has been signed by the original publisher.)

So the question with regard to enabling aggregation services is whether or not those services could even exist without performing a level of processing against the feed and entries that would necessarily break the digital signature. In other words, if the entry in the first example above were to be included in an aggregate feed containing entries from multiple authors, the <author /> element from its containing feed would need to be added to the entry element's collection of metadata (<entry>...</entry> becomes <entry> ... <author /> ... </entry>), thereby invalidating any signature calculated over the entry.

I agree that destructively processing the entries is almost certain to cause trouble (and relates directly to your canonicalization argument below). This might not be such a big deal; aggregators might simply pass along what users can verify as authentic, placing the pressure on original Atom content producers (i.e. publishers and their software) to create "self-sufficient" entries.
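To make the breakage concrete (a toy illustration, with invented entry content): any byte-level change to a signed entry, such as an aggregator inserting an <author> element, changes the digest the signature was computed over, so verification fails:

```python
import hashlib

# The entry as originally serialized and signed by the publisher.
original = b"<entry><title>First post</title></entry>"

# The same entry after an aggregator has "helpfully" copied the
# feed-level author into it.
modified = b"<entry><title>First post</title>" \
           b"<author><name>Alice</name></author></entry>"

digest_before = hashlib.sha256(original).hexdigest()
digest_after = hashlib.sha256(modified).hexdigest()

# The digests differ, so a signature over the original bytes no
# longer verifies against the aggregator's modified copy.
print(digest_before == digest_after)  # False
```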

One would also have to contend with the potential problems introduced by namespace declarations within the feed. The bottom line is that an entry with a signature could not simply be copied over to a new containing feed element with the signature intact, making the aggregator scenario unworkable.

Ugh. I don't have an answer for the namespaces problem. (XML isn't doing me any favors here. Or perhaps the reverse is more true?)

I suppose that in my mind, an Atom feed is mostly a vector (pun intended) for carrying Atom entries. After all, the entries are what the user (or end processor, or what have you) is really interested in! They are stable, mostly-unchanging, "atomic" (!) bits of data that can be exchanged and stored and reasoned about. On the other hand, the feed is always changing, even if most of its data remains the same; as a sliding window of Atom "events", it's a moving target, almost entirely uninteresting in itself from the perspective of processing applications. Digital signatures just bring this point to the fore.

Which brings me to the following heresy: Wouldn't it be nice if an Atom feed were just a list of self-contained Atom Entry Documents? Yes, some data will be duplicated between items, but there's a tremendous flexibility benefit, to my mind. Entries are now much more loosely coupled to one another, and can survive on their own in any context, for any purpose.

(The reason I think that this is a *useful* flexibility is that when I look at current applications of newsfeeds, I see a strong focus on entries. Popular newsreaders spend the bulk of their UI energy on listing and presenting entries, because that's where the user's focus is. The "objects" in the system, from the user's perspective, are the entries. The feed is just the box they came in. How long before newsreaders allow users to clip and save their favorite entries, individually, in a *different* box---like saving email messages to a folder? Even this simple operation is made difficult by binding entries tightly to their feeds. But I digress.)

The only potential way around this problem would be to define a standard canonicalization mechanism for Atom entries that would make it possible to reliably sign and verify them across multiple feeds.

I agree that canonicalization would also solve this. (Just to be sure we're on the same page: XML Canonicalization is woefully insufficient for this domain-specific task.) You might view the concept of an Atom Entry Document as a start in this direction.
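To illustrate what such a mechanism would buy us, here is a deliberately toy canonical form (my invention; it is NOT XML-C14N and NOT anything defined by the Atom spec): parse the entry, drop inter-element whitespace, sort attributes, and re-serialize deterministically, so that superficially different serializations hash to the same digest:

```python
import hashlib
import xml.etree.ElementTree as ET

def canonical_bytes(entry_xml: str) -> bytes:
    """Toy canonical form: normalized whitespace, sorted attributes,
    deterministic serialization. Illustrative only."""
    root = ET.fromstring(entry_xml)

    def serialize(el):
        attrs = "".join(f' {k}="{v}"' for k, v in sorted(el.attrib.items()))
        text = (el.text or "").strip()
        children = "".join(serialize(c) for c in el)
        return f"<{el.tag}{attrs}>{text}{children}</{el.tag}>"

    return serialize(root).encode("utf-8")

# Two serializations that differ only in whitespace and attribute
# order canonicalize to the same bytes, so a digest (and hence a
# signature) computed over the canonical form survives re-serialization.
a = '<entry type="text" lang="en"><title>Hi</title></entry>'
b = '<entry lang="en"  type="text" >\n  <title>Hi</title>\n</entry>'
print(hashlib.sha256(canonical_bytes(a)).hexdigest() ==
      hashlib.sha256(canonical_bytes(b)).hexdigest())  # True
```

A real mechanism would of course have to handle namespaces, mixed content, character references, and the rest of XML's sharp edges, which is exactly why a defined standard is needed.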

Unless such a canonicalization mechanism is defined, it would appear as if there would be no way of ensuring that an individual entry within a synthesized aggregate feed is indeed "authentic" unless (a) it contains a self-referential pointer back to a digitally signed version of itself, (b) the synthesized feed in which it is contained is digitally signed by a trusted entity, and (c) the version of the entry contained in the synthesized feed is identical to its digitally signed reference copy.

That seems pretty complicated. Also, it requires the client, having received an entry signed by an aggregator, to go fetch the entry from the original server in order to get the "authentic" version (signed by the originator). In that case, why bother distributing the content through the aggregator at all? The aggregator could save some effort and just publish a list of URLs for the clients to fetch.

[...] The key challenge with this approach is that one would really have to trust the aggregator in order for it to work.

Right, which is the kind of trust I'm trying to avoid relying on.

Another challenge is the fact that the Atom specification only accounts for Signature elements at the document level -- e.g. as a child of the top-level <feed /> or <entry /> element -- and not at the child-entry level.

Yes. This has always struck me as an unnecessary limitation in the spec. I chalked it up to getting 1.0 out the door, and trying to hash out all the ugly details of signatures later (in which case, mission accomplished).

So... given all this... I think I'm going to make an assertion and open that assertion up for debate: The need for an end-to-end trust model for Atom capable of traversing any number of intermediaries is largely a myth. What is really needed is a simple mechanism for protecting feeds against spoofed sources (e.g. a man-in-the-middle serving up a bogus feed) and for indicating that content is trustworthy* on the document level (as opposed to the individual-entry level).

* By which I mean, for example, that binary enclosures are trustworthy, that the feed itself does not contain any malicious content, etc.

If indeed end-to-end authenticity is unnecessary, then I agree: document-level trust is all you need.

However, providing only document-level authenticity locks Atom (as a data format) out of other distribution models. In essence, you're stuck with either "client polls source for entire feed" or "client gets entire feed from somewhere else". There's no opportunity to send deltas, and there's no opportunity to combine feeds. (Unless security is unimportant to Atom applications, in which case there's no reason to sign feeds or entries at all.)

---dan, kicking in way more than his two-cent quota

