Re: [Standards] Proposed XMPP Extension: Roster Versioning

Dave Cridland Thu, 06 Mar 2008 03:56:10 -0800

On Thu Mar  6 09:45:17 2008, Richard Dobson wrote:

A few hours of testing does not a reliable protocol make. Underwhat
conditions? Is this a public server that people can test against?
No you are correct, but even so the tone of some of the messages onthis subject as far as I read them said that it wouldn't work, andunder my limited testing it does seem to. I can make it publicallyaccessible if you like and actually want to and will have a go.

You can't rely on testing, sadly; you need to prove that it works.

Timestamps probably do work in most cases, as long as the updates tothe roster are atomically and sequentially performed on a singlepoint, and your implementation is sufficiently slow that timestampsare obtained at a maximum frequency lower than their resolution. Thisis simply because then you have a strictly increasing sequence.

However, timestamps don't always work like that even on a singlemachine. It just happens that they usually do. I'm not terribly keenon making the specification define a new RFC 2119 entry candidate forMUST USUALLY. ;-)

"It's a lot more efficient with an int, and everything else iseitherworse performance, fails entirely, or else is equivalentdifficulty to
an int."
What sort of things are these? Im not trying to say im right andhe's wrong but id like a more full explanation on the points aboveand below so I can understand why Dave thinks it will break,because as far as I can see all the problems pointed out so far inthis thread can be easily overcome.

If you modify a timestamp value, or for that matter generate it insuch a way that it's strictly increasing, then everything *will*work. But since you can model that trivially as a non-negativeinteger, it's not a problem. But since you can, it's a lot easier tosimply use an arbitrary strictly increasing integer sequence anyway.

"One interesting point is that with an int, there's alwayssomething aclient can use to get the entire roster, with versioning turnedon."
Yea it can just omit the version attribute.

Then it doesn't get versioning. (BTW, I hate that name - there are noversions kept. It's not like a client can ask for a previous version,and nor does a server need to keep any previous versions).

"Put it this way, since the counter-proposal involved timestamps,which
are known to be broken, I'm pretty sure people will get stuff wrong
without it being a MUST."
Why are they broken?

See previous messages. They're not strictly increasing, nor are theystrictly non-decreasing, nor are they even unique.

And [3]:

"You can't use timestamps - they're not strictly increasing, for
various reasons.
Why does it have to be strictly increasing, even if it was atimestamp?

Because you want them to have the property that for any known state,the server can produce a delta to the current state. Now, as ithappens, we allow in this specification for a server to give up andprovide the complete roster, and given this escape hatch, "unique" issufficient if you wanted to produce a really poor implementation.

However, you do need to ensure it really is unique, and givenmulti-core multi-threading clustered servers, about the simplest wayis to just use a strictly increasing integer sequence. After all, youneed some central point to perform atomic updates of the roster,right?

Now, if what you have is a sequence which usually goes up, butsometimes goes down, and is not unique, then updates can simply belost to a client.

Firstly, two roster changes could happen at precisely the same
moment. To be fair, by introducing cluster node identifiers, and
having a strict strong ordering of them, you could avoid this.
Why is it a problem if two updates share the same versionidentifier? Couldn't they not just become part of a single atomicchange?

That's "strictly non-decreasing", and neither "unique" nor "strictlyincreasing".

The problem here is that a roster push only contains one item, so tworoster pushes would involve the same sequence value. (See rfc3921bis,Section 2.1.4)


Given this, you need to ensure that either:

a) There is some way to indicate to the client that it now has allroster pushes up to and including the sequence value. (ACAP style, ifyou're wondering, since ACAP also has atomic updates spanningmultiple notifications). This involves an additional stanza, and ifit's lost, a client will observe all the previous pushes with thesame sequence value repeated - as such, it's inherently lessefficient on dodgy connectivity.

b) You cheat and lower the sequence value included in the rosterpushes, such that the sequence value the client next produces toresynchronize will include all those pushes - effectively using asubsequent push as an indicator to the client that all pushesrelating to previous sequence values have now been received.

(b) is less efficient than (a), which has more failure cases thansimply using a strictly increasing sequence. (b) will always send theclient data it already has.

Secondly, the clock on a computer can, and surprisingly often does,
go backwards. That's a much harder problem to solve.
As previously requested could you describe this further or point tosome more information on this so I can understand how much of aproblem this actually is.

It happens. Really and truly, it happens. How often isn't really thepoint.


What happens is typically this:

1) The machine receives updates from NTP.
2) Typically, it adjusts its clock skew to converge on the real time.
3) But, if the clock is sufficiently wrong, it's simply moved.
4) This might be backwards.

Some machines are also configured to move the clock to an NTP derivedtime on reboot - again, if reboots are sufficiently fast, and theclock's offset was sufficiently large, that can cause the clock toreverse.

Finally, even with a timestamp that never goes backwards, you need toensure that all members of a clustered setup use the same source, andmoreover test that the obtained timestamp is higher than any previoustimestamp obtained, so I just fail to see any advantage, really.

Thirdly, in a clustering situation, you'd have to ensure that the
time on each cluster node was perfectly synchronized.
No they don't as previously pointed out (the database layer couldgenerate all the ids).

Yes, and it can generate integer ids just as well, if not better.

So the closest you can do would be a modified timestamp that had
additional logic during generation to ensure it never wentbackwards,
in which case you don't need the cluster identifier anymore, and
that's effectively the same as having a strictly increasing integer
sequence anyway, so it's easier to just do that. But even if youdid
want to use timestamps, just representing them as an integer is
pretty trivial. Look at the definition of "modtime" in ACAP (RFC
2244), which defines a strictly increasing modified timestamp
represented using digits."
Yup exactly so the issue about clocks going backwards can be easilyovercome then.

"represented using digits". Feel free to do this if you want. RFC2244 section 3.1.1, although you'll need to fix the subsecondaccuracy, so it's comparable as an int rather than a string. But:

1) I would note that the same guys who designed that essentiallyported it to IMAP as CONDSTORE, and dropped the entire timestampthing, making it an integer sequence instead. (*NOT* a contiguousinteger sequence, please note - many of the IMAP CONDSTORE examplesuse ACAP modtime formats)

2) I've written an ACAP server. Getting modtimes right is neither assimple, nor as fast, as getting an integer sequence right. And it'sstill a strictly increasing integer sequence. (Although it says"strictly ascending"). In fact, I know there are failure cases in mycode, I just hope they're sufficiently unlikely as to not affectanyone. Given that I'm probably the only user of my ACAP server, I'mprobably safe.

I've got a fondness for doing it this way, as it happens, but I'dcertainly not mandate it, nor encourage anyone else to do the same.

Is there any chance we can use BASE64 or something to compress theversion identifiers in the roster pushes and whatnot?, just like isdone with the hashes in caps? Or do we think its probably notreally going to be a problem in the future? Granted as an autoincrementing integer it probably wont be much of an issue unlessyou have an incredibly busy roster, just trying to plan ahead.

It won't compress them if they're integers until you get over 9999,since base64 comes in multiples of four characters. So the best wayfor efficiency for the majority is to ensure you use a compactsequence.

FWIW, I'm not convinced that the additional traffic here is worthworrying about at all either way.


Dave.
--
Dave Cridland - mailto:[EMAIL PROTECTED] - xmpp:[EMAIL PROTECTED]
 - acap://acap.dave.cridland.net/byowner/user/dwd/bookmarks/
 - http://dave.cridland.net/
Infotrope Polymer - ACAP, IMAP, ESMTP, and Lemonade

Re: [Standards] Proposed XMPP Extension: Roster Versioning

Reply via email to