[messaging] Matrix.org: Decentralized group chat

Erik Johnston Wed, 11 Mar 2015 04:23:43 -0700


On Sunday, March 8, 2015 00:13 GMT, [email protected] (Ximin Luo) wrote:

For a private dynamic group chat, the merging requirements are not exactly the 
same as in a collaborative environment like Google Wave:

- for a basic application, one does not *need* to merge arbitrary state. The
only thing you need to merge is an unordered set (the membership set).

- merging unordered sets is easy and a solved problem if you have full history. However, the fact
that this set represents "who has permission to see what history" makes the problem hard
again, in the context of the other constraints (secure, causally-consistent, decentralised). Merge
algorithms require history, but if someone can't see past history (e.g. after joining) then they
don't have enough information to execute the merge algorithm, nor to verify others' claimed merges.
(This is not necessarily an impossible problem, and indeed is the "last missing piece" in
the bundle of ideas I haven't yet written down.)

Briefly skimming over their spec: http://matrix.org/docs/spec/ "The state of the
room at a given point is calculated by considering all events preceding and including a
given event in the graph. Where events describe the same state, a merge conflict
algorithm is applied."

My impression too is that they underestimate how hard this will be. Also, their security practices
are questionable - they do have a list of threats in the spec, but there are no suggestions or even
hints on techniques on how they will defend against these threats. I guess they will add this stuff
in later, but as I'm sure everyone on this list knows, you can't just "add security
later". It sounds like they just did the equivalent of adding partial ordering to a federated
protocol similar to XMPP, using TLS "for security", and hoping that everything will be
naturally straightforward to conceptually lock down as they write the code.

--
GPG: 4096R/1318EFAC5FBBDBCE
git://github.com/infinity0/pubkeys.git

I've been working on a lot of the federation side of Matrix since its inception, and I can assure everyone that we really do know how hard these sorts of generic merge algorithms are. Hence why we've promptly not tried to solve them.

What we have done is attempted to solve a much simpler subset of the general problem. The basic requirement is along the lines of: "All servers in the group chat (that can talk to each other) will /eventually/ agree what the state of the room is." Note the careful omission of the word "correct".

(The state of the room, or more accurately the state of the room at any given event according to a given server, is simply a mapping between keys and nodes/events in the directed acyclic graph. The key is included in the event.)
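
As a rough illustration of that mapping (a sketch only, not Matrix's actual implementation): the field names `prev` and `state_key`, the example keys, and the simple "later event overrides earlier" walk are all assumptions made for this example. Conflict resolution between concurrent branches is deliberately left out, since that is the hard part discussed below.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass(frozen=True)
class Event:
    event_id: str
    prev: Tuple[str, ...]            # parent events in the DAG (hypothetical field name)
    state_key: Optional[str] = None  # None for non-state events such as messages


def state_at(event_id: str, events: Dict[str, Event]) -> Dict[str, str]:
    """Map each state key to the most recent event that set it, as of `event_id`."""
    order: List[str] = []
    seen = set()

    def visit(eid: str) -> None:
        if eid in seen:
            return
        seen.add(eid)
        for parent in events[eid].prev:
            visit(parent)
        order.append(eid)  # post-order: ancestors come before descendants

    visit(event_id)

    state: Dict[str, str] = {}
    for eid in order:
        key = events[eid].state_key
        if key is not None:
            state[key] = eid  # a later event replaces an earlier one for the same key
    return state


# A small linear history: two state events, a message, then a rename.
events = {
    "e1": Event("e1", (), "room.name"),
    "e2": Event("e2", ("e1",), "member:alice"),
    "e3": Event("e3", ("e2",)),
    "e4": Event("e4", ("e3",), "room.name"),
}
print(state_at("e4", events))
```

On a branching history, two events on different branches may set the same key; this sketch would silently pick one depending on traversal order, which is exactly where a real merge/conflict algorithm is needed.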


The main points to note are:

- We differentiate "auth events", i.e. events that affect authorization of "new" events. There are very few types of auth events, and Matrix doesn't support custom auth events.
- The merge algorithm of non-auth events, as Ximin points out, is relatively easy.
- The merge algorithm for auth events is designed so that maliciously branching and re-merging does not allow circumvention of auth changes. This is the hardest part of merging, and is heavily special-cased.
- Each event points to the auth events that allow that event to happen, separately to the full event DAG.
- All events are signed by the originating server, and all event relationships are protected by hashes (à la git).
- Events can be "redacted" by servers, where all non-protocol-relevant keys are stripped, e.g. message contents. This allows servers to effectively "delete" nasty messages without leaving a hole in the graph. This is possible since we include a hash of the full event, but servers only sign the hash and protocol-relevant keys rather than the full event.
- Every server is required to store the full auth chain of the events it sends (for at least as long as it stores the original event). The auth chain is simply all the auth events for that event, and the auth events for those, etc.
- The auth chain is enough for a server to accept that an event could have happened.
- Servers can tell each other if they think another server shouldn't have accepted an event and why. (For example, the sender of the event had been kicked between, in topological terms, the join event and the message event.)
- When a server joins the room, it gets given (by the server it is joining via) the other server's view of the current state and associated auth chains.
- Servers can throw away old history/events. They still have to keep all the auth events.
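
Two of the points above, the auth chain and redaction, can be sketched roughly as follows. This is an illustration under stated assumptions, not the Matrix wire format: the key names in `PROTOCOL_KEYS`, the `full_hash` field, the helper names, and the event types are hypothetical, and actual signing is omitted (the sketch only shows that the payload a server would sign is unchanged by redaction).

```python
import hashlib
import json
from typing import Dict, Set

# Hypothetical set of protocol-relevant keys; the real list is defined by the spec.
PROTOCOL_KEYS = ("event_id", "type", "sender", "auth_events")


def auth_chain(event_id: str, events: Dict[str, dict]) -> Set[str]:
    """All auth events for an event, the auth events for those, and so on."""
    chain: Set[str] = set()
    stack = list(events[event_id]["auth_events"])
    while stack:
        eid = stack.pop()
        if eid not in chain:
            chain.add(eid)
            stack.extend(events[eid]["auth_events"])
    return chain


def canonical(obj: dict) -> bytes:
    """Deterministic JSON encoding so that hashes are reproducible."""
    return json.dumps(obj, sort_keys=True, separators=(",", ":")).encode()


def finalize(event: dict) -> dict:
    """Attach a hash of the full event (computed before the hash field is added)."""
    return {**event, "full_hash": hashlib.sha256(canonical(event)).hexdigest()}


def signed_payload(event: dict) -> dict:
    """What the origin server would sign: protocol keys plus the full-event hash."""
    return {k: event[k] for k in PROTOCOL_KEYS + ("full_hash",)}


def redact(event: dict) -> dict:
    """Strip everything that is not protocol-relevant, e.g. message contents."""
    return signed_payload(event)


def ev(event_id: str, type_: str, auth: list, content: dict) -> dict:
    return {"event_id": event_id, "type": type_, "sender": "@alice:example.org",
            "auth_events": auth, "content": content}


events = {
    "create": ev("create", "create", [], {}),
    "join": ev("join", "member", ["create"], {"membership": "join"}),
    "power": ev("power", "power_levels", ["create", "join"], {}),
    "msg": ev("msg", "message", ["join", "power"], {"body": "hello"}),
}

print(sorted(auth_chain("msg", events)))

msg = finalize(events["msg"])
redacted = redact(msg)
# The signed payload survives redaction, so the signature would still verify.
print(signed_payload(redacted) == signed_payload(msg))
```

Because the signature covers only the hash and the protocol keys, a server that drops `content` can still prove the event's place in the graph, while a server that kept the full event can check it against `full_hash`.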

Thus, different servers can have different views of what the current state is if buggy or malicious servers are involved; for example, when joining a room your server can be given an incorrect view of the current state. However, the auth chain ensures that these views can only be incomplete or stale; servers cannot simply include arbitrary synthesized events.

If two servers discover that they disagree on the state (which they quickly will), then they will exchange their views and attempt to prove the other side wrong; by the end of this process two correctly functioning servers will come to an agreement on the current state. In effect, this provides a "healing" mechanism.

(This does mean that servers may initially accept an event, only to be subsequently shown that the event should have been rejected.)

The outcome of all this is (hopefully) that the subgroup of correctly functioning servers *will* eventually agree on the current state, and so on which events should be accepted and rejected.

Hopefully this clarifies a little what Matrix is trying to do and what the federation protocol does and doesn't aim to provide and guarantee.


Erik.

_______________________________________________
Messaging mailing list
[email protected]
https://moderncrypto.org/mailman/listinfo/messaging
