Fwd: A distributed James server and SelectedMailboxImpl statefulness

Tellier Benoit Sun, 25 Oct 2015 17:12:01 -0700

Disclaimer : the following message is long, and might take time to read,but I think this is a topic we have to exchange on in order to have aworking James in a distributed environment...


=============================================================

Hi every one,

I am working on a distributed event system for the mailbox. The idea isto make the IDLE functionality work with an arbitrary large number ofservers.

To work on this feature, I had a look to the different types ofMailboxListener. I found this one : SelectedMailboxImpl.

SelectedMailboxImpl is stateful : it maintains (among other things) thecorrespondence between UIDs and Message Sequence Number ( called messageindex in SelectedMailbox ).

When the mailbox is first selected, all the UIDs and flags for messagein this mailbox are fetch and used to compute the UID for the givenMessage Sequence Number. The actual index is event updated using themailbox event system.

Well, I have troubles to see how to make this work in a distributedsystem. Message systems do not offer perfect guaranties and we might geta lot of troubles in case of network partitions. Double event delivery,or no event delivery at all might arise. It is not that bad with IDLE,but can lead SelectedMailboxImpl to be inconsistent. I guess we haveseveral options :


# Go stateless on this (note : only for distributed implementation)

## Option 1

- We can recompute the correspondence between UID and MessageSequence Number each time a message sequence number is used. It mightcost compute and network resources. But no node stores specific data. Wecan imagine give a configuration option, that gives the choice betweenthe two options.


Depending on the implementation, we get different trade-off :
Option 1 a)

On a read request using Message Sequence Number, select all thedata (all the message content) from the database, and select the messagewe want. It is consistent but highly ineffective. Note : this works forReads, but not deletions...

Option 1 b)

On a read request we first fetch the mailbox to have informationsabout the UID to be used, and then gets the message data. But due todelay between read and write, our information can become inconsistent.Thus we can do some serious damages (eg : delete the wrong message)


## Option 2

- Store the message sequence number and update it. To do this in aconsistent way, we need a CP data store with the notion of transactionand attach the Message Sequence Number directly to the Message, storedin the database. It works, we have ineffective queries like **UPDATEmessages SET seq_number = seq_number - 1 WHERE seq_number > deleted ANDmailboxId=159**. We might have to handle transaction that fails to commit.Option 2 is still dangerous on databases that lacks the notion oftransaction. For example a process can crash before updating thesequence number. On Cassandra and other AP data stores, we haveconsistencies problems on concurrent updates of sequence number (mightlead to a wrong result, and even messages having the same sequence number).

Other note : adding a message requires to know the last UID used for amailbox that stills correspond to an existing message. Here we have twooptions : store it or recompute it. In both case we have troubleswithout the notion of transactions.


# Keep it stateful

## Option 3 :

- Say that a mailbox should be handled by a single James. Thiscan't be achieved threw load balancing. Here is a little user story thatshows it :


Bob and Alice uses James.
Bob give the right to Alice to see and add messages to his INBOX

Bob and Alice are handled by different servers. Bob is on James 1 andAlice on James 2. All there new connections will go to these server.

Bob connects to his INBOX. Alice connects to Bob's INBOX. Problem.

Collocating Bob and Alice ( and more generally user sharing rights) ishard but also impossible, as this graph might not be disconnected.

The solution with this option might be to proxy Alice commands relatedto Bob INBOX selection to Bob's server. To do this, we need to knowwhich server is in charge of Bob. A solution might be ConsistentHashing, but it is complicated to implement. Without such a thing,sharing mailboxes across users might be difficult.


With this option we have other concerns :

- How is our solution better than a Cyrus implementation if we havethe same load balancing troubles ?

 - We should document how to deploy such a load balancer

Obviously, with this, we do not need the distributed event system...

## Option 4

Don't care. I don't like this one. Even if inconsistent sequence numbershave a session lifetime, it can lead to critical scenarios like deletingthe wrong e-mail.


## Option 5

Don't create a distributed implementation. Well, as my involvement onJames project is about to make it distributed, I definitely down votethis option.


# Avoid the problem

## Option 6

On distributed implementation, partly implement IMAP and refuse requestsbased on sequence numbers. Seems stupid, but this is the onlyimplementation that works safely with Cassandra that comes to my mind...


# Option 7

If you have other ideas...

Fwd: A distributed James server and SelectedMailboxImpl statefulness

Reply via email to