Re: [whatwg] RFC: Alternatives to storage mutex for cookies andlocalStorage

Chris Jones Fri, 04 Sep 2009 13:32:59 -0700

Mike Wilson wrote:

Interesting. I've been following this discussion as my
experience is that it is *extremely* hard to make an
invisible locking mechanism that is to provide bothconsistency and performance (no lockouts).So far it seems a silver bullet hasn't been found.
Your suggestion is in line with what I would expect from asolution that makes a "best effort" compromise between themulti-tab browsing experience and the burden put onapplication authors.
What if cookies are accessed between beginTransaction() and
commitTransaction(), would it make sense to throw forupdates with side-effects here as well? (Even though this
would not be the case if done outside the transaction.)
In some cases it may be helpful to get this "side-effect
notification" for cookies as well...

I would prefer that cookies and localStorage not interact in this way.It seems confusing to have cookies be "sometimes transactional,sometimes not," although your proposal is certainly feasible.

The side-effect ("stomp") notification for cookies seems like aseparate, and good, idea, irrespective of localStorage.


Cheers,
Chris

Best regards
Mike Wilson

Chris Jones wrote:
I'd like to propose that HTML5 specify different schemes than aconceptual global storage mutex to provide consistency guarantees forlocalStorage and cookies.
Cookies would be protected according to Benjamin Smedberg'spost in the"[whatwg] Storage mutex and cookies can lead to browser deadlock"thread. Roughly, this proposal would give scripts aconsistent view ofdocument.cookie until they completed. AIUI this is strongerconsistencythan Google Chrome provides today, and anecdotal evidencesuggests eventheir weaker consistency hasn't "broken the web."
localStorage would be changed in a non-backwards-compatible way. Ibelieve that web apps can be partitioned into two classes: those thathave planned for running concurrently (single-event-loop or not) inmultiple "browsing contexts", and those that haven't. Ifurther positthat the second class would break when run concurrently in multiplecontexts regardless of multiple event loops, and thusregardless of thestorage mutex. Even in the single-event-loop world, sitesnot preparedto be loaded in multiple tabs can stomp each other's data even thoughscript execution is atomic. (I wouldn't dare use my bank'swebsite intwo tabs at the same time in a single-event-loop browser.) In otherwords, storage mutex can't help the second class of sites.
(I also believe that there's a very large, third class of pages thatwork "accidentally" when run concurrently in multiple contexts, eventhough they don't plan for that. This is likely because theydon't keepquasi-persistent data on the client side.)
Based on that, I believe localStorage should be designed withthe firstclass of web apps (those that have considered data consistency acrossmultiple concurrent contexts) in mind, rather than the secondclass. Isa conceptual global storage mutex the best way for, say, gmail toguarantee consistency of its e-mail/contacts database? Idon't believeso: I think that a transactional localStorage is preferable.Transactional localStorage is easier for browser vendors to implementand should result in better performance for web apps in multi-processUAs. It's more of a burden on web app authors than thehidden storagemutex, but I think the benefits outweigh the cost.
I propose adding the functions

   window.localStorage.beginTransaction()
   window.localStorage.commitTransaction()
or
   window.beginTransaction()
   window.commitTransaction()
(The latter might be preferable if we later decide to addmore resourceswith transactional semantics.)
localStorage.getItem(),. setItem(), .removeItem(), and .clear() wouldremain specified as they are today. beginTransaction() would do justthat, open a transaction. Calling localStorage.*() outsideof an opentransaction would cause a script exception to be thrown; this wouldunfortunately break all current clients of localStorage.There might becleverer ways to mitigate this breakage by a UA pretending not tosupport localStorage until a script called beginTransaction().
yieldForStorageUpdates() would no longer be meaningful and should beremoved.
A transaction would successfully "commit", atomically applying itsmodifications to localStorage, if localStorage was notmodified betweenbeginTransaction() and commitTransaction(). Note that a transactionconsisting entirely of getItem() could fail just as those actuallymodifying localStorage. If a transaction failed, the UAwould throw aTransactionFailed exception to script. The UA would beallowed to throwthis exception at any time between beginTransaction() andcommitTransaction().
There are numerous ways to implement transactional semantics.Single-event-loop UAs could implement beginTransaction() andcommitTransaction() as no-ops. Multi-event-loop UAs could reuse theglobal storage mutex if they had already implemented that(beginTransaction() == lock, commitTransaction() == unlock).
Some edge cases:
* calling commitTransaction() without beginTransaction()would throwan exception
* transactions would not be allowed to be nested, even on differentlocalStorage DBs. E.g. if site A's script begins a transaction onA.localStorage, and calls into site B's script embedded in an iframewhich begins a transaction on B.localStorage, an exceptionwould be thrown.
* transactions *could* be spread across script executions, alert()dialogs, sync XHR, or anywhere else the current HTML5 specrequires thestorage mutex be released. Note that UAs wishing to forbid thatbehavior could simply throw a TransactionFailed exception where thestorage mutex would have been released in the current spec. Or thiscould be made illegal by the spec.
* it's not clear to me how to handle async XHRs and Worker messagessent from within a failed transaction. They could be specified to besent or not and either behavior implemented easily. My gut tells methat they *should* be sent regardless.
Feedback very much desired.

Cheers,
Chris
Addendum: I think that a past argument against atransactional approachwas that scripts can cause side effects during transactionsthat can'tbe (easily, performantly) rolled back. This is true, andtroubling inthat it deviates from SQL semantics, but because this proposal isdesigned for the first class of web apps I don't believe it's acompelling argument. Further, a script can only corrupt itsbrowsing-context-local state by mishandling failedtransactions. Usinggmail as a convenient example, if a transaction failed butgmail wasn'tprepared to handle the failure, that particular gmail instance wouldjust break. No e-mails or contacts would be corrupted, and the usercould reload gmail and regain full functionality. Servers shouldalready be prepared to deal with clients behaving unpredictably.

Re: [whatwg] RFC: Alternatives to storage mutex for cookies andlocalStorage

Reply via email to