Re: MVar semantics: proposal

Jan-Willem Maessen Fri, 31 Mar 2006 09:49:26 -0800

John -

You are, in effect, proposing a memory model for MVars and IORefs.The high-level model for programmers is "In order to communicate databetween threads, you *must* use an MVar, and never an IORef."

But the devil is in the details. I'd like to strongly urge *against*adopting the extremely loose model you have proposed. The followingthings seem particularly important:

* reads and writes to IORefs should be atomic, meaning either acomplete update is observed or no change is observed. In the absenceof this guarantee, misuse of IORefs can cause programs to crash inunrepeatable ways. If the machine doesn't make this easy, theimplementor ought to sweat a little so that Haskell programmers don'thave to sweat at all.

* I assume forkIO constitutes a sequence point. I suspect throwTo etal ought to as well.

* I would urge that atomicModifyIORef constitute a sequence point---Isuspect it loses a great deal of its utility otherwise.

Now, on to more difficult issues... Consider the following example(untested):


data RefList a = Nil | Cons a (IORef (RefList a))

cons :: a -> RefList a -> IO (RefList a)
cons x xs = do
  a <- newIORef xs
  return (Cons x a)

hd :: RefList a -> a
hd (Cons a _) = a

tl :: RefList a -> IO (RefList a)
tl (Cons a t) = readIORef a

setTl :: RefList a -> RefList a -> IO ()
setTl (Cons a t) t' = writeIORef t t'

main = do a <- cons 'a' Nil
          forkIO $ do
            c <- cons 'c' Nil
            b <- cons 'b' Nil
            setTl b c
            setTl a b
          at <- tl a
          case at of
            Nil -> return ()
            Cons _ _ -> do
              putChar (hd at)
              att <- tl at

This program is, by your informal model, buggy. The question isthis: how badly wrong is it?Let's say at happens to read b. Is (hd at) well defined? That'sassuming very strong consistency from the memory system already. Howabout the IORef in at? Is that fully allocated, and properlyinitialized? Again, if it is, that implies some pretty strongconsistency from the memory system.

Now, what about att? By your argument, it may or may not be c. Wecan ask the same questions about its contents assuming it happens tobe c.

People have talked a lot about weakly-ordered NUMA machines for morethan a decade, and they're always just a couple of years away. Inpractical terms, non-atomic NUMA memory models tend to be so hard toprogram that these machines have never found any traction---you needto throw away all of your software, including your OS, and startafresh with programmers that are vastly more skilled than the oneswho wrote the stuff you've already got.

My feeling is that the purely-functional portion of the Haskelllanguage already makes pretty stringent demands of memoryconsistency. In light of those demands, and the fact that mutablestate is used in pretty tightly-controlled ways, it's worthconsidering much stronger memory models than the one you propose.I'd even go so far as to say "IORefs and IOArrays are sequentiallyconsistent". The only argument against this behavior is their use inthe internals of arrays, file I/O, the FFI, etc., etc. (though reallyit's all about IOUArrays in the latter cases) where we mightconceivably pay a bundle in performance.

Another possibility is an algebraic model based on commuting IOactions. That approach is a particular bias of mine, having tangledwith these issues extensively in the past. It'd go something like this:* Any data written to an IORef can safely be read by anotherthread; we cannot observe

      partially-written objects.
  * readIORef commutes with readIORef.
  * newIORef commutes with newIORef.

* writeIORef and newIORef commute with writeIORef or readIORef toa different IORef.

  * Nothing commutes with readMVar, writeMVar, or atomicModifyIORef.
  * Nothing before a forkIO can be commuted to after forkIO.

I think it's a Good Idea to choose a model that is conceptuallysimple now, at the cost of imposing a few constraints onimplementors, rather than a complex specification which permitsmaximum implementation flexibility but is utterly opaque.Realistically, the machines which are likely to be built will make iteasy to comply with a strong specification.


-Jan-Willem Maessen
_______________________________________________
Haskell-prime mailing list
[email protected]
http://haskell.org/mailman/listinfo/haskell-prime

Re: MVar semantics: proposal

Reply via email to