Re: [Haskell-cafe] You are in a twisty maze of concurrency libraries, all different ...

Patrick Caldon Fri, 04 Dec 2009 04:28:32 -0800

Neil Brown wrote:

Patrick Caldon wrote:
I'm looking for the "right" concurrency library/semantics for what should be a reasonably simple problem.
I have a little simulator:

runWorldSim :: MTGen -> SimState -> IO SimState
It takes about a second to run on a PC. It's functional except that it whacks the RNG, which needs IO. I run 5-10 of these jobs, and then use:
mergeWorld :: [SimState] -> SimState
to pick the best features of the runs and build another possible world (state). Then I use this new world to run another 5-10 jobs, and so on. I run this through ~20000 iterations.
It's an obvious place for parallelism.

I'm looking for a concurrency library with something like:

forkSequence :: Int -> [IO a] -> IO [a]

which I could call with something like this:

forkSequence 4 (take 10 (repeat  (runWorldSim g ss)))
This would construct 4 threads, then dispatch the 10 jobs onto the threads, and pack up the results into a list I could run through my merger.
Why particularly do you want to run the 10 jobs on 4 threads? Haskell's run-time is quite good at spreading out the lightweight threads onto all your cores, so the easiest thing to do is run the 10 jobs on 10 (lightweight) threads and let the run-time sort out the rest.


Thanks so much for that! I'll give it a go.

The limit on the number of threads is just because some of the jobs are memory hogs, and I want to minimize the number running simultaneously. I'll see what happens with a runPar-like approach, and use a queue-based approach if it becomes a problem.
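For reference, one way to get the bounded behaviour described above is to keep one lightweight thread per job but gate them with a semaphore, so that at most n jobs run at once. The following is only a sketch under that assumption: it does not literally create n worker threads, the helper name spawn is made up for illustration, and it uses QSem and MVar from base rather than a dedicated queue.

```haskell
import Control.Concurrent (forkIO)
import Control.Concurrent.MVar (MVar, newEmptyMVar, putMVar, takeMVar)
import Control.Concurrent.QSem (QSem, newQSem, signalQSem, waitQSem)
import Control.Exception (bracket_)

-- Run all jobs, allowing at most n of them to execute at the same time.
-- Results come back in the same order as the input list.
forkSequence :: Int -> [IO a] -> IO [a]
forkSequence n jobs = do
  sem  <- newQSem n            -- n "slots" for concurrently running jobs
  vars <- mapM (spawn sem) jobs
  mapM takeMVar vars           -- wait for each result in input order
  where
    -- spawn is a hypothetical helper: fork one lightweight thread that
    -- waits for a free slot, runs the job, and stores its result.
    spawn :: QSem -> IO a -> IO (MVar a)
    spawn sem job = do
      var <- newEmptyMVar
      _ <- forkIO $
             bracket_ (waitQSem sem) (signalQSem sem) (job >>= putMVar var)
      return var
```

With this sketch, the call from the original question would read forkSequence 4 (replicate 10 (runWorldSim g ss)). Note that a job which throws an exception never fills its MVar, so a production version would want to catch exceptions and report them to the caller.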

So if what you want is a function:

runPar :: [IO a] -> IO [a]
you can easily construct this. Shameless plug: my CHP library effectively has this function already, runParallel :: [CHP a] -> CHP [a] (CHP being a slight layer on top of IO). But you can do it just as easily with, say, STM. Here is a version where order doesn't matter (apologies for the point-free style):
import Control.Concurrent
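-- Newer versions of stm export their own modifyTVar; hide it to avoid a clash.
import Control.Concurrent.STM hiding (modifyTVar)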
import Control.Monad

modifyTVar :: TVar a -> (a -> a) -> STM ()
modifyTVar tv f = readTVar tv >>= writeTVar tv . f

runPar :: [IO a] -> IO [a]
runPar ps
 = do resVar <- newTVarIO []
      mapM_ (forkIO . (>>= atomically . modifyTVar resVar . (:))) ps
      atomically $ do res <- readTVar resVar
                      when (length res < length ps) retry
                      return res
If order does matter, you can zip the results with an index, and sort by the index afterwards. If efficiency matters, you can perform other tweaks. But the principle is quite straightforward. Or you can refactor your code to take the IO dependency out of your random number generation, and run the sets of pure code in parallel using the parallel library. If all you are using IO for is random numbers, that's probably the nicest approach.
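The index-and-sort variant mentioned above might look roughly like the following. This is a sketch, not code from the thread: the name runParOrdered is made up here, and it assumes an stm version that provides modifyTVar' and a base that provides Data.List.sortOn.

```haskell
import Control.Concurrent (forkIO)
import Control.Concurrent.STM
  (atomically, modifyTVar', newTVarIO, readTVar, retry)
import Control.Monad (forM_, when)
import Data.List (sortOn)

-- Like runPar, but each result is tagged with the position of its job,
-- and the results are sorted by that position before being returned.
runParOrdered :: [IO a] -> IO [a]
runParOrdered ps = do
  resVar <- newTVarIO []
  -- Fork one lightweight thread per job; each pushes (index, result).
  forM_ (zip [0 :: Int ..] ps) $ \(i, p) -> forkIO $ do
    r <- p
    atomically $ modifyTVar' resVar ((i, r) :)
  -- Block until every job has delivered its result.
  indexed <- atomically $ do
    res <- readTVar resVar
    when (length res < n) retry
    return res
  return (map snd (sortOn fst indexed))
  where
    n = length ps
```

As with the unordered version, a job that throws an exception never adds its result, so the final transaction would wait forever; handling that is left out to keep the sketch short.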

Good, fast random numbers are unfortunately necessary - I had a nice implementation using System.Random, but had to rewrite it because performance was poor :( .

P.S. take 10 . repeat is the same as replicate 10


Thanks again!

Patrick.