If anyone has ideas, suggestions, or code to help define a new concurrency API for D, please post.

I think that the correct way to handle concurrency is through a library. D is flexible enough that a library can be used, and one can even write special handlers for special cases.

In blip (http://dsource.org/project/blip) I actually did exactly that, in D 1.0, for two reasons:
1) I need 64 bits.
2) Concurrency is complex, and I already had a hard time with bugs in the stable branch; furthermore, the possibility of language changes breaking very sensitive code that I don't want to touch too often was not appealing to me.

That said, there is one feature that would help D 1.0 a lot, and that I would really love to have.
I know that D 1.0 is frozen and so on... but I will ask all the same:
real closures from delegates when you put a "new" in front of them, e.g.
        new delegate(ref int x){ x += y; }
would create a closure (heap-allocating y).
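
To make the intent concrete, here is a sketch of how I imagine using it (hypothetical syntax, this compiles nowhere today; makeAdder is just an illustrative name):

    // hypothetical D 1.0 extension: "new" before a delegate literal
    // heap-allocates the captured frame, so y survives the function
    int delegate(int) makeAdder(int y){
        return new delegate(int x){ return x + y; };
    }
    // without the "new", in D 1.0 the returned delegate would point
    // into makeAdder's stack frame, which is dead after the return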

I know that D 2.0 has basically that by default (and scope to get the previous behavior), but as I said D 2.0 is not yet an option for me (I want to work on my projects; I think I am already doing enough for the community). So I thought that asking for a feature that does not break existing D 1.0 code, and is in some way already implemented in D 2.0, could be worth trying :)

The other D 2.0 features are nice: I do like struct constructors/destructors, postblits, template constraints, const... Well, shared I am not so sure about, but anyway none of them are needed for concurrency.

Now about the concurrency infrastructure. Here I will discuss SMP parallelization; there is also a more coarse-grained parallelization that needs another approach (MPI and an agent-based model; for now I just wrapped MPI and serialization in a way that could also be implemented directly on TCP, and that allows one to easily communicate arbitrary objects).

Concurrency has two sides: one is the user/programmer side, the other the efficient realization. I will discuss the user-level API as I realized it in Blip, which is optimized for recursive operations that have to be finished (i.e. computations to be performed, not simulations of concurrent systems). The idea is to split each task into chunks that are as small as possible, while still being large enough that the switching time is small with respect to the computing time. This subdivision typically does not directly depend on the number of processors.

----
Task is a class that represents a task; it has a string name (for debugging purposes) and can be initialized with a delegate, a function, a fiber or a generator (in two flavors).
There are some optimizations to make allocation cheaper.

A task can spawn subtasks, and it "knows" how many of its subtasks are executing.

A task is considered finished when the task itself is complete and all its subtasks are completed.

You can append operations to be executed after a task has finished executing.

You can wait for a task to finish (but try to avoid it; appending the follow-up operation to the onFinish of the task is much more efficient).
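
For example, something along these lines (the method names appendOnFinish and wait, and the helpers computeX/consumeResult, are only illustrative; the actual Blip names may differ):

    Task t = Task("computeX", &obj.computeX);
    // append a continuation instead of blocking: runs once t and all its
    // subtasks have finished (method name is illustrative)
    t.appendOnFinish(delegate void(){ consumeResult(); });
    t.submit();
    // waiting is also possible, but it suspends the caller and is less efficient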

A task has a given level; subtasks have level+1, and tasks that cannot have subtasks have a very high level (int.max/2).

A new task can be submitted with
t.submit()
or t.submitYield()
submitYield submits the new task and possibly suspends the current one. Together with the fact that tasks with a higher level are executed before tasks with a lower level, this means that execution is preferentially a depth-first reduction, which minimizes the number of suspended tasks. (I also have task stealing that preferentially steals the tasks with the lowest level, but that part is not yet really used.)
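
In code the difference is just this (work stands for some delegate, as in the larger example further below):

    Task("child", &work).autorelease.submit();      // enqueue the subtask, keep running the caller
    Task("child", &work).autorelease.submitYield(); // enqueue the subtask, possibly suspend the caller here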

Other important operations are delay and resubmitDelayed, which allow one to delay a task: for example, when you have to do I/O and you use a method like epoll, you can delay the current task, add the file handle to the ones being watched, and execute resubmitDelayed when that handle has new data.
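
Schematically the pattern looks like this (only delay and resubmitDelayed are the real task operations; currentTask and registerWithEpoll are stand-ins for whatever task handle and event loop one actually has, and the real Blip calls may look a bit different):

    // inside the task that wants to read from fd
    auto t = currentTask();              // stand-in: handle to the executing task
    registerWithEpoll(fd, delegate void(){
        t.resubmitDelayed();             // event loop: data arrived, reschedule t
    });
    t.delay();                           // suspend t; the scheduler runs other tasks
    readAvailableData(fd);               // resumes here after resubmitDelayed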

executeNow is useful to execute a task synchronously.

That is basically the core that one has to know; the library is actually much richer, but for typical usage this is enough.

With it one can write code like:

import blip.parallel.WorkManager;

class SubProblem{
    void execute(){
        if (problemIsLarge){                  // placeholder condition
            SubProblem firstHalf=...;         // build the first half (elided)
            Task("subproblem1",&firstHalf.execute).autorelease.submitYield();
            SubProblem secondHalf=...;        // build the second half (elided)
            Task("subproblem2",&secondHalf.execute).autorelease.submitYield();
        } else {
            // solve directly (base case)
        }
    }
}

void solveProblem(){
    if (problemIsLarge){
        SubProblem wholeProblem=...;          // build the full problem (elided)
        Task("solveProblem",&wholeProblem.execute).executeNow(default); // "default": the default scheduler (pseudocode)
    } else {
        // solve directly
    }
}

This just shows a very basic divide-and-conquer approach.
