On Mon, Jul 15, 2019 at 12:01 PM Antoine Pitrou <solip...@pitrou.net> wrote:
>
> On Mon, 15 Jul 2019 11:49:56 -0500
> Wes McKinney <wesmck...@gmail.com> wrote:
> >
> > For example, suppose we had a thread pool with a limit of 8 concurrent
> > tasks. Now 4 of them perform IO calls. Hypothetically this should
> > happen:
> >
> > * Thread pool increments a "soft limit" to allow 4 more tasks to
> > spawn, so at this point technically we have 12 active tasks
> > * When each IO call returns, the soft limit is decremented
> > * The soft limit can be constrained to be some multiple of the hard
> > limit. So if we have a hard limit of 8 CPU-bound threads, then we
> > might allow an additional 8 tasks to be spawned if a CPU bound thread
> > indicates that it's waiting for IO
>
> Well, there are two approaches to this:
>
> * the approach you are proposing
> * the approach where IO is done in separate worker threads so that we
>   needn't resize the main thread pool when IO is done
>
> Advantages of the second approach:
>
> * No need to dynamically resize the main thread pool (which may be
>   difficult to achieve in an efficient manner).
> * CPU-bound threads can stay pinned on the same HW cores and threads
>   most of the time, which is probably good for cache locality and to
>   avoid migration costs.
>
> Advantages of the first approach:
>
> * The programming model is probably simpler.
>
> Also, the first approach is not workable if e.g. TBB doesn't support it
> (?).

Agreed with both points. I'd like to investigate these approaches to
see what makes the most sense from a programming model and efficiency
/ performance standpoint.
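
To make the first approach a bit more concrete, here is a minimal sketch of
the kind of "soft limit" accounting described above. The names here
(SoftLimitGate, IOWait) are hypothetical, not existing Arrow APIs, and this
ignores how the actual worker loop and task queue would be wired up:

#include <condition_variable>
#include <mutex>

// Hypothetical sketch only: a counting gate with a hard limit on concurrent
// CPU-bound sections and a "soft" allowance that grows while tasks wait on IO.
class SoftLimitGate {
 public:
  explicit SoftLimitGate(int hard_limit) : limit_(hard_limit) {}

  // Block until a slot is free under (hard limit + soft allowance), then
  // claim it. A worker would call this before running a CPU-bound task.
  void AcquireCpu() {
    std::unique_lock<std::mutex> lock(mutex_);
    cv_.wait(lock, [this] { return active_ < limit_ + extra_; });
    ++active_;
  }

  void ReleaseCpu() {
    std::lock_guard<std::mutex> lock(mutex_);
    --active_;
    cv_.notify_one();
  }

  // RAII helper: while the holder is blocked on IO, allow one more task to
  // run. The extra capacity is capped at one hard limit's worth, as above.
  class IOWait {
   public:
    explicit IOWait(SoftLimitGate* gate) : gate_(gate) {
      std::lock_guard<std::mutex> lock(gate_->mutex_);
      if (gate_->extra_ < gate_->limit_) {
        ++gate_->extra_;
        granted_ = true;
      }
      gate_->cv_.notify_one();
    }
    ~IOWait() {
      std::lock_guard<std::mutex> lock(gate_->mutex_);
      if (granted_) --gate_->extra_;
    }
   private:
    SoftLimitGate* gate_;
    bool granted_ = false;
  };

 private:
  std::mutex mutex_;
  std::condition_variable cv_;
  const int limit_;
  int extra_ = 0;   // soft allowance while IO calls are in flight
  int active_ = 0;  // tasks currently running CPU work
};

A task would wrap each blocking IO call in an IOWait guard so another queued
task can start while it waits; whether something like this can sit on top of
TBB is exactly the open question above.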

Currently we have lots of code that looks like this (pseudocode):

function Func(State* mutable_state) {
   CPUTask1(mutable_state);
   IOTask1(mutable_state);
   CPUTask2(mutable_state);
   IOTask2(mutable_state);
   CPUTask3(mutable_state);
   ...
}

Either approach is going to require us to develop a programming model
where a task scheduler is passed into many functions, so such code has
to be refactored to push work into the scheduler rather than doing the
work in the current thread. You could certainly argue that we should
opt for an API that maximizes our flexibility with regard to
scheduling work (e.g. having separate thread pools for IO and CPU).
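
As a strawman for what that could look like (Scheduler, SubmitCpu and
SubmitIO are made-up names here, not an API we have), the function above
might be restructured so that each step hands the next one to the scheduler
instead of blocking the current thread:

#include <functional>

struct State;
void CPUTask1(State*);
void IOTask1(State*);
void CPUTask2(State*);

// Hypothetical interface: callers push work into the scheduler and say which
// kind of pool it belongs on.
class Scheduler {
 public:
  virtual ~Scheduler() = default;
  // Run `work` on the CPU-bound pool, then invoke `done`.
  virtual void SubmitCpu(std::function<void()> work,
                         std::function<void()> done) = 0;
  // Run `work` on the IO pool (or under a raised soft limit, depending on
  // which approach is chosen), then invoke `done`.
  virtual void SubmitIO(std::function<void()> work,
                        std::function<void()> done) = 0;
};

// The first few steps of Func, with each step scheduling the next when it
// completes rather than running everything inline.
void Func(State* mutable_state, Scheduler* scheduler,
          std::function<void()> on_finished) {
  scheduler->SubmitCpu(
      [=] { CPUTask1(mutable_state); },
      [=] {
        scheduler->SubmitIO(
            [=] { IOTask1(mutable_state); },
            [=] {
              scheduler->SubmitCpu([=] { CPUTask2(mutable_state); },
                                   on_finished);
            });
      });
}

The continuation-passing style is obviously awkward; futures or coroutines
could make the same structure read more linearly, but the underlying
requirement (push work into the scheduler rather than blocking inside it) is
the same either way.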

Task scheduling may also need to be aware of IO resource identities to
control concurrent reads of sources that are sensitive to that (e.g.
some filesystems may work fine when accessed by 16 threads in parallel,
whereas others will not).
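
For instance (again a hypothetical sketch, nothing like this exists today),
the IO side of the scheduler could keep a per-resource cap on in-flight
operations, keyed by some resource identity such as the filesystem:

#include <condition_variable>
#include <map>
#include <mutex>
#include <string>

// Hypothetical sketch: cap concurrent IO operations per "resource identity"
// (e.g. per filesystem), so a local SSD can serve many parallel reads while a
// more sensitive source is limited to a few.
class IOResourceLimiter {
 public:
  void SetLimit(const std::string& resource_id, int max_concurrent) {
    std::lock_guard<std::mutex> lock(mutex_);
    limits_[resource_id] = max_concurrent;
  }

  // Block until a slot is free for this resource, then claim it.
  void Acquire(const std::string& resource_id) {
    // Arbitrary default for resources with no explicitly configured limit.
    constexpr int kDefaultLimit = 8;
    std::unique_lock<std::mutex> lock(mutex_);
    cv_.wait(lock, [&] {
      auto it = limits_.find(resource_id);
      int limit = (it != limits_.end()) ? it->second : kDefaultLimit;
      return in_flight_[resource_id] < limit;
    });
    ++in_flight_[resource_id];
  }

  void Release(const std::string& resource_id) {
    std::lock_guard<std::mutex> lock(mutex_);
    --in_flight_[resource_id];
    cv_.notify_all();
  }

 private:
  std::mutex mutex_;
  std::condition_variable cv_;
  std::map<std::string, int> limits_;
  std::map<std::string, int> in_flight_;
};

How the resource identities are derived (URI scheme, filesystem instance,
something else) and where the limits get configured is part of the
programming-model question below.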

Probably we need to figure out at least what the programming model
ought to look like so we can start refactoring old code (e.g.
parquet-cpp internals) and writing new code in a more
concurrency-minded way.

>
> Regards
>
> Antoine.
>
>
