Greetings all. I'm writing this to ask for help from the general development community. We've run into a problem with Linux processor affinity, and although I've individually talked to a lot of people about this, no one has been able to come up with a solution. So I thought I'd open this to a wider audience.

This is a long-ish e-mail; bear with me.

As you may or may not know, Open MPI includes support for processor and memory affinity. There are a number of benefits, but I'll skip that discussion for now. For more information, see the following:

http://www.open-mpi.org/faq/?category=building#build-paffinity
http://www.open-mpi.org/faq/?category=building#build-maffinity
http://www.open-mpi.org/faq/?category=tuning#paffinity-defs
http://www.open-mpi.org/faq/?category=tuning#maffinity-defs
http://www.open-mpi.org/faq/?category=tuning#using-paffinity

Here's the problem: there are 3 different APIs for processor affinity in Linux. I have not done exhaustive research on this, but which API you have seems to depend on your kernel version, glibc version, and/or Linux vendor (i.e., some vendors appear to port different versions of the API to their particular kernel/glibc). The issue is that all 3 versions of the API use the same function names (sched_setaffinity() and sched_getaffinity()), but they differ in the number and types of the parameters to these functions.
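For the curious, here's roughly what the three variants look like (this is from memory, so take the exact details with a grain of salt; the point is that only one of these is ever declared in <sched.h> on any given system, and code written against one of the others won't compile or won't behave against it):

    /* Illustration only -- three flavors of the same two function names.
       Only one flavor exists on a given system. */

    /* Flavor 1: length in bytes + bare unsigned long bitmask
       (older kernels/glibc) */
    int sched_setaffinity(pid_t pid, unsigned int len, unsigned long *mask);
    int sched_getaffinity(pid_t pid, unsigned int len, unsigned long *mask);

    /* Flavor 2: no length argument at all (seen with some glibc 2.3.3 builds) */
    int sched_setaffinity(pid_t pid, const cpu_set_t *mask);
    int sched_getaffinity(pid_t pid, cpu_set_t *mask);

    /* Flavor 3: size of the cpu_set_t + cpu_set_t pointer (current glibc) */
    int sched_setaffinity(pid_t pid, size_t cpusetsize, const cpu_set_t *mask);
    int sched_getaffinity(pid_t pid, size_t cpusetsize, cpu_set_t *mask);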

This is not a big problem for source distributions of Open MPI -- our configure script figures out which one you have and uses preprocessor directives to select the Right stuff in our code base for your platform.
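As a rough sketch of what the source build ends up doing (the HAVE_* macro names below are made up for this example -- the real configure-generated symbols are named differently):

    #define _GNU_SOURCE
    #include <sched.h>       /* sched_setaffinity(), cpu_set_t, CPU_* macros */
    #include <sys/types.h>   /* pid_t */

    /* Sketch only: configure decides which one of these macros is set,
       so exactly one branch is compiled for the build platform. */
    static int bind_to_cpu(pid_t pid, int cpu)
    {
    #if HAVE_SCHED_AFFINITY_CPUSET_3ARG
        /* Flavor 3: (pid, size_t, cpu_set_t *) */
        cpu_set_t mask;
        CPU_ZERO(&mask);
        CPU_SET(cpu, &mask);
        return sched_setaffinity(pid, sizeof(mask), &mask);
    #elif HAVE_SCHED_AFFINITY_CPUSET_2ARG
        /* Flavor 2: (pid, cpu_set_t *) */
        cpu_set_t mask;
        CPU_ZERO(&mask);
        CPU_SET(cpu, &mask);
        return sched_setaffinity(pid, &mask);
    #else
        /* Flavor 1: (pid, unsigned int, unsigned long *) */
        unsigned long mask = 1UL << cpu;
        return sched_setaffinity(pid, sizeof(mask), &mask);
    #endif
    }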

What *is* a big problem, however, is that ISVs therefore cannot ship a binary Open MPI installation and reasonably expect its processor affinity support to work across multiple Linux platforms. That is, if an ISV compiles for API #X and ships the binary to a system that has API #Y, there are two options:

1. Processor affinity is disabled. This means that the benefits of processor affinity won't be visible (not hugely important on 2-way SMPs, but increasingly important as the number of processors/cores grows), and Open MPI's NUMA-aware collectives can't be used (because memory affinity may not be useful without processor affinity guarantees).

2. Processor affinity is enabled, but the code invokes API #X on a system with API #Y. This has unpredictable results: the best case is that processor affinity is simply [effectively] ignored; the worst case is that the application fails (e.g., seg faults).

Clearly, neither of these options is attractive.

My question to the developer crowd out there: can you think of a way around this? More specifically, is there a way to know -- at run time -- which API to use? We can do some compiler trickery to compile all three APIs into a single Open MPI installation and then dispatch at run time to the Right one, but this is contingent upon being able to determine which API to dispatch to. A bunch of us have poked around the system (e.g., in /proc and /sys) but have not found anything that indicates which API you have.
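To make the run-time dispatch idea concrete, here's a hypothetical skeleton (all the names below are invented for illustration). The dispatch itself is easy; detect_affinity_api() is exactly the piece we don't know how to write:

    #include <stddef.h>      /* NULL */
    #include <sys/types.h>   /* pid_t */

    /* One wrapper per API flavor, each compiled in its own file against
       the matching prototype (same kind of preprocessor trickery as above). */
    extern int set_affinity_flavor1(pid_t pid, int cpu);
    extern int set_affinity_flavor2(pid_t pid, int cpu);
    extern int set_affinity_flavor3(pid_t pid, int cpu);

    /* The open question: how to implement this reliably at run time. */
    extern int detect_affinity_api(void);

    typedef int (*set_affinity_fn_t)(pid_t, int);
    static set_affinity_fn_t set_affinity_fn = NULL;

    int set_processor_affinity(pid_t pid, int cpu)
    {
        if (NULL == set_affinity_fn) {
            switch (detect_affinity_api()) {
            case 1:  set_affinity_fn = set_affinity_flavor1; break;
            case 2:  set_affinity_fn = set_affinity_flavor2; break;
            default: set_affinity_fn = set_affinity_flavor3; break;
            }
        }
        return set_affinity_fn(pid, cpu);
    }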

Does anyone have any suggestions here?

Many thanks for your time.

--
{+} Jeff Squyres
{+} The Open MPI Project
{+} http://www.open-mpi.org/
