The source of this annoyance is the widely spread usage of OMPI_ENABLE_THREAD_MULTIPLE as an argument for all of the component init calls. This is obviously wrong as OMPI_ENABLE_THREAD_MULTIPLE is not about the requested support of thread support but about the less restrictive thread level supported by the library. Luckily the solution is simple, replace OMPI_ENABLE_THREAD_MULTIPLE by variable ompi_mpi_thread_multiple, and there should be no need for checking opal_using_threads in the initializers (open-mpi/ompi@343071498965a8f73d5f2b0c27a7ef404caf286c).
George. On Fri, Dec 12, 2014 at 2:58 AM, Pascal Deveze <pascal.dev...@bull.net> wrote: > George, > > > > My initial problem is that when MPI is compiled with > “--enable-mpi-thread-multiple”, the variable enable_mpi_threads is set to > 1 even if MPI_Init() is called in place of MPI_Init_thread(). > > I saw also that opal_using_threads() exists and was used by other BTLs. > > > > Maybe the solution is to find the way to set enable_mpi_threads to 0 when > MPI_Init() is called. > > > > > > *De :* devel [mailto:devel-boun...@open-mpi.org] *De la part de* George > Bosilca > *Envoyé :* vendredi 12 décembre 2014 07:03 > > *À :* Open MPI Developers > *Objet :* Re: [OMPI devel] Patch proposed: opal_set_using_threads(true) > in ompi/runtime/ompi_mpi_init.c is called to late > > > > On Thu, Dec 11, 2014 at 8:30 PM, Ralph Castain <r...@open-mpi.org> wrote: > > Just to help me understand: I don’t think this change actually changed any > behavior. However, it certainly *allows* a different behavior. Isn’t that > true? > > > > It depends how you look at this. To be extremely clear it prevents the > modules from using anything else than their arguments to decide the > provided threading model. With the current change, it is possible that some > of the modules will continue to follow this "old" behavior, while others > might switch to check opal_using_threads to see how they might behave. > > > > My point here is not that one is better than the other, just that we > inadvertently introduced a possibility for non-consistent behavior. > > > > Let me take an example. In the old scheme, the PML was allowed to run each > BTL in a separate thread, with absolutely no BTL support for thread safety. > Thus, the PML could have managed all the interactions between BTL and > requests in an atomic way, without the BTL knowing about. Now, if the BTL > make his decision based on the value returned by opal_using_threads this > approach is not possible anymore. > > > > If so, I guess the real question is for Pascal at Bull: why do you feel > this earlier setting is required? > > > > This might allow to see if using functions that require protection, such > as opal_lifo_push, will work by default or one should use directly their > atomic version? > > > > George. > > > > > > > > On Dec 11, 2014, at 4:21 PM, George Bosilca <bosi...@icl.utk.edu> wrote: > > > > The overall design in OMPI was that no OMPI module should be allowed to > decide if threads are on (thus it should not rely on the value returned by > opal_using_threads > during it's initialization stage). Instead, they should respect the level > of thread support requested as an argument during the initialization step. > > > > And this is true even for the BTLs. The PML component init function is > propagating the enable_progress_threads and enable_mpi_threads, down to > the BML, and then to the BTL. This 2 variables, enable_progress_threads and > enable_mpi_threads, are exactly what the ompi_mpi_init is using to compute > the the value of the opal) using_thread (and that this patch moved). > > > > The setting of the opal_using_threads was delayed during the > initialization to ensure that it's value was not used to select a specific > thread-level in any module, a behavior that is allowed now with the new > setting. > > > > A drastic change in behavior... > > > > George. > > > > > > On Tue, Dec 9, 2014 at 3:33 AM, Ralph Castain <r...@open-mpi.org> wrote: > > Kewl - I’ll fix. Thanks! > > > > On Dec 9, 2014, at 12:32 AM, Pascal Deveze <pascal.dev...@bull.net> > wrote: > > > > Hi Ralph, > > > > This in in the trunk. > > > > *De :* devel [mailto:devel-boun...@open-mpi.org > <devel-boun...@open-mpi.org>] *De la part de* Ralph Castain > *Envoyé :* mardi 9 décembre 2014 09:32 > *À :* Open MPI Developers > *Objet :* Re: [OMPI devel] Patch proposed: opal_set_using_threads(true) > in ompi/runtime/ompi_mpi_init.c is called to late > > > > Hi Pascal > > > > Is this in the trunk or in the 1.8 series (or both)? > > > > > > On Dec 9, 2014, at 12:28 AM, Pascal Deveze <pascal.dev...@bull.net> > wrote: > > > > > > In case where MPI is compiled with --enable-mpi-thread-multiple, a call to > opal_using_threads() always returns 0 in the routine > btl_xxx_component_init() of the BTLs, event if the application calls > MPI_Init_thread() with MPI_THREAD_MULTIPLE. > > > > This is because opal_set_using_threads(true) in > ompi/runtime/ompi_mpi_init.c is called to late. > > > > I propose the following patch that solves the problem for me: > > > > diff --git a/ompi/runtime/ompi_mpi_init.c b/ompi/runtime/ompi_mpi_init.c > > index 35509cf..c2370fc 100644 > > --- a/ompi/runtime/ompi_mpi_init.c > > +++ b/ompi/runtime/ompi_mpi_init.c > > @@ -512,6 +512,13 @@ int ompi_mpi_init(int argc, char **argv, int > requested, int *provided) > > } > > #endif > > > > + /* If thread support was enabled, then setup OPAL to allow for > > + them. */ > > + if ((OPAL_ENABLE_PROGRESS_THREADS == 1) || > > + (*provided != MPI_THREAD_SINGLE)) { > > + opal_set_using_threads(true); > > + } > > + > > /* initialize datatypes. This step should be done early as it will > > * create the local convertor and local arch used in the proc > > * init. > > @@ -724,13 +731,6 @@ int ompi_mpi_init(int argc, char **argv, int > requested, int *provided) > > goto error; > > } > > > > - /* If thread support was enabled, then setup OPAL to allow for > > - them. */ > > - if ((OPAL_ENABLE_PROGRESS_THREADS == 1) || > > - (*provided != MPI_THREAD_SINGLE)) { > > - opal_set_using_threads(true); > > - } > > - > > /* start PML/BTL's */ > > ret = MCA_PML_CALL(enable(true)); > > if( OMPI_SUCCESS != ret ) { > > _______________________________________________ > devel mailing list > de...@open-mpi.org > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel > Link to this post: > http://www.open-mpi.org/community/lists/devel/2014/12/16459.php > > > > _______________________________________________ > devel mailing list > de...@open-mpi.org > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel > > Link to this post: > http://www.open-mpi.org/community/lists/devel/2014/12/16462.php > > > > > _______________________________________________ > devel mailing list > de...@open-mpi.org > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel > Link to this post: > http://www.open-mpi.org/community/lists/devel/2014/12/16463.php > > > > _______________________________________________ > devel mailing list > de...@open-mpi.org > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel > > Link to this post: > http://www.open-mpi.org/community/lists/devel/2014/12/16516.php > > > > > _______________________________________________ > devel mailing list > de...@open-mpi.org > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel > Link to this post: > http://www.open-mpi.org/community/lists/devel/2014/12/16517.php > > > > _______________________________________________ > devel mailing list > de...@open-mpi.org > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel > Link to this post: > http://www.open-mpi.org/community/lists/devel/2014/12/16537.php >