Ok, got --disable-dlopen working again. I removed the code in question
and changed how coll/sm shares the segment data.
-Nathan
On Mon, Jul 28, 2014 at 02:41:37PM -0600, Nathan Hjelm wrote:
>
> Or pull it into coll/sm. Though I think we can do better here since
> point-to-point messaging can be
Or pull it into coll/sm. Though I think we can do better here since
point-to-point messaging can be used in coll/sm. We can use the
netpatterns code to share the segment information.
-Nathan
On Mon, Jul 28, 2014 at 08:37:15PM +, Jeff Squyres (jsquyres) wrote:
> Perhaps that RML code can go b
Perhaps that RML code can go back up in ompi/common/sm...? (since only
ompi/coll/sm uses it)
On Jul 28, 2014, at 4:34 PM, Nathan Hjelm wrote:
>
> Damn, spoke too soon. coll/sm uses it:
>
> ./ompi/mca/coll/sm/coll_sm_module.c:
> mca_common_sm_init_group(comm->c_local_group, size, ful
Damn, spoke too soon. coll/sm uses it:
./ompi/mca/coll/sm/coll_sm_module.c:
mca_common_sm_init_group(comm->c_local_group, size, fullpath,
./ompi/mca/coll/sm/coll_sm_module.c:
"coll:sm:enable:bootstrap comm (%d/%s): mca_common_sm_init_group failed",
Let me se
Looks like you are correct. The function that calls the rml code is
mca_common_sm_init which is no longer called by anything (other than
mca_common_sm_init_group.. which isn't called either). Let me see if I
can fix this. I need this build working again with --disable-dlopen.
mu-fey:/usr/projects
I'm not sure the sm actually relies on the RML any more - I thought we had
removed that dependency, though the file may not have been deleted.
On Jul 28, 2014, at 1:02 PM, Nathan Hjelm wrote:
>
> The trunk is totally broken and it might not be easy to fix. I am seeing
> this error when buildin
This has been clear from day one: everything based on RML to setup will
need to be rewritten. This is not only SM, it also related to IB.
Meanwhile, one must build with dlopen enabled in order to get access to
these calls.
George.
On Mon, Jul 28, 2014 at 4:02 PM, Nathan Hjelm wrote:
>
> The
The trunk is totally broken and it might not be easy to fix. I am seeing
this error when building with --disable-dlopen (the LANL default):
/usr/projects/hpctools/hjelmn/ompi-trunk-git/opal/mca/common/sm/common_sm_rml.c:
In function 'mca_common_sm_rml_info_bcast':
/usr/projects/hpctools/hjelmn/o