Thanks Brice,
bottom line, even if hwloc is not fully ported, it should build and ompi
should get something usable.
in this case, i have no objection removing the --without-hwloc configure
option.
you can contact me off-list regarding the FX10 specific issue
Cheers,
Gilles
On 9/4/2015 2:31 PM, Brice Goglin wrote:
Le 04/09/2015 00:36, Gilles Gouaillardet a écrit :
Ralph,
just to be clear, your proposal is to abort if openmpi is configured
with --without-hwloc, right ?
( the --with-hwloc option is not removed because we want to keep the
option of using an external hwloc library )
if I understand correctly, Paul's point is that if openmpi is ported
to a new architecture for which hwloc has not been ported yet
(embedded hwloc or external hwloc), then the very first step is to
port hwloc before ompi can be built.
did I get it right Paul ?
Brice, what would happen in such a case ?
embedded hwloc cannot be built ?
hwloc returns little or no information ?
If it's a new operating system and it supports at least things like
sysconf, you will get a Machine object with one PUs per logical processor.
If it's a new platform running Linux, they are supposed to tell Linux
at least package/core/thread information. That's what we have for ARM
for instance.
Missing topology detection can be worked around easily (with XML and
synthetic description, what we did for BlueGene/Q before adding manual
support for that specific processor). Binding support can't.
And once you get binding, you get x86-topology even if the operating
system isn't supported (using cpuid).
for example, on Fujitsu FX10 node (single socket, 16 cores), hwloc
reports 16 sockets with one core each and no cache. though this is
not correct, that can be seen as equivalent to the real config by
ompi, so this is not really an issue for ompi.
Can you help fixing this?
The issue is indeed with supercomputers with uncommon architectures
like this one.
Brice
Cheers,
Gilles
On Friday, September 4, 2015, Ralph Castain <r...@open-mpi.org
<mailto:r...@open-mpi.org>> wrote:
No - hwloc is embedded in OMPI anyway.
On Sep 3, 2015, at 11:09 AM, Paul Hargrove <phhargr...@lbl.gov
<javascript:_e(%7B%7D,'cvml','phhargr...@lbl.gov');>> wrote:
On Thu, Sep 3, 2015 at 8:03 AM, Ralph Castain <r...@open-mpi.org
<javascript:_e(%7B%7D,'cvml','r...@open-mpi.org');>> wrote:
Does anyone know of a reason why we shouldn’t do this?
Would doing this mean that a port to a new system would require
that one first perform a full hwloc port?
-Paul
--
Paul H. Hargrove phhargr...@lbl.gov
<javascript:_e(%7B%7D,'cvml','phhargr...@lbl.gov');>
Computer Languages & Systems Software (CLaSS) Group
Computer Science Department Tel: +1-510-495-2352
Lawrence Berkeley National Laboratory Fax: +1-510-486-6900
_______________________________________________
devel mailing list
de...@open-mpi.org
<javascript:_e(%7B%7D,'cvml','de...@open-mpi.org');>
Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
Link to this post:
http://www.open-mpi.org/community/lists/devel/2015/09/17942.php
_______________________________________________
devel mailing list
de...@open-mpi.org
Subscription:http://www.open-mpi.org/mailman/listinfo.cgi/devel
Link to this
post:http://www.open-mpi.org/community/lists/devel/2015/09/17952.php
_______________________________________________
devel mailing list
de...@open-mpi.org
Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
Link to this post:
http://www.open-mpi.org/community/lists/devel/2015/09/17958.php