Re: [OMPI devel] carto vs. hwloc

2009-12-16 Thread Joshua Hursey
Currently, I am working on process migration and automatic recovery based on checkpoint/restart. WRT the PML stack, this works by rewiring the BTLs after restart of the migrated/recovered MPI process(es). There is a fair amount of work in getting this right with respect to both the runtime and t

Re: [OMPI devel] carto vs. hwloc

2009-12-16 Thread Kenneth Lloyd
> -----Original Message----- > From: devel-boun...@open-mpi.org > [mailto:devel-boun...@open-mpi.org] On Behalf Of Jeff Squyres > Sent: Tuesday, December 15, 2009 6:32 PM > To: Open MPI Developers > Subject: Re: [OMPI devel] carto vs. hwloc > > On Dec 15, 2009, a

Re: [OMPI devel] carto vs. hwloc

2009-12-16 Thread George Bosilca
As far as I know, what Josh did is slightly different. In the case of a complete restart (where all processes are restarted from a checkpoint), he sets up and rewires a new set of BTLs. However, we do have some code to rewire the MPI processes in case of failure(s) in one of UTK pro

Re: [OMPI devel] carto vs. hwloc

2009-12-15 Thread Ralph Castain
On Dec 15, 2009, at 6:31 PM, Jeff Squyres wrote: > On Dec 15, 2009, at 2:20 PM, Ralph Castain wrote: > >> It probably should be done at a lower level, but it begs a different >> question. For example, I've created the capability in the new cluster >> manager to detect interfaces that are lost

Re: [OMPI devel] carto vs. hwloc

2009-12-15 Thread Jeff Squyres
On Dec 15, 2009, at 2:20 PM, Ralph Castain wrote: > It probably should be done at a lower level, but it begs a different > question. For example, I've created the capability in the new cluster > manager to detect interfaces that are lost, ride through the problem by > moving affected procs to

Re: [OMPI devel] carto vs. hwloc

2009-12-15 Thread Ralph Castain
o occur IMO. > > Anyways, I think keeping dynamic changes in mind is well worth it, but starting from a static position and moving there seems to make a lot of sense. > > --td >> Ken Lloyd >> >> >>> -----Original Message----- >>> From: devel-boun.

Re: [OMPI devel] carto vs. hwloc

2009-12-15 Thread Terry Dontje
[mailto:devel-boun...@open-mpi.org] On Behalf Of Jeff Squyres Sent: Monday, December 14, 2009 6:47 PM To: Open MPI Developers List Subject: Re: [OMPI devel] carto vs. hwloc I had a nice chat with Ralph this afternoon about this topic. He pointed out a few things to me: - I had forgotten (ahem) th

Re: [OMPI devel] carto vs. hwloc

2009-12-15 Thread Kenneth Lloyd
14, 2009 6:47 PM > To: Open MPI Developers List > Subject: Re: [OMPI devel] carto vs. hwloc > > I had a nice chat with Ralph this afternoon about this topic. > > He pointed out a few things to me: > > - I had forgotten (ahem) that carto has weights associated >

Re: [OMPI devel] carto vs. hwloc

2009-12-14 Thread Jeff Squyres
I had a nice chat with Ralph this afternoon about this topic. He pointed out a few things to me: - I had forgotten (ahem) that carto has weights associated with each of its edges (and that's kind of a defining feature). hwloc, at present, does not. So perhaps hwloc would not initially replace

[OMPI devel] carto vs. hwloc

2009-12-14 Thread Jeff Squyres
Question for everyone (possibly a topic for tomorrow's call...): hwloc is evolving into a fairly nice package. It's not ready for inclusion into Open MPI yet, but it's getting there. I predict it will come in somewhere early in the 1.5 series (potentially not 1.5.0, though). hwloc will provid