Currently, I am working on process migration and automatic recovery based on
checkpoint/restart. WRT the PML stack, this works by rewiring the BTLs after
restart of the migrated/recovered MPI process(es). There is a fair amount of
work in getting this right with respect to both the runtime and t
> -Original Message-
> From: devel-boun...@open-mpi.org
> [mailto:devel-boun...@open-mpi.org] On Behalf Of Jeff Squyres
> Sent: Tuesday, December 15, 2009 6:32 PM
> To: Open MPI Developers
> Subject: Re: [OMPI devel] carto vs. hwloc
>
> On Dec 15, 2009, a
As far as I know what Josh did is slightly different. In the case of a complete
restart (where all processes are restarted from a checkpoint), he setup and
rewire a new set of BTLs.
However, it happens that we do have some code to rewire the MPI processes in
case of failure(s) in one of UTK pro
On Dec 15, 2009, at 6:31 PM, Jeff Squyres wrote:
> On Dec 15, 2009, at 2:20 PM, Ralph Castain wrote:
>
>> It probably should be done at a lower level, but it begs a different
>> question. For example, I've created the capability in the new cluster
>> manager to detect interfaces that are lost
On Dec 15, 2009, at 2:20 PM, Ralph Castain wrote:
> It probably should be done at a lower level, but it begs a different
> question. For example, I've created the capability in the new cluster
> manager to detect interfaces that are lost, ride through the problem by
> moving affected procs to
o occur IMO.
>
> Anyways, I think keeping in mind dynamic changes is well worth it but it
> seems to start moving there from a static position makes a lot of sense.
>
> --td
>> Ken Lloyd
>>
>>
>>> -Original Message-----
>>> From: devel-boun.
[mailto:devel-boun...@open-mpi.org] On Behalf Of Jeff Squyres
Sent: Monday, December 14, 2009 6:47 PM
To: Open MPI Developers List
Subject: Re: [OMPI devel] carto vs. hwloc
I had a nice chat with Ralph this afternoon about this topic.
He pointed out a few things to me:
- I had forgotten (ahem) th
14, 2009 6:47 PM
> To: Open MPI Developers List
> Subject: Re: [OMPI devel] carto vs. hwloc
>
> I had a nice chat with Ralph this afternoon about this topic.
>
> He pointed out a few things to me:
>
> - I had forgotten (ahem) that carto has weights associated
>
I had a nice chat with Ralph this afternoon about this topic.
He pointed out a few things to me:
- I had forgotten (ahem) that carto has weights associated with each of its
edges (and that's kind of a defining feature). hwloc, at present, does not.
So perhaps hwloc would not initially replace
Question for everyone (possibly a topic for tomorrow's call...):
hwloc is evolving into a fairly nice package. It's not ready for inclusion
into Open MPI yet, but it's getting there. I predict it will come in somewhere
early in the 1.5 series (potentially not 1.5.0, though). hwloc will provid
10 matches
Mail list logo