The ft_event() function that you mentioned is part of the larger fault
tolerance infrastructure in Open MPI. You need to make sure to enable
it before using (if it is not enabled many of the ft_event functions
default to NULL). Add '--with-ft=cr' to your ./configure line and that
will enabl
Additionally, I believe that the FT system already does something like what you
describe (although perhaps not exactly the same thing) -- there is a phase
where the FT system pauses and quiesces all BTLs.
Did you look at that part of the code, perchance, and see if it meets your
needs?
On Jan
Thanks a lot for your help! I will give it a try.
Christoph
Ralph Castain schrieb:
> You've got this a tad wrong, but that's okay - let me try to clarify a couple
> of things that may help.
>
> First, you don't want to add this as a separate orted command. As you noted,
> orte has no direct wa
You've got this a tad wrong, but that's okay - let me try to clarify a couple
of things that may help.
First, you don't want to add this as a separate orted command. As you noted,
orte has no direct way to tell the OMPI layer to do anything. Instead, you want
to pass a message to the process th
Hi again,
Maybe I should give more specific information with some code snippets...
Currently I added
#define ORTE_DAEMON_BTL_CTL_CMD (orte_daemon_cmd_flag_t) 26
to odls_types.h to identify if I want to trigger the BTL pause.
In process_commands() of orted/orted_comm.c this flag is processed fir