Re: [OMPI devel] Open MPI v4.0.1: Process is hanging inside MPI_Init() when debugged with TotalView

2019-11-13 Thread Ralph Castain via devel
Just want to clarify my remarks to avoid any misunderstanding. I'm not in any way saying MPIR or the debugger are at fault here, nor was I trying to imply that PMIx-based tools are somehow "superior" to MPIR-based ones. My point was solely focused on the question of reliability. The MPIR-based

Re: [OMPI devel] Open MPI v4.0.1: Process is hanging inside MPI_Init() when debugged with TotalView

2019-11-13 Thread Ralph Castain via devel
Agreed and understood. My point was only that I'm not convinced the problem was "fixed" as it is entirely consistent with your findings for the race condition to still exist, but be biased so strongly that it now "normally" passes. Without determining the precise code that causes things to hang

Re: [OMPI devel] Open MPI v4.0.1: Process is hanging inside MPI_Init() when debugged with TotalView

2019-11-13 Thread John DelSignore via devel
Hi Ralph, I assume you are referring to your previous email, where you wrote: Personally, I have never been entirely comfortable with the claim that the PMIx modification was the solution to the problem being discussed here. We have never seen a report of an application hanging in that spot