Re: [OMPI devel] v1.7.5rc4 is posted

2014-03-19 Thread Ralph Castain
Sadly, rc4 was DOA due to a configure error under Linux; please see rc5 at the same location. On Mar 19, 2014, at 4:25 PM, Ralph Castain wrote: > Hi folks > > We have posted the final release candidate for v1.7.5: > > http://www.open-mpi.org/software/ompi/v1.7/ > > PLEASE give it a quick sm

[OMPI devel] v1.7.5rc4 is posted

2014-03-19 Thread Ralph Castain
Hi folks, we have posted the final release candidate for v1.7.5: http://www.open-mpi.org/software/ompi/v1.7/ PLEASE give it a quick smoke test. We plan to release Thurs afternoon (Pacific time) unless we hear of immediate show-stopper problems. Thanks, Ralph

Re: [OMPI devel] 1.7.5 and trunk failures

2014-03-19 Thread Jeff Squyres (jsquyres)
For the list: we figured this out. These neighbor tests require np>=4 (whew!). I added minimum np checks to the tests so that they'll skip (exit 77) if np<4. Nathan and I worked through the other three tests. On Mar 18, 2014, at 11:22 PM, Ralph Castain wrote: > Just to be safe, I blew away
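The minimum-np check described in this message can be sketched roughly as follows. This is an illustrative sketch only, not the code that was added to the tests: the function name `exit_code_for_np` and the constants are hypothetical, and a real MPI test would get the process count from the communicator size instead of a plain argument. Exit status 77 is the value the Automake test harness treats as "skipped".

```python
SKIP_EXIT_CODE = 77  # exit status named in the message; Automake reads it as "skipped"
MIN_NP = 4           # minimum process count stated in the message


def exit_code_for_np(np_count, min_np=MIN_NP):
    """Return the skip status if too few processes are available, else 0.

    Hypothetical helper: np_count stands in for the value a real test
    would obtain from the size of its MPI communicator.
    """
    if np_count < min_np:
        return SKIP_EXIT_CODE
    return 0
```

A test would evaluate this check before doing any neighbor communication and exit with the returned status when it is non-zero, so a run with fewer than 4 processes is reported as skipped instead of failed.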

[OMPI devel] Fwd: [CRIU] Open MPI and CRIU stdout/stderr

2014-03-19 Thread Jeff Squyres (jsquyres)
Forwarding because Pavel wasn't subscribed to the devel list, so his message was rejected. Begin forwarded message: > From: Pavel Emelyanov > Subject: Re: [CRIU] [OMPI devel] Open MPI and CRIU stdout/stderr > Date: March 19, 2014 9:32:00 AM EDT > To: Adrian Reber > Cc: "Jeff Squyres (jsquyres)" , Ope

Re: [OMPI devel] Open MPI and CRIU stdout/stderr

2014-03-19 Thread Jeff Squyres (jsquyres)
On Mar 19, 2014, at 9:13 AM, Adrian Reber wrote: > What does Open MPI do with the file descriptors for stdout/stderr? We admittedly do funny things with stdin, stdout, and stderr... The short version is that OMPI intercepts all the stdin, stdout, and stderr from each MPI process and relays it

[OMPI devel] Open MPI and CRIU stdout/stderr

2014-03-19 Thread Adrian Reber
Cross-posting to the criu and openmpi devel mailing lists. To get fault tolerance back into Open MPI I added code to use criu as a checkpoint/restart tool. I can checkpoint a process successfully but I have trouble restarting it. CRIU currently has problems restoring the process which is probably rela