Re: [OMPI devel] openib btl - fatal errors don't abort the job

2010-09-07 Thread Shamis, Pavel
On Sep 3, 2010, at 8:14 AM, Jeff Squyres wrote:
> On Sep 1, 2010, at 4:47 PM, Steve Wise wrote:
>
>> I was wondering what the logic is behind allowing an MPI job to continue in
>> the presence of a fatal qp error?
>
> It's a feature...?
The idea was that in some near future we will be able to

Re: [OMPI devel] openib btl - fatal errors don't abort the job

2010-09-03 Thread Jeff Squyres
On Sep 1, 2010, at 4:47 PM, Steve Wise wrote:
> I was wondering what the logic is behind allowing an MPI job to continue in
> the presence of a fatal qp error?
It's a feature...?
> Note the "will try to continue" sentence:
>
> ---

[OMPI devel] openib btl - fatal errors don't abort the job

2010-09-01 Thread Steve Wise
I was wondering what the logic is behind allowing an MPI job to continue in the presence of a fatal qp error? Note the "will try to continue" sentence:
--
The OpenFabrics stack has reported a network error event. Open MPI