Re: [OMPI devel] Changed behaviour with PSM on master

2015-01-11 Thread Howard Pritchard
eads) > > Am I missing anything? > > Thanks in advance, > Yohann > > -Original Message- > From: devel [mailto:devel-boun...@open-mpi.org] On Behalf Of Jeff Squyres > (jsquyres) > Sent: Friday, January 09, 2015 1:27 PM > To: Open MPI Developers List > Subject

Re: [OMPI devel] Changed behaviour with PSM on master

2015-01-10 Thread Adrian Reber
https://github.com/open-mpi/ompi/issues/340 On Fri, Jan 09, 2015 at 01:12:34PM +0100, Adrian Reber wrote: > Running the mpi_test_suite on master used to work with no problems. At > some point in time it stopped working however and now I get only error > messages from PSM: > > """ > n050301:3.0.I

Re: [OMPI devel] Changed behaviour with PSM on master

2015-01-09 Thread Burette, Yohann
anged behaviour with PSM on master +1 -- someone should file a bug. I think Intel needs to decide how they want to handle this (e.g., whether the PSM MTL or OFI MTL should be the default, and how the other can detect if it's not the default and therefore it's safe to call psm_init... or somet

Re: [OMPI devel] Changed behaviour with PSM on master

2015-01-09 Thread Jeff Squyres (jsquyres)
+1 -- someone should file a bug. I think Intel needs to decide how they want to handle this (e.g., whether the PSM MTL or OFI MTL should be the default, and how the other can detect if it's not the default and therefore it's safe to call psm_init... or something like that). On Jan 9, 2015, at

Re: [OMPI devel] Changed behaviour with PSM on master

2015-01-09 Thread Howard Pritchard
HI Adrian, Please open an issue. We don't want users having to explicitly specify the mtl to use just to get a job to run on a intel/infinipath system. Howard 2015-01-09 13:04 GMT-07:00 Adrian Reber : > Should I still open a ticket? Will these be changed or do I always have > to provide '--mca

Re: [OMPI devel] Changed behaviour with PSM on master

2015-01-09 Thread Ralph Castain
I suspect it will have to be fixed at some point. > On Jan 9, 2015, at 12:04 PM, Adrian Reber wrote: > > Should I still open a ticket? Will these be changed or do I always have > to provide '--mca mtl psm' in the future? > > On Fri, Jan 09, 2015 at 12:27:59PM -0700, Howard Pritchard wrote: >> H

Re: [OMPI devel] Changed behaviour with PSM on master

2015-01-09 Thread Adrian Reber
Should I still open a ticket? Will these be changed or do I always have to provide '--mca mtl psm' in the future? On Fri, Jan 09, 2015 at 12:27:59PM -0700, Howard Pritchard wrote: > HI Adrian, Andrew, > > Sorry try again, both the libfabric psm provider and the open mpi psm > mtl are trying to u

Re: [OMPI devel] Changed behaviour with PSM on master

2015-01-09 Thread Howard Pritchard
HI Adrian, Andrew, Sorry try again, both the libfabric psm provider and the open mpi psm mtl are trying to use psm_init. So, to avoid this problem, add --mca mtl psm to your mpirun command line. Sorry for the confusion. Howard 2015-01-09 7:52 GMT-07:00 Friedley, Andrew : > No this is not

Re: [OMPI devel] Changed behaviour with PSM on master

2015-01-09 Thread Howard Pritchard
HI Folks, Sorry for my stupidity. I now see the problem. App is calling pmi_init twice because of the new ofiwg libfabric mtl. You can try mpirun blah blah blah --mca btl and things should work. Howard 2015-01-09 7:52 GMT-07:00 Friedley, Andrew : > No this is not expected behavior. > > T

Re: [OMPI devel] Changed behaviour with PSM on master

2015-01-09 Thread Howard Pritchard
Hi Adrian and Andrew, I"m able to reproduce your problem on one of our qlogic clusters. We are using PSM 1.14 and slurm. I'm noticing that for some reason in our setup the ORTE_MCA_orte_precondition_transports env. variable is not being set. Could you run your test with --mca odls_base_verbose

Re: [OMPI devel] Changed behaviour with PSM on master

2015-01-09 Thread Friedley, Andrew
No this is not expected behavior. The PSM MTL code has not changed in 2 months, when I fixed that unused variable warning for you. That suggests something above the PSM MTL broke things. I see no reason your older software install should suddenly stopping working if all you are updating is OM