[OMPI devel] MTT failures

2015-02-18 Thread Howard Pritchard
Hi Folks

I noticed that the NERSC (carver/edison) MTT smoke tests are failing now.
I also see a lot of
ivy cluster runs are also failing.  All the nersc runs are failing with:

c1479:05071] OPAL ERROR: Bad parameter in file util/attr.c at line 431
[c1479:05071] [[57033,0],0] ORTE_ERROR_LOG: Bad parameter in file
util/attr.c at line 57
[c1479:05071] Signal: Segmentation fault (11)
[c1479:05071] Signal code: Address not mapped (1)
[c1479:05071] *** End of error message ***

the mpirun command line is


mpirun --bind-to none -np 32 --mca coll ^ml --mca btl
self,vader,openib  --prefix
/global/u2/h/hpp/mtt_carver_tmp/installs/8v68/install ./c_hello


Before people begin blaming this as a cray thing, this is from the
NERSC carver system which is an ibm dataplex system running redhat and
using MLNX connectX HCAs.

Anyone else seeing these failures?

Howard


Re: [OMPI devel] MTT failures

2015-02-18 Thread Ralph Castain
You’re almost 14 hours out-of-date Howard - it was fixed last night

> On Feb 18, 2015, at 10:39 AM, Howard Pritchard  wrote:
> 
> Hi Folks
> 
> I noticed that the NERSC (carver/edison) MTT smoke tests are failing now.  I 
> also see a lot of 
> ivy cluster runs are also failing.  All the nersc runs are failing with:
> 
> c1479:05071] OPAL ERROR: Bad parameter in file util/attr.c at line 431
> [c1479:05071] [[57033,0],0] ORTE_ERROR_LOG: Bad parameter in file util/attr.c 
> at line 57
> [c1479:05071] Signal: Segmentation fault (11)
> [c1479:05071] Signal code: Address not mapped (1)
> [c1479:05071] *** End of error message ***
> the mpirun command line is
> 
> mpirun --bind-to none -np 32 --mca coll ^ml --mca btl self,vader,openib  
> --prefix
> /global/u2/h/hpp/mtt_carver_tmp/installs/8v68/install ./c_hello 
> 
> Before people begin blaming this as a cray thing, this is from the NERSC 
> carver system which is an ibm dataplex system running redhat and using MLNX 
> connectX HCAs.
> Anyone else seeing these failures?
> Howard
> 
> 
> ___
> devel mailing list
> de...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> Link to this post: 
> http://www.open-mpi.org/community/lists/devel/2015/02/16992.php