Re: [OMPI devel] barrier problem

2012-03-23 Thread Shamis, Pavel
Pavel, Mvapich implements multicore optimized collectives, which perform substantially better than default algorithms. FYI, ORNL team works on new high performance collectives framework for OMPI. The framework provides significant boost in collectives performance. Regards, Pavel (Pasha) Shami

[OMPI devel] Remove Portals support?

2012-03-23 Thread Barrett, Brian W
Hi all - This is not an RFC, but more a question for the community. Is anyone still actively using the Portals MTL/BTLs? We're not at Sandia. I know ORNL was using it at one point. SNL probably can't do much in the way of support anymore, so if no one wants them, it might make sense to remo

Re: [OMPI devel] [EXTERNAL] Re: [OMPI svn] svn:open-mpi r26180

2012-03-23 Thread Barrett, Brian W
Gah; sorry about that. I thought I had tested that code path, but it appears not. Stupid flexibility of Open MPI :). Jeff's correct, it's supposed to be MPI_SOURCE. Thanks, Jeff! Brian On Mar 22, 2012, at 6:50 PM, Ralph Castain wrote: > Thanks! > > On Mar 22, 2012, at 6:12 PM, Jeffrey Squ

Re: [OMPI devel] barrier problem

2012-03-23 Thread Pavel Mezentsev
I've been comparing 1.5.4 and 1.5.5rc3 with the same parameters that's why I didn't use --bind-to-core. I checked and the usage of --bind-to-core improved the result comparing to 1.5.4: #repetitions t_min[usec] t_max[usec] t_avg[usec] 100084.9685.0885.02 So I gu

Re: [OMPI devel] barrier problem

2012-03-23 Thread Ralph Castain
I don't see where you told OMPI to --bind-to-core. We don't automatically bind, so you have to explicitly tell us to do so. On Mar 23, 2012, at 6:20 AM, Pavel Mezentsev wrote: > Hello > > I'm doing some testing with IMB and dicovered a strange thing: > > Since I have a system with new AMD opte

[OMPI devel] barrier problem

2012-03-23 Thread Pavel Mezentsev
Hello I'm doing some testing with IMB and dicovered a strange thing: Since I have a system with new AMD opteron 6276 processors I'm using 1.5.5rc3 since it supports binding to cores. But when I run the barrier test form intel mpi benchmarks, the best I get is: #repetitions t_min[usec] t_max[us