Re: [OMPI devel] 1.10.3rc MTT failures

2016-04-26 Thread Gilles Gouaillardet

Jeff,

fwiw, it is possible to save some bandwidth (~4x) with the --depth option


full clone:

git clone https://ggouaillar...@github.com/open-mpi/ompi-tests.git
Cloning into 'ompi-tests'...
remote: Counting objects: 32016, done.
remote: Total 32016 (delta 0), reused 0 (delta 0), pack-reused 32016
Receiving objects: 100% (32016/32016), 61.31 MiB | 645.00 KiB/s, done.
Resolving deltas: 100% (20719/20719), done.
Checking out files: 100% (9221/9221), done.


last commit only:

git clone --depth=1 https://ggouaillar...@github.com/open-mpi/ompi-tests.git
Cloning into 'ompi-tests'...
remote: Counting objects: 10687, done.
remote: Compressing objects: 100% (4667/4667), done.
remote: Total 10687 (delta 4972), reused 9595 (delta 4477), pack-reused 0
Receiving objects: 100% (10687/10687), 13.29 MiB | 673.00 KiB/s, done.
Resolving deltas: 100% (4972/4972), done.
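
For reference, the ~4x figure can be checked against the two "Receiving objects" lines above. This is only a small sketch: the helper name `received_mib` is made up for illustration, and it assumes git reports the transfer size in MiB as it does in this output.

```python
import re

def received_mib(line: str) -> float:
    """Extract the transferred size in MiB from a git 'Receiving objects' line."""
    match = re.search(r"([\d.]+) MiB", line)
    if match is None:
        raise ValueError("no MiB figure found in: " + line)
    return float(match.group(1))

# The two lines are copied from the clone output above.
full = received_mib("Receiving objects: 100% (32016/32016), 61.31 MiB | 645.00 KiB/s, done.")
shallow = received_mib("Receiving objects: 100% (10687/10687), 13.29 MiB | 673.00 KiB/s, done.")

ratio = full / shallow
print(f"full clone / shallow clone = {ratio:.1f}x")
```

The saving depends on how much history a repository carries, so the ratio for another repo may differ a lot.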

Cheers,

Gilles

On 4/26/2016 12:03 AM, Jeff Squyres (jsquyres) wrote:

> On Apr 25, 2016, at 9:50 AM, Gilles Gouaillardet wrote:
>
> > and fwiw, Jeff uses an internally mirrored repo for ompi-tests, so Cisco
> > clusters should use the latest test suites.
>
> Correct.  My local git mirrors update nightly.
>
> FWIW: This made a *huge* difference when we were using SVN for ompi-tests.  An
> individual SVN checkout across the network was reeely slow; it was
> *significantly* faster to do a local SVN checkout.
>
> I'm sure it's still faster to do a local git clone, but I don't know offhand
> what the amount of speedup is compared to a github.com clone of ompi-tests.





Re: [OMPI devel] 1.10.3rc MTT failures

2016-04-25 Thread Jeff Squyres (jsquyres)
On Apr 25, 2016, at 9:50 AM, Gilles Gouaillardet wrote:
> 
> and fwiw, Jeff uses an internally mirrored repo for ompi-tests, so Cisco 
> clusters should use the latest test suites.

Correct.  My local git mirrors update nightly.

FWIW: This made a *huge* difference when we were using SVN for ompi-tests.  An 
individual SVN checkout across the network was reeely slow; it was 
*significantly* faster to do a local SVN checkout.

I'm sure it's still faster to do a local git clone, but I don't know offhand 
what the amount of speedup is compared to a github.com clone of ompi-tests.

-- 
Jeff Squyres
jsquy...@cisco.com
For corporate legal information go to: 
http://www.cisco.com/web/about/doing_business/legal/cri/



Re: [OMPI devel] 1.10.3rc MTT failures

2016-04-25 Thread Ralph Castain
FWIW: it looks like some message is being sent down to the child procs on 
termination, and that is what causes that issue. I’m not sure where it comes 
from, but it is probably due to the restoration of the usock OOB component.


> On Apr 25, 2016, at 7:25 AM, Josh Hursey  wrote:
> 
> IBM had a stale version of ompi-tests. I have sync'ed that repo, and will try 
> again later today.
> 
> The loop spawn error will take some digging. I'll see what we can find.
> 
> On Mon, Apr 25, 2016 at 9:14 AM, Gilles Gouaillardet 
> <gilles.gouaillar...@gmail.com> wrote:
> This is a known bug that is being discussed at 
> https://github.com/open-mpi/ompi/pull/1473/commits/0d1431f02c6b2876cdeee4fd783d6b6807dfff2a
> 
> it affects big-endian machines or builds using 8-byte Fortran integers
> 
> Cheers,
> 
> Gilles
> 
> 
> On Monday, April 25, 2016, Adrian Reber wrote:
> Errors like that (Win::Get_attr: Got wrong value for disp unit) are from
> my ppc64 machine: https://mtt.open-mpi.org/index.php?do_redir=2295 
> 
> 
> The MTT setup is checking out the tests from github directly:
> 
> [Test get: ibm]
> module = SCM
> scm_module = Git
> scm_url = https://github.com/open-mpi/ompi-tests.git
> scm_subdir = ibm
> 
> Not sure Ralph meant those errors. But they only happen on ppc64 and not
> on x86_64 with a very similar mtt configuration file.
> 
> Adrian
> 
> On Mon, Apr 25, 2016 at 10:50:03PM +0900, Gilles Gouaillardet wrote:
> > Cisco mtt looks clean
> > since the ompi-tests repo is private, it cannot be automatically pulled unless
> > a password is saved (https) or a public key was uploaded to github (ssh)
> > for that reason, I would not simply assume the latest test suite is used :-(
> > and fwiw, Jeff uses an internally mirrored repo for ompi-tests, so Cisco
> > clusters should use the latest test suites.
> >
> > Geoffrey,
> > can you please comment on the config of the ibm cluster ?
> >
> > Cheers,
> >
> > Gilles
> >
> > On Monday, April 25, 2016, Ralph Castain wrote:
> >
> > > I don’t know - this isn’t on my machine, but rather in the weekend and
> > > nightly MTT reports. I’m assuming folks are running the latest test suite,
> > > but...
> > >
> > >
> > > On Apr 25, 2016, at 6:20 AM, Gilles Gouaillardet <
> > > gilles.gouaillar...@gmail.com> wrote:
> > >
> > > Ralph,
> > >
> > > can you make sure the ibm test suite is up to date ?
> > > I pushed a fix for datatypes a few days ago, and it should be fine now.
> > >
> > > I will double check this tomorrow anyway
> > >
> > > Cheers,
> > >
> > > Gilles
> > >
> > > On Monday, April 25, 2016, Ralph Castain wrote:
> > >
> > >> I’m seeing some consistent errors in the 1.10.3rc MTT results and would
> > >> appreciate it if folks could check them out:
> > >>
> > >> ONESIDED:
> > >> onesided/cxx_win_attr:
> > >> [**ERROR**]: MPI_COMM_WORLD rank 0, file cxx_win_attr.cc:50:
> > >> Win::Get_attr: Got wrong value for disp unit
> > >> [**ERROR**]: MPI_COMM_WORLD rank 1, file cxx_win_attr.cc:50:
> > >> Win::Get_attr: Got wrong value for disp
> > >>
> > >>
> > >> DATATYPE:
> > >> datatype/predefined-datatype-name
> > >> MPI_LONG_LONG!= MPI_LONG_LONG_INT
> > >>
> > >>
> > >> LOOP SPAWN:
> > >> too many retries sending message to , giving up
> > >>
> > >> Thanks
> > >> Ralph
> > >>
> > >> ___
> > >> devel mailing list
> > >> de...@open-mpi.org
> > >> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> > >> Link to this post:
> > >> http://www.open-mpi.org/community/lists/devel/2016/04/18809.php
> > >
> > > ___
> > > devel mailing list
> > > de...@open-mpi.org
> > > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> > > Link to this post:
> > > http://www.open-mpi.org/community/lists/devel/2016/04/18810.php
> > >
> > >
> > >
> 
> > ___
> > devel mailing list
> > de...@open-mpi.org
> > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> > Link to this post:
> > http://www.open-mpi.org/community/lists/devel/2016/04/18812.php

Re: [OMPI devel] 1.10.3rc MTT failures

2016-04-25 Thread Josh Hursey
IBM had a stale version of ompi-tests. I have sync'ed that repo, and will
try again later today.

The loop spawn error will take some digging. I'll see what we can find.

On Mon, Apr 25, 2016 at 9:14 AM, Gilles Gouaillardet <
gilles.gouaillar...@gmail.com> wrote:

> This is a known bug that is being discussed at
> https://github.com/open-mpi/ompi/pull/1473/commits/0d1431f02c6b2876cdeee4fd783d6b6807dfff2a
> it affects big-endian machines or builds using 8-byte Fortran integers
>
> Cheers,
>
> Gilles
>
>
> On Monday, April 25, 2016, Adrian Reber  wrote:
>
>> Errors like that (Win::Get_attr: Got wrong value for disp unit) are from
>> my ppc64 machine: https://mtt.open-mpi.org/index.php?do_redir=2295
>>
>> The MTT setup is checking out the tests from github directly:
>>
>> [Test get: ibm]
>> module = SCM
>> scm_module = Git
>> scm_url = https://github.com/open-mpi/ompi-tests.git
>> scm_subdir = ibm
>>
>> Not sure Ralph meant those errors. But they only happen on ppc64 and not
>> on x86_64 with a very similar mtt configuration file.
>>
>> Adrian
>>
>> On Mon, Apr 25, 2016 at 10:50:03PM +0900, Gilles Gouaillardet wrote:
>> > Cisco mtt looks clean
>> > since the ompi-tests repo is private, it cannot be automatically pulled unless
>> > a password is saved (https) or a public key was uploaded to github (ssh)
>> > for that reason, I would not simply assume the latest test suite is used :-(
>> > and fwiw, Jeff uses an internally mirrored repo for ompi-tests, so Cisco
>> > clusters should use the latest test suites.
>> >
>> > Geoffrey,
>> > can you please comment on the config of the ibm cluster ?
>> >
>> > Cheers,
>> >
>> > Gilles
>> >
>> > On Monday, April 25, 2016, Ralph Castain wrote:
>> >
>> > > I don’t know - this isn’t on my machine, but rather in the weekend and
>> > > nightly MTT reports. I’m assuming folks are running the latest test suite,
>> > > but...
>> > >
>> > >
>> > > On Apr 25, 2016, at 6:20 AM, Gilles Gouaillardet <
>> > > gilles.gouaillar...@gmail.com> wrote:
>> > >
>> > > Ralph,
>> > >
>> > > can you make sure the ibm test suite is up to date ?
>> > > I pushed a fix for datatypes a few days ago, and it should be fine now.
>> > >
>> > > I will double check this tomorrow anyway
>> > >
>> > > Cheers,
>> > >
>> > > Gilles
>> > >
>> > > On Monday, April 25, 2016, Ralph Castain  wrote:
>> > >
>> > >> I’m seeing some consistent errors in the 1.10.3rc MTT results and would
>> > >> appreciate it if folks could check them out:
>> > >>
>> > >> ONESIDED:
>> > >> onesided/cxx_win_attr:
>> > >> [**ERROR**]: MPI_COMM_WORLD rank 0, file cxx_win_attr.cc:50:
>> > >> Win::Get_attr: Got wrong value for disp unit
>> > >> [**ERROR**]: MPI_COMM_WORLD rank 1, file cxx_win_attr.cc:50:
>> > >> Win::Get_attr: Got wrong value for disp
>> > >>
>> > >>
>> > >> DATATYPE:
>> > >> datatype/predefined-datatype-name
>> > >> MPI_LONG_LONG!= MPI_LONG_LONG_INT
>> > >>
>> > >>
>> > >> LOOP SPAWN:
>> > >> too many retries sending message to , giving up
>> > >>
>> > >> Thanks
>> > >> Ralph
>> > >>
>> ___
>> devel mailing list
>> de...@open-mpi.org
>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
>> Link to this post:
>> http://www.open-mpi.org/community/lists/devel/2016/04/18813.php
>
>
> ___
> devel mailing list
> de...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> Link to this post:
> http://www.open-mpi.org/community/lists/devel/2016/04/18814.php
>


Re: [OMPI devel] 1.10.3rc MTT failures

2016-04-25 Thread Ralph Castain
I beg to differ on the Cisco case - some of those errors come directly from the 
Cisco MTT.


> On Apr 25, 2016, at 6:50 AM, Gilles Gouaillardet wrote:
> 
> Cisco mtt looks clean
> since the ompi-tests repo is private, it cannot be automatically pulled unless a 
> password is saved (https) or a public key was uploaded to github (ssh)
> for that reason, I would not simply assume the latest test suite is used :-(
> and fwiw, Jeff uses an internally mirrored repo for ompi-tests, so Cisco 
> clusters should use the latest test suites.
> 
> Geoffrey,
> can you please comment on the config of the ibm cluster ?
> 
> Cheers,
> 
> Gilles
> 
> On Monday, April 25, 2016, Ralph Castain wrote:
> I don’t know - this isn’t on my machine, but rather in the weekend and 
> nightly MTT reports. I’m assuming folks are running the latest test suite, 
> but...
> 
> 
>> On Apr 25, 2016, at 6:20 AM, Gilles Gouaillardet wrote:
>> 
>> Ralph,
>> 
>> can you make sure the ibm test suite is up to date ?
>> I pushed a fix for datatypes a few days ago, and it should be fine now.
>> 
>> I will double check this tomorrow anyway
>> 
>> Cheers,
>> 
>> Gilles
>> 
>> On Monday, April 25, 2016, Ralph Castain wrote:
>> I’m seeing some consistent errors in the 1.10.3rc MTT results and would 
>> appreciate it if folks could check them out:
>> 
>> ONESIDED:
>> onesided/cxx_win_attr:
>> [**ERROR**]: MPI_COMM_WORLD rank 0, file cxx_win_attr.cc:50:
>> Win::Get_attr: Got wrong value for disp unit
>> [**ERROR**]: MPI_COMM_WORLD rank 1, file cxx_win_attr.cc:50:
>> Win::Get_attr: Got wrong value for disp
>> 
>> 
>> DATATYPE:
>> datatype/predefined-datatype-name
>> MPI_LONG_LONG!= MPI_LONG_LONG_INT
>> 
>> 
>> LOOP SPAWN:
>> too many retries sending message to , giving up
>> 
>> Thanks
>> Ralph
>> 



Re: [OMPI devel] 1.10.3rc MTT failures

2016-04-25 Thread Gilles Gouaillardet
This is a known bug that is being discussed at
https://github.com/open-mpi/ompi/pull/1473/commits/0d1431f02c6b2876cdeee4fd783d6b6807dfff2a
it affects big-endian machines or builds using 8-byte Fortran integers
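
As a generic illustration of why a bug like this can stay hidden on x86_64 and still show up on ppc64: if a value is stored in 8 bytes but read back as a 4-byte int from the same address, little-endian machines happen to see the right number and big-endian machines see the high-order (zero) half. This is only a sketch of that class of bug. Whether it is the exact mechanism behind the pull request above is an assumption, and `read_as_int32` and `DISP_UNIT` are made-up names.

```python
import struct

def read_as_int32(value: int, byte_order: str) -> int:
    """Store `value` as a 64-bit integer, then read a 32-bit int from its start address."""
    stored = struct.pack(byte_order + "q", value)            # 8-byte storage
    (narrow,) = struct.unpack(byte_order + "i", stored[:4])  # 4-byte read at the same address
    return narrow

DISP_UNIT = 4  # made-up example value, not taken from the test suite

for label, byte_order in (("little endian", "<"), ("big endian", ">")):
    print(f"{label}: stored {DISP_UNIT}, read back {read_as_int32(DISP_UNIT, byte_order)}")
```

The little-endian read returns 4 and the big-endian read returns 0, which is the kind of "wrong value" a test would only catch on big-endian hardware.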

Cheers,

Gilles

On Monday, April 25, 2016, Adrian Reber  wrote:

> Errors like that (Win::Get_attr: Got wrong value for disp unit) are from
> my ppc64 machine: https://mtt.open-mpi.org/index.php?do_redir=2295
>
> The MTT setup is checking out the tests from github directly:
>
> [Test get: ibm]
> module = SCM
> scm_module = Git
> scm_url = https://github.com/open-mpi/ompi-tests.git
> scm_subdir = ibm
>
> Not sure Ralph meant those errors. But they only happen on ppc64 and not
> on x86_64 with a very similar mtt configuration file.
>
> Adrian
>
> On Mon, Apr 25, 2016 at 10:50:03PM +0900, Gilles Gouaillardet wrote:
> > Cisco mtt looks clean
> > since the ompi-tests repo is private, it cannot be automatically pulled unless
> > a password is saved (https) or a public key was uploaded to github (ssh)
> > for that reason, I would not simply assume the latest test suite is used :-(
> > and fwiw, Jeff uses an internally mirrored repo for ompi-tests, so Cisco
> > clusters should use the latest test suites.
> >
> > Geoffrey,
> > can you please comment on the config of the ibm cluster ?
> >
> > Cheers,
> >
> > Gilles
> >
> > On Monday, April 25, 2016, Ralph Castain wrote:
> >
> > > I don’t know - this isn’t on my machine, but rather in the weekend and
> > > nightly MTT reports. I’m assuming folks are running the latest test suite,
> > > but...
> > >
> > >
> > > On Apr 25, 2016, at 6:20 AM, Gilles Gouaillardet <
> > > gilles.gouaillar...@gmail.com> wrote:
> > >
> > > Ralph,
> > >
> > > can you make sure the ibm test suite is up to date ?
> > > I pushed a fix for datatypes a few days ago, and it should be fine now.
> > >
> > > I will double check this tomorrow anyway
> > >
> > > Cheers,
> > >
> > > Gilles
> > >
> > > On Monday, April 25, 2016, Ralph Castain wrote:
> > >
> > >> I’m seeing some consistent errors in the 1.10.3rc MTT results and would
> > >> appreciate it if folks could check them out:
> > >>
> > >> ONESIDED:
> > >> onesided/cxx_win_attr:
> > >> [**ERROR**]: MPI_COMM_WORLD rank 0, file cxx_win_attr.cc:50:
> > >> Win::Get_attr: Got wrong value for disp unit
> > >> [**ERROR**]: MPI_COMM_WORLD rank 1, file cxx_win_attr.cc:50:
> > >> Win::Get_attr: Got wrong value for disp
> > >>
> > >>
> > >> DATATYPE:
> > >> datatype/predefined-datatype-name
> > >> MPI_LONG_LONG!= MPI_LONG_LONG_INT
> > >>
> > >>
> > >> LOOP SPAWN:
> > >> too many retries sending message to , giving up
> > >>
> > >> Thanks
> > >> Ralph
> > >>


Re: [OMPI devel] 1.10.3rc MTT failures

2016-04-25 Thread Adrian Reber
Errors like that (Win::Get_attr: Got wrong value for disp unit) are from
my ppc64 machine: https://mtt.open-mpi.org/index.php?do_redir=2295

The MTT setup is checking out the tests from github directly:

[Test get: ibm]
module = SCM
scm_module = Git
scm_url = https://github.com/open-mpi/ompi-tests.git
scm_subdir = ibm
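
To audit which "Test get" sections pull straight from GitHub rather than from a local mirror that could be stale, a configuration like the one above can be scanned. This is a rough sketch that assumes Python's `configparser` can read the file; MTT has its own INI parser, so real MTT files may use syntax this does not handle. `find_test_sources` and `MTT_SNIPPET` are made-up names.

```python
import configparser

# Section copied from the MTT configuration quoted above.
MTT_SNIPPET = """
[Test get: ibm]
module = SCM
scm_module = Git
scm_url = https://github.com/open-mpi/ompi-tests.git
scm_subdir = ibm
"""

def find_test_sources(ini_text: str) -> dict:
    """Map each 'Test get' section name to the URL it fetches tests from."""
    parser = configparser.ConfigParser()
    parser.read_string(ini_text)
    return {
        section: parser[section]["scm_url"]
        for section in parser.sections()
        if section.startswith("Test get:") and "scm_url" in parser[section]
    }

sources = find_test_sources(MTT_SNIPPET)
for section, url in sources.items():
    origin = "GitHub directly" if "github.com" in url else "a mirror or other host"
    print(f"[{section}] pulls from {origin}: {url}")
```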

Not sure Ralph meant those errors. But they only happen on ppc64 and not
on x86_64 with a very similar mtt configuration file.

Adrian

On Mon, Apr 25, 2016 at 10:50:03PM +0900, Gilles Gouaillardet wrote:
> Cisco mtt looks clean
> since the ompi-tests repo is private, it cannot be automatically pulled unless
> a password is saved (https) or a public key was uploaded to github (ssh)
> for that reason, I would not simply assume the latest test suite is used :-(
> and fwiw, Jeff uses an internally mirrored repo for ompi-tests, so Cisco
> clusters should use the latest test suites.
> 
> Geoffrey,
> can you please comment on the config of the ibm cluster ?
> 
> Cheers,
> 
> Gilles
> 
> On Monday, April 25, 2016, Ralph Castain wrote:
> 
> > I don’t know - this isn’t on my machine, but rather in the weekend and
> > nightly MTT reports. I’m assuming folks are running the latest test suite,
> > but...
> >
> >
> > On Apr 25, 2016, at 6:20 AM, Gilles Gouaillardet <
> > gilles.gouaillar...@gmail.com> wrote:
> >
> > Ralph,
> >
> > can you make sure the ibm test suite is up to date ?
> > I pushed a fix for datatypes a few days ago, and it should be fine now.
> >
> > I will double check this tomorrow anyway
> >
> > Cheers,
> >
> > Gilles
> >
> > On Monday, April 25, 2016, Ralph Castain  wrote:
> >
> >> I’m seeing some consistent errors in the 1.10.3rc MTT results and would
> >> appreciate it if folks could check them out:
> >>
> >> ONESIDED:
> >> onesided/cxx_win_attr:
> >> [**ERROR**]: MPI_COMM_WORLD rank 0, file cxx_win_attr.cc:50:
> >> Win::Get_attr: Got wrong value for disp unit
> >> [**ERROR**]: MPI_COMM_WORLD rank 1, file cxx_win_attr.cc:50:
> >> Win::Get_attr: Got wrong value for disp
> >>
> >>
> >> DATATYPE:
> >> datatype/predefined-datatype-name
> >> MPI_LONG_LONG!= MPI_LONG_LONG_INT
> >>
> >>
> >> LOOP SPAWN:
> >> too many retries sending message to , giving up
> >>
> >> Thanks
> >> Ralph
> >>


[OMPI devel] 1.10.3rc MTT failures

2016-04-25 Thread Gilles Gouaillardet
Cisco mtt looks clean
since the ompi-tests repo is private, it cannot be automatically pulled unless
a password is saved (https) or a public key was uploaded to github (ssh)
for that reason, I would not simply assume the latest test suite is used :-(
and fwiw, Jeff uses an internally mirrored repo for ompi-tests, so Cisco
clusters should use the latest test suites.

Geoffrey,
can you please comment on the config of the ibm cluster ?

Cheers,

Gilles

On Monday, April 25, 2016, Ralph Castain wrote:

> I don’t know - this isn’t on my machine, but rather in the weekend and
> nightly MTT reports. I’m assuming folks are running the latest test suite,
> but...
>
>
> On Apr 25, 2016, at 6:20 AM, Gilles Gouaillardet <
> gilles.gouaillar...@gmail.com> wrote:
>
> Ralph,
>
> can you make sure the ibm test suite is up to date ?
> I pushed a fix for datatypes a few days ago, and it should be fine now.
>
> I will double check this tomorrow anyway
>
> Cheers,
>
> Gilles
>
> On Monday, April 25, 2016, Ralph Castain  wrote:
>
>> I’m seeing some consistent errors in the 1.10.3rc MTT results and would
>> appreciate it if folks could check them out:
>>
>> ONESIDED:
>> onesided/cxx_win_attr:
>> [**ERROR**]: MPI_COMM_WORLD rank 0, file cxx_win_attr.cc:50:
>> Win::Get_attr: Got wrong value for disp unit
>> [**ERROR**]: MPI_COMM_WORLD rank 1, file cxx_win_attr.cc:50:
>> Win::Get_attr: Got wrong value for disp
>>
>>
>> DATATYPE:
>> datatype/predefined-datatype-name
>> MPI_LONG_LONG!= MPI_LONG_LONG_INT
>>
>>
>> LOOP SPAWN:
>> too many retries sending message to , giving up
>>
>> Thanks
>> Ralph
>>
>
>
>


Re: [OMPI devel] 1.10.3rc MTT failures

2016-04-25 Thread Ralph Castain
I don’t know - this isn’t on my machine, but rather in the weekend and nightly 
MTT reports. I’m assuming folks are running the latest test suite, but...


> On Apr 25, 2016, at 6:20 AM, Gilles Gouaillardet wrote:
> 
> Ralph,
> 
> can you make sure the ibm test suite is up to date ?
> I pushed a fix for datatypes a few days ago, and it should be fine now.
> 
> I will double check this tomorrow anyway
> 
> Cheers,
> 
> Gilles
> 
> On Monday, April 25, 2016, Ralph Castain wrote:
> I’m seeing some consistent errors in the 1.10.3rc MTT results and would 
> appreciate it if folks could check them out:
> 
> ONESIDED:
> onesided/cxx_win_attr:
> [**ERROR**]: MPI_COMM_WORLD rank 0, file cxx_win_attr.cc:50:
> Win::Get_attr: Got wrong value for disp unit
> [**ERROR**]: MPI_COMM_WORLD rank 1, file cxx_win_attr.cc:50:
> Win::Get_attr: Got wrong value for disp
> 
> 
> DATATYPE:
> datatype/predefined-datatype-name
> MPI_LONG_LONG!= MPI_LONG_LONG_INT
> 
> 
> LOOP SPAWN:
> too many retries sending message to , giving up
> 
> Thanks
> Ralph
> 



Re: [OMPI devel] 1.10.3rc MTT failures

2016-04-25 Thread Gilles Gouaillardet
Ralph,

can you make sure the ibm test suite is up to date ?
I pushed a fix for datatypes a few days ago, and it should be fine now.

I will double check this tomorrow anyway

Cheers,

Gilles

On Monday, April 25, 2016, Ralph Castain  wrote:

> I’m seeing some consistent errors in the 1.10.3rc MTT results and would
> appreciate it if folks could check them out:
>
> ONESIDED:
> onesided/cxx_win_attr:
> [**ERROR**]: MPI_COMM_WORLD rank 0, file cxx_win_attr.cc:50:
> Win::Get_attr: Got wrong value for disp unit
> [**ERROR**]: MPI_COMM_WORLD rank 1, file cxx_win_attr.cc:50:
> Win::Get_attr: Got wrong value for disp
>
>
> DATATYPE:
> datatype/predefined-datatype-name
> MPI_LONG_LONG!= MPI_LONG_LONG_INT
>
>
> LOOP SPAWN:
> too many retries sending message to , giving up
>
> Thanks
> Ralph
>


[OMPI devel] 1.10.3rc MTT failures

2016-04-25 Thread Ralph Castain
I’m seeing some consistent errors in the 1.10.3rc MTT results and would 
appreciate it if folks could check them out:

ONESIDED:
onesided/cxx_win_attr:
[**ERROR**]: MPI_COMM_WORLD rank 0, file cxx_win_attr.cc:50:
Win::Get_attr: Got wrong value for disp unit
[**ERROR**]: MPI_COMM_WORLD rank 1, file cxx_win_attr.cc:50:
Win::Get_attr: Got wrong value for disp


DATATYPE:
datatype/predefined-datatype-name 
MPI_LONG_LONG!= MPI_LONG_LONG_INT


LOOP SPAWN:
too many retries sending message to , giving up

Thanks
Ralph