Re: [OMPI devel] Hangs on the trunk

2014-07-14 Thread Nathan Hjelm
Found the typo. Fixed in r32232

-Nathan

On Mon, Jul 14, 2014 at 02:44:06PM -0600, Nathan Hjelm wrote:
> Yeah. I think it is a problem I introduced. I am trying to track it down
> now. Probably messed up one of the conversions in ob1.
> 
> -Nathan
> 
> On Mon, Jul 14, 2014 at 01:38:38PM -0700, Rolf vandeVaart wrote:
> >I have noticed that I am seeing some tests hang on the trunk.  For
> >example:
> > 
> > 
> > 
> >$ mpirun --mca btl_tcp_if_include eth0 --host
> >drossetti-ivy0,drossetti-ivy1 -np 2 --mca pml ob1 --mca btl sm,tcp,self
> >--mca coll_mdisable_allgather 1 --mca btl_openib_warn_default_gid_prefix > > 0
> >send
> > 
> > 
> > 
> >It is not unusual for this test to take several minutes, particularly on
> >slow networks.
> > 
> >Please be patient.
> > 
> >NOTICE: Using max message size: 10485760
> > 
> >Progress: [=  
> > 
> > 
> > 
> >Is anyone else seeing this?  (This is really a hang in spite of the
> >message saying it should take a few minutes)
> > 
> > 
> > 
> >This started with the changes Nathan did for renaming the descriptor
> >fields - r32196 through r32202.
> > 
> >From what I can tell, it looks like it hangs the second time the
> >rendezvous protocol is used to send the data.
> > 
> > 
> > 
> >Rolf
> > 
> >  --
> > 
> >This email message is for the sole use of the intended recipient(s) and
> >may contain confidential information.  Any unauthorized review, use,
> >disclosure or distribution is prohibited.  If you are not the intended
> >recipient, please contact the sender by reply email and destroy all 
> > copies
> >of the original message.
> > 
> >  --
> 
> > ___
> > devel mailing list
> > de...@open-mpi.org
> > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> > Link to this post: 
> > http://www.open-mpi.org/community/lists/devel/2014/07/15136.php
> 



> ___
> devel mailing list
> de...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> Link to this post: 
> http://www.open-mpi.org/community/lists/devel/2014/07/15137.php



pgp30g_5poMEM.pgp
Description: PGP signature


Re: [OMPI devel] Hangs on the trunk

2014-07-14 Thread Nathan Hjelm
Yeah. I think it is a problem I introduced. I am trying to track it down
now. Probably messed up one of the conversions in ob1.

-Nathan

On Mon, Jul 14, 2014 at 01:38:38PM -0700, Rolf vandeVaart wrote:
>I have noticed that I am seeing some tests hang on the trunk.  For
>example:
> 
> 
> 
>$ mpirun --mca btl_tcp_if_include eth0 --host
>drossetti-ivy0,drossetti-ivy1 -np 2 --mca pml ob1 --mca btl sm,tcp,self
>--mca coll_mdisable_allgather 1 --mca btl_openib_warn_default_gid_prefix 0
>send
> 
> 
> 
>It is not unusual for this test to take several minutes, particularly on
>slow networks.
> 
>Please be patient.
> 
>NOTICE: Using max message size: 10485760
> 
>Progress: [=  
> 
> 
> 
>Is anyone else seeing this?  (This is really a hang in spite of the
>message saying it should take a few minutes)
> 
> 
> 
>This started with the changes Nathan did for renaming the descriptor
>fields - r32196 through r32202.
> 
>From what I can tell, it looks like it hangs the second time the
>rendezvous protocol is used to send the data.
> 
> 
> 
>Rolf
> 
>  --
> 
>This email message is for the sole use of the intended recipient(s) and
>may contain confidential information.  Any unauthorized review, use,
>disclosure or distribution is prohibited.  If you are not the intended
>recipient, please contact the sender by reply email and destroy all copies
>of the original message.
> 
>  --

> ___
> devel mailing list
> de...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> Link to this post: 
> http://www.open-mpi.org/community/lists/devel/2014/07/15136.php



pgpghCbh7lgyD.pgp
Description: PGP signature


[OMPI devel] Hangs on the trunk

2014-07-14 Thread Rolf vandeVaart
I have noticed that I am seeing some tests hang on the trunk.  For example:

$ mpirun --mca btl_tcp_if_include eth0 --host drossetti-ivy0,drossetti-ivy1 -np 
2 --mca pml ob1 --mca btl sm,tcp,self --mca coll_mdisable_allgather 1 --mca 
btl_openib_warn_default_gid_prefix 0 send

It is not unusual for this test to take several minutes, particularly on slow 
networks.
Please be patient.
NOTICE: Using max message size: 10485760
Progress: [=

Is anyone else seeing this?  (This is really a hang in spite of the message 
saying it should take a few minutes)

This started with the changes Nathan did for renaming the descriptor fields - 
r32196 through r32202.
>From what I can tell, it looks like it hangs the second time the rendezvous 
>protocol is used to send the data.

Rolf

---
This email message is for the sole use of the intended recipient(s) and may 
contain
confidential information.  Any unauthorized review, use, disclosure or 
distribution
is prohibited.  If you are not the intended recipient, please contact the 
sender by
reply email and destroy all copies of the original message.
---