Re: [OMPI devel] Hangs on the trunk
Found the typo. Fixed in r32232 -Nathan On Mon, Jul 14, 2014 at 02:44:06PM -0600, Nathan Hjelm wrote: > Yeah. I think it is a problem I introduced. I am trying to track it down > now. Probably messed up one of the conversions in ob1. > > -Nathan > > On Mon, Jul 14, 2014 at 01:38:38PM -0700, Rolf vandeVaart wrote: > >I have noticed that I am seeing some tests hang on the trunk. For > >example: > > > > > > > >$ mpirun --mca btl_tcp_if_include eth0 --host > >drossetti-ivy0,drossetti-ivy1 -np 2 --mca pml ob1 --mca btl sm,tcp,self > >--mca coll_mdisable_allgather 1 --mca btl_openib_warn_default_gid_prefix > > 0 > >send > > > > > > > >It is not unusual for this test to take several minutes, particularly on > >slow networks. > > > >Please be patient. > > > >NOTICE: Using max message size: 10485760 > > > >Progress: [= > > > > > > > >Is anyone else seeing this? (This is really a hang in spite of the > >message saying it should take a few minutes) > > > > > > > >This started with the changes Nathan did for renaming the descriptor > >fields - r32196 through r32202. > > > >From what I can tell, it looks like it hangs the second time the > >rendezvous protocol is used to send the data. > > > > > > > >Rolf > > > > -- > > > >This email message is for the sole use of the intended recipient(s) and > >may contain confidential information. Any unauthorized review, use, > >disclosure or distribution is prohibited. If you are not the intended > >recipient, please contact the sender by reply email and destroy all > > copies > >of the original message. > > > > -- > > > ___ > > devel mailing list > > de...@open-mpi.org > > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel > > Link to this post: > > http://www.open-mpi.org/community/lists/devel/2014/07/15136.php > > ___ > devel mailing list > de...@open-mpi.org > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel > Link to this post: > http://www.open-mpi.org/community/lists/devel/2014/07/15137.php pgp30g_5poMEM.pgp Description: PGP signature
Re: [OMPI devel] Hangs on the trunk
Yeah. I think it is a problem I introduced. I am trying to track it down now. Probably messed up one of the conversions in ob1. -Nathan On Mon, Jul 14, 2014 at 01:38:38PM -0700, Rolf vandeVaart wrote: >I have noticed that I am seeing some tests hang on the trunk. For >example: > > > >$ mpirun --mca btl_tcp_if_include eth0 --host >drossetti-ivy0,drossetti-ivy1 -np 2 --mca pml ob1 --mca btl sm,tcp,self >--mca coll_mdisable_allgather 1 --mca btl_openib_warn_default_gid_prefix 0 >send > > > >It is not unusual for this test to take several minutes, particularly on >slow networks. > >Please be patient. > >NOTICE: Using max message size: 10485760 > >Progress: [= > > > >Is anyone else seeing this? (This is really a hang in spite of the >message saying it should take a few minutes) > > > >This started with the changes Nathan did for renaming the descriptor >fields - r32196 through r32202. > >From what I can tell, it looks like it hangs the second time the >rendezvous protocol is used to send the data. > > > >Rolf > > -- > >This email message is for the sole use of the intended recipient(s) and >may contain confidential information. Any unauthorized review, use, >disclosure or distribution is prohibited. If you are not the intended >recipient, please contact the sender by reply email and destroy all copies >of the original message. > > -- > ___ > devel mailing list > de...@open-mpi.org > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel > Link to this post: > http://www.open-mpi.org/community/lists/devel/2014/07/15136.php pgpghCbh7lgyD.pgp Description: PGP signature
[OMPI devel] Hangs on the trunk
I have noticed that I am seeing some tests hang on the trunk. For example: $ mpirun --mca btl_tcp_if_include eth0 --host drossetti-ivy0,drossetti-ivy1 -np 2 --mca pml ob1 --mca btl sm,tcp,self --mca coll_mdisable_allgather 1 --mca btl_openib_warn_default_gid_prefix 0 send It is not unusual for this test to take several minutes, particularly on slow networks. Please be patient. NOTICE: Using max message size: 10485760 Progress: [= Is anyone else seeing this? (This is really a hang in spite of the message saying it should take a few minutes) This started with the changes Nathan did for renaming the descriptor fields - r32196 through r32202. >From what I can tell, it looks like it hangs the second time the rendezvous >protocol is used to send the data. Rolf --- This email message is for the sole use of the intended recipient(s) and may contain confidential information. Any unauthorized review, use, disclosure or distribution is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy all copies of the original message. ---