Re: [OMPI devel] [OMPI svn] svn:open-mpi r25476

2011-11-18 Thread George Bosilca
On Nov 18, 2011, at 07:49 , Ralph Castain wrote: > That's a condition which should never be reached, but just to be safe, I have > added a "bozo check" that will cause the routine to error out with a message > if that situation occurs. I have tried everything - hostfile, dash-host, > bizarre co

Re: [OMPI devel] [OMPI svn] svn:open-mpi r25476

2011-11-18 Thread Ralph Castain
On Nov 18, 2011, at 3:28 AM, George Bosilca wrote: > I fixed some of the issues (nearly infinite loops on the orted due to non > initialized values), but not the one in rmaps. Clearly, the code in > ramps_base_ranking.c (the while loop starting with "while (cnt < > jdata->num_procs))" reach an

Re: [OMPI devel] [OMPI svn] svn:open-mpi r25476

2011-11-18 Thread George Bosilca
I fixed some of the issues (nearly infinite loops on the orted due to non initialized values), but not the one in rmaps. Clearly, the code in ramps_base_ranking.c (the while loop starting with "while (cnt < jdata->num_procs))" reach an infinite loop as soon as no node->procs exists, as there is

Re: [OMPI devel] [OMPI svn] svn:open-mpi r25476

2011-11-17 Thread Ralph Castain
I can't get it to fail, even with hostfile arguments. I'll try again in the morning. On Nov 17, 2011, at 8:49 PM, George Bosilca wrote: > Maybe the issue is generated by how the hostile is specified. I used > orte_default_hostfile= in my mca-params.conf. > > george. > > On Nov 17, 2011, at 1

Re: [OMPI devel] [OMPI svn] svn:open-mpi r25476

2011-11-17 Thread George Bosilca
Maybe the issue is generated by how the hostile is specified. I used orte_default_hostfile= in my mca-params.conf. george. On Nov 17, 2011, at 19:17 , Ralph Castain wrote: > I'm still building on odin, but will check there again to see if I can > replicate - perhaps something didn't get comm

Re: [OMPI devel] [OMPI svn] svn:open-mpi r25476

2011-11-17 Thread Ralph Castain
On Nov 17, 2011, at 8:13 PM, George Bosilca wrote: > I have a fresh checkout. In your example where are your hosts coming from? > How do you specify the hostile? The hosts are coming from the slurm allocation, though I also tried adding -host arguments. The error you describe comes well after

Re: [OMPI devel] [OMPI svn] svn:open-mpi r25476

2011-11-17 Thread George Bosilca
I have a fresh checkout. In your example where are your hosts coming from? How do you specify the hostile? george. On Nov 17, 2011, at 19:06 , Ralph Castain wrote: > Hmmm...well, things seem to work just fine for me: > > [rhc@odin ~/ompi-hwloc]$ mpirun -np 2 -bynode -mca plm rsh hostname > o