Re: [OMPI devel] orte question

2011-07-27 Thread Ralph Castain
Hmmm...I'm not seeing that behavior. I get a 0 exit code every time. You'll get a 243 if there are stale session directories lying around, as it indicates that the mpiruns in those dirs are not reachable. Perhaps that is what's happening? On Jul 27, 2011, at 3:14 PM, Greg Watson wrote: > Ral
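
A tool that wraps ompi-ps could tell the two cases described above apart by looking at the exit status. The following is only a rough sketch: the value 243 and its meaning are taken from the message above, while the function names and the category labels are hypothetical and chosen just for illustration. The demo uses the Python interpreter as a stand-in command so that it runs without Open MPI installed.

```python
import subprocess
import sys

# Exit status reported in this thread when stale session directories point
# at mpiruns that can no longer be reached (taken from the message above).
STALE_SESSION_EXIT_CODE = 243


def classify_exit_code(code):
    """Map an exit status to a coarse, hypothetical category label."""
    if code == 0:
        return "ok"
    if code == STALE_SESSION_EXIT_CODE:
        return "unreachable-mpirun"
    return "other-failure"


def run_and_classify(cmd):
    """Run cmd (a list of arguments) and classify its exit status."""
    completed = subprocess.run(
        cmd, stdout=subprocess.DEVNULL, stderr=subprocess.DEVNULL
    )
    return classify_exit_code(completed.returncode)


if __name__ == "__main__":
    # Stand-in command that simply exits with status 243.
    demo_cmd = [sys.executable, "-c", "import sys; sys.exit(243)"]
    print(run_and_classify(demo_cmd))
```

In real use the stand-in command would be replaced by the actual ompi-ps invocation; a caller seeing the "unreachable-mpirun" label might then check for leftover session directories.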

Re: [OMPI devel] orte question

2011-07-27 Thread Ralph Castain
Hmmm...no, can't imagine why. I'll fix - thanks! On Jul 27, 2011, at 3:14 PM, Greg Watson wrote: > Ralph, > > Looking good so far. I did notice that ompi-ps always seems to have an exit > code of 243. Is that on purpose? > > Greg > > On Jul 25, 2011, at 4:44 PM, Ralph Castain wrote: > >> r2

Re: [OMPI devel] orte question

2011-07-27 Thread Greg Watson
Ralph, Looking good so far. I did notice that ompi-ps always seems to have an exit code of 243. Is that on purpose? Greg On Jul 25, 2011, at 4:44 PM, Ralph Castain wrote: > r24944 - let me know how it works! > > > On Jul 25, 2011, at 1:01 PM, Greg Watson wrote: > >> That would probably be m

Re: [OMPI devel] orte question

2011-07-25 Thread Ralph Castain
r24944 - let me know how it works! On Jul 25, 2011, at 1:01 PM, Greg Watson wrote: > That would probably be more intuitive. > > Thanks, > Greg > > On Jul 25, 2011, at 2:28 PM, Ralph Castain wrote: > >> job 0 is mpirun and its daemons - I can have it ignore that job as I doubt >> users care :

Re: [OMPI devel] orte question

2011-07-25 Thread Greg Watson
That would probably be more intuitive. Thanks, Greg On Jul 25, 2011, at 2:28 PM, Ralph Castain wrote: > job 0 is mpirun and its daemons - I can have it ignore that job as I doubt > users care :-) > > On Jul 25, 2011, at 12:25 PM, Greg Watson wrote: > >> Ralph, >> >> The output format looks g

Re: [OMPI devel] orte question

2011-07-25 Thread Ralph Castain
job 0 is mpirun and its daemons - I can have it ignore that job as I doubt users care :-) On Jul 25, 2011, at 12:25 PM, Greg Watson wrote: > Ralph, > > The output format looks good, but I'm not sure it's quite correct. If I run > the mpirun command, I see the following: > > mpirun:47520:num n

Re: [OMPI devel] orte question

2011-07-25 Thread Greg Watson
Ralph, The output format looks good, but I'm not sure it's quite correct. If I run the mpirun command, I see the following: mpirun:47520:num nodes:1:num jobs:2 jobid:0:state:RUNNING:slots:0:num procs:0 jobid:1:state:RUNNING:slots:1:num procs:4 process:x:rank:0:pid:47522:node:greg.local:state:SYN
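
The colon-separated output quoted above lends itself to a very small parser. The sketch below assumes that each record sits on its own line and that fields strictly alternate between key and value; neither is confirmed by the thread, since the archive flattens line breaks. The function names are hypothetical, and the sample contains only the three records that appear complete in the message above.

```python
def parse_record(line):
    """Split one 'key:value:key:value' line into a dict of strings.

    Assumes fields alternate key, value and that no value contains ':'.
    """
    fields = line.strip().split(":")
    if len(fields) % 2 != 0:
        raise ValueError("odd number of fields in record: %r" % line)
    return dict(zip(fields[0::2], fields[1::2]))


def parse_output(text):
    """Parse every non-empty line of the output into a list of dicts."""
    return [parse_record(line) for line in text.splitlines() if line.strip()]


# Records copied from the message above, one per line (assumed layout).
SAMPLE = """\
mpirun:47520:num nodes:1:num jobs:2
jobid:0:state:RUNNING:slots:0:num procs:0
jobid:1:state:RUNNING:slots:1:num procs:4
"""

records = parse_output(SAMPLE)

if __name__ == "__main__":
    for record in records:
        print(record)
```

A value that itself contains a colon would break this simple split, so a consumer of the real output may need a stricter format definition than the one guessed here.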

Re: [OMPI devel] orte question

2011-07-23 Thread Ralph Castain
On Jul 23, 2011, at 5:04 PM, Ashley Pittman wrote: > > On 23 Jul 2011, at 03:55, Ralph Castain wrote: >>> c) A more easily parsable output format from ompi-ps. It doesn't need to be >>> a full blown XML format, just something like the following would suffice: >>> >>> jobid:719585280:state:Runn

Re: [OMPI devel] orte question

2011-07-23 Thread Ashley Pittman
On 23 Jul 2011, at 03:55, Ralph Castain wrote: >> c) A more easily parsable output format from ompi-ps. It doesn't need to be >> a full blown XML format, just something like the following would suffice: >> >> jobid:719585280:state:Running:slots:1:num procs:4 >> process_name:./x:rank:0:pid:3082:n

Re: [OMPI devel] orte question

2011-07-23 Thread Ralph Castain
Okay, you should have it in r24929. Use: orte-ps --parseable to get the new output. On Jul 23, 2011, at 11:43 AM, Ralph Castain wrote: > Gar - have to eat my words a bit. The jobid requested by orte-ps is just the > "local" jobid - i.e., it is expecting you to provide a number from 0-N, as I

Re: [OMPI devel] orte question

2011-07-23 Thread Ralph Castain
Gar - have to eat my words a bit. The jobid requested by orte-ps is just the "local" jobid - i.e., it is expecting you to provide a number from 0-N, as I described below (copied here): > A jobid of 1 indicates the primary application, 2 and above would specify > comm_spawned jobs. Not providi

Re: [OMPI devel] orte question

2011-07-22 Thread Ralph Castain
On Jul 22, 2011, at 3:57 PM, Greg Watson wrote: > Hi Ralph, > > I'd like three things :-) > > a) A --report-jobid option that prints the jobid on the first line in a form > that can be passed to the -jobid option on ompi-ps. Probably tagging it in > the output if -tag-output is enabled (e.g.

Re: [OMPI devel] orte question

2011-07-22 Thread Greg Watson
Hi Ralph, I'd like three things :-) a) A --report-jobid option that prints the jobid on the first line in a form that can be passed to the -jobid option on ompi-ps. Probably tagging it in the output if -tag-output is enabled (e.g. jobid:) would be a good idea. b) The orte-ps command output to

Re: [OMPI devel] orte question

2011-07-22 Thread Ralph Castain
Hmmm...well, it looks like we could have made this nicer than we did :-/ If you add --report-uri to the mpirun command line, you'll get back the uri for that mpirun. This has the form of :. As the -h option indicates: -report-uri | --report-uri Printout URI on stdou