Ralph,
Is there something I need to do to enable stdout/err encapsulation
(apart from -xml)? Here's what I see:
$ mpirun -mca orte_show_resolved_nodenames 1 -xml -display-map -np 5 /Users/greg/Documents/workspace1/testMPI/Debug/testMPI
<map>
<host name="Jarrah.local" slots="8" max_slots="0">
<noderesolve resolved="node0"/>
<noderesolve resolved="node1"/>
<noderesolve resolved="node2"/>
<noderesolve resolved="node3"/>
<noderesolve resolved="node4"/>
<noderesolve resolved="node5"/>
<noderesolve resolved="node6"/>
<noderesolve resolved="node7"/>
<process rank="0"/>
<process rank="1"/>
<process rank="2"/>
<process rank="3"/>
<process rank="4"/>
</host>
</map>
n = 0
n = 0
n = 0
n = 0
n = 0
On Jan 15, 2009, at 1:13 PM, Ralph Castain wrote:
Okay, it is in the trunk as of r20284 - I'll file the request to
have it moved to 1.3.1.
Let me know if you get a chance to test the stdout/err stuff in the
trunk - we should try and iterate it so any changes can make 1.3.1
as well.
Thanks!
Ralph
On Jan 15, 2009, at 11:03 AM, Greg Watson wrote:
Ralph,
I think the second form would be ideal and would simplify things
greatly.
Greg
On Jan 15, 2009, at 10:53 AM, Ralph Castain wrote:
Here is what I was able to do - note that the resolve messages are
associated with the specific hostname, not the overall map:
<map>
<host name="graywolf54.lanl.gov" slots="1" max_slots="0">
<noderesolve name="graywolf54.lanl.gov" resolved="localhost"/>
<process rank="0"/>
<process rank="1"/>
<process rank="2"/>
</host>
</map>
Will that work for you? If you like, I can remove the name= field
from the noderesolve element since the info is specific to the
host element that contains it. In other words, I can make it look
like this:
<map>
<host name="graywolf54.lanl.gov" slots="1" max_slots="0">
<noderesolve resolved="localhost"/>
<process rank="0"/>
<process rank="1"/>
<process rank="2"/>
</host>
</map>
if that would help.
Ralph
On Jan 14, 2009, at 7:57 AM, Ralph Castain wrote:
We -may- be able to do a more formal XML output at some point.
The problem will be the natural interleaving of stdout/err from
the various procs due to the async behavior of MPI. Mpirun
receives fragmented output in the forwarding system, limited by
the buffer sizes and the amount of data we can read at any one
"bite" from the pipes connecting us to the procs. So even though
the user -thinks- they output a single large line of stuff, it
may show up at mpirun as a series of fragments. Hence, it gets
tricky to know how to put appropriate XML brackets around it.
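To illustrate, here is a minimal sketch of the kind of buffering
that per-line XML bracketing would need - purely hypothetical, not
the actual ORTE IOF code; the <stdout> element and both function
names are made up:

#include <stdio.h>
#include <string.h>
#include <unistd.h>

/* hypothetical helper: wrap one complete line in an XML element */
static void emit_xml_line(int rank, const char *line, size_t len)
{
    printf("<stdout rank=\"%d\">%.*s</stdout>\n", rank, (int)len, line);
}

/* read fragments from the pipe to one proc; emit complete lines only */
static void forward_output(int fd, int rank)
{
    char buf[4096];
    size_t used = 0;
    ssize_t n;

    while ((n = read(fd, buf + used, sizeof(buf) - used)) > 0) {
        used += (size_t)n;
        char *nl;
        /* flush each complete line; keep any trailing fragment */
        while ((nl = memchr(buf, '\n', used)) != NULL) {
            size_t len = (size_t)(nl - buf);
            emit_xml_line(rank, buf, len);
            memmove(buf, nl + 1, used - len - 1);
            used -= len + 1;
        }
        if (used == sizeof(buf)) {          /* line exceeds the buffer - */
            emit_xml_line(rank, buf, used); /* forced to fragment it     */
            used = 0;
        }
    }
}

Even that only works per-pipe: lines from different procs can still
interleave between reads, which is the async problem described above.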
Given this input about when you actually want resolved name info,
I can at least do something about that area. Won't be in 1.3.0,
but should make 1.3.1.
As for XML-tagged stdout/err: the OMPI community asked me not to
turn that feature "on" for 1.3.0 as they felt it hasn't been
adequately tested yet. The code is present, but cannot be
activated in 1.3.0. However, I believe it is activated on the
trunk when you do --xml --tagged-output, so perhaps some testing
will help us debug and validate it adequately for 1.3.1?
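For example, something like this against a trunk build (using the
flag spellings above - they may differ in the actual trunk):

$ mpirun --xml --tagged-output -np 2 ./testMPI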
Thanks
Ralph
On Jan 14, 2009, at 7:02 AM, Greg Watson wrote:
Ralph,
The only time we use the resolved names is when we get a map, so
we consider them part of the map output.
If quasi-XML is all that will ever be possible with 1.3, then
you may as well leave it as-is and we will attempt to clean it up
in Eclipse. It would be nice if a future version of ompi could
output correct XML (including stdout) as this would vastly
simplify the parsing we need to do.
Regards,
Greg
On Jan 13, 2009, at 3:30 PM, Ralph Castain wrote:
Hmmm...well, I can't do either for 1.3.0 as it is departing
this afternoon.
The first option would be very hard to do. I would have to
expose the display-map option across the code base and check it
prior to printing anything about resolving node names. I guess
I should ask: do you only want noderesolve statements when we
are displaying the map? Right now, I will output them regardless.
The second option could be done. I could check if any "display"
option has been specified, and output the <ompi> root at that
time (likewise for the end). Anything we output in-between
would be encapsulated between the two, but that would include
any user output to stdout and/or stderr - which for 1.3.0 is
not in xml.
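In pseudo-C, that check would amount to something like this (a
sketch only - the variable and function names here are illustrative,
not the real ORTE globals):

#include <stdio.h>

static int xml_root_open = 0;

/* emit the root element once if xml output is on and any "display"
 * option was requested */
static void maybe_open_xml_root(int xml_output, int any_display_opt)
{
    if (xml_output && any_display_opt && !xml_root_open) {
        printf("<ompi>\n");
        xml_root_open = 1;
    }
}

/* close the root element at shutdown, after everything in between
 * (including the procs' stdout/stderr) has been forwarded */
static void maybe_close_xml_root(void)
{
    if (xml_root_open) {
        printf("</ompi>\n");
        xml_root_open = 0;
    }
}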
Any thoughts?
Ralph
PS. Guess I should clarify that I was not striving for true XML
interaction here, but rather a quasi-XML format that would help
you to filter the output. I have no problem trying to get to
something more formally correct, but it could be tricky in some
places to achieve it due to the inherent async nature of the
beast.
On Jan 13, 2009, at 12:17 PM, Greg Watson wrote:
Ralph,
The XML is looking better now, but there is still one problem.
To be valid, there must be exactly one root element, but currently
the output has multiple top-level elements instead of a single
root. So rather than:
<noderesolve name="node0" resolved="Jarrah.local"/>
<noderesolve name="node1" resolved="Jarrah.local"/>
<map>
<host name="Jarrah.local" slots="8" max_slots="0">
<process rank="0"/>
<process rank="1"/>
<process rank="2"/>
<process rank="3"/>
<process rank="4"/>
</host>
</map>
the XML should be:
<map>
<noderesolve name="node0" resolved="Jarrah.local"/>
<noderesolve name="node1" resolved="Jarrah.local"/>
<host name="Jarrah.local" slots="8" max_slots="0">
<process rank="0"/>
<process rank="1"/>
<process rank="2"/>
<process rank="3"/>
<process rank="4"/>
</host>
</map>
or:
<ompi>
<noderesolve name="node0" resolved="Jarrah.local"/>
<noderesolve name="node1" resolved="Jarrah.local"/>
<map>
<host name="Jarrah.local" slots="8" max_slots="0">
<process rank="0"/>
<process rank="1"/>
<process rank="2"/>
<process rank="3"/>
<process rank="4"/>
</host>
</map>
</ompi>
Would either of these be possible?
Thanks,
Greg
On Dec 8, 2008, at 2:18 PM, Greg Watson wrote:
Ok thanks. I'll test from trunk in future.
Greg
On Dec 8, 2008, at 2:05 PM, Ralph Castain wrote:
Working its way around the CMR process now.
Might be easier in the future if we could test/debug this in
the trunk, though. Otherwise, the CMR procedure will fall
behind and a fix might miss a release window.
Anyway, hopefully this one will make the 1.3.0 release cutoff.
Thanks
Ralph
On Dec 8, 2008, at 9:56 AM, Greg Watson wrote:
Hi Ralph,
This is now in 1.3rc2, thanks. However, there are a couple
of problems. Here is what I see:
[Jarrah.watson.ibm.com:58957] <noderesolve name="node0"
resolved="Jarrah.watson.ibm.com">
For some reason each line is prefixed with "[...]" - any
idea why this is? Also, the element should be self-closing:
"/>" rather than ">".
Thanks,
Greg
On Nov 24, 2008, at 3:06 PM, Greg Watson wrote:
Great, thanks. I'll take a look once it comes over to 1.3.
Cheers,
Greg
On Nov 24, 2008, at 2:59 PM, Ralph Castain wrote:
Yo Greg
This is in the trunk as of r20032. I'll bring it over to
1.3 in a few days.
I implemented it as another MCA param
"orte_show_resolved_nodenames" so you can actually get
the info as you execute the job, if you want. The xml tag
is "noderesolve" - let me know if you need any changes.
Ralph
On Oct 22, 2008, at 11:55 AM, Greg Watson wrote:
Ralph,
I guess the issue for us is that we will have to run two
commands to get the information we need. One to get the
configuration information, such as version and MCA
parameters, and one to get the host information, whereas
it would seem more logical that this should all be
available via some kind of "configuration discovery"
command. I understand the issue with supplying the
hostfile though, so maybe this just points at the need
for us to separate configuration information from the
host information. In any case, we'll work with what you
think is best.
Greg
On Oct 20, 2008, at 4:49 PM, Ralph Castain wrote:
Hmmm...just to be sure we are all clear on this. The
reason we proposed to use mpirun is that "hostfile" has
no meaning outside of mpirun. That's why ompi_info
can't do anything in this regard.
We have no idea what hostfile the user may specify
until we actually get the mpirun cmd line. They may
have specified a default-hostfile, but they could also
specify hostfiles for the individual app_contexts.
These may or may not include the node upon which mpirun
is executing.
So the only way to provide you with a separate command
to get a hostfile<->nodename mapping would require you
to provide us with the default-hostfile and/or hostfile
cmd line options just as if you were issuing the mpirun
cmd. We just wouldn't launch - but it would be the
exact equivalent of doing "mpirun --do-not-launch".
Am I missing something? If so, please do correct me - I
would be happy to provide a tool if that would make it
easier. Just not sure what that tool would do.
Thanks
Ralph
On Oct 19, 2008, at 1:59 PM, Greg Watson wrote:
Ralph,
It seems a little strange to be using mpirun for this,
but barring providing a separate command, or using
ompi_info, I think this would solve our problem.
Thanks,
Greg
On Oct 17, 2008, at 10:46 AM, Ralph Castain wrote:
Sorry for the delay - had to ponder this one for a while.
Jeff and I agree that adding something to ompi_info
would not be a good idea. Ompi_info has no knowledge
or understanding of hostfiles, and adding that
capability to it would be a major distortion of its
intended use.
However, we think we can offer an alternative that
might better solve the problem. Remember, we now
treat hostfiles in a very different manner than
before - see the wiki page for a complete
description, or "man orte_hosts".
So the problem is that, to provide you with what you
want, we need to "dump" the information from whatever
default-hostfile was provided, and, if no default-
hostfile was provided, then the information from each
hostfile that was provided with an app_context.
The best way we could think of to do this is to add
another mpirun cmd line option --dump-hostfiles that
would output the line-by-line name from the hostfile
plus the name we resolved it to. Of course, --xml
would cause it to be in xml format.
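For instance, it might look something like this (entirely
hypothetical - the option doesn't exist yet, and the element names
and output layout are just a guess):

$ mpirun --dump-hostfiles --xml --hostfile myhosts
<hostfiles>
  <hostfile name="myhosts">
    <host name="node0" resolved="head.cluster.example.com"/>
  </hostfile>
</hostfiles>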
Would that meet your needs?
Ralph
On Oct 15, 2008, at 3:12 PM, Greg Watson wrote:
Hi Ralph,
We've been discussing this back and forth a bit
internally and don't really see an easy solution.
Our problem is that Eclipse is not running on the
head node, so gethostbyname will not necessarily
resolve to the same address. For example, the
hostfile might refer to the head node by an internal
network address that is not visible to the outside
world. Since gethostbyname also looks in /etc/hosts,
it may resolve locally but not on a remote system.
The only thing I can think of would be, rather than
us reading the hostfile directly as we do now, to
provide an option to ompi_info that would dump the
hostfile using the same rules that you apply when
you're using the hostfile. Would that be feasible?
Greg
On Sep 22, 2008, at 4:25 PM, Ralph Castain wrote:
Sorry for delay - was on vacation and am now trying
to work my way back to the surface.
I'm not sure I can fix this one for two reasons:
1. In general, OMPI doesn't really care what name
is used for the node. However, the problem is that
it needs to be consistent. In this case, ORTE has
already used the name returned by gethostname to
create its session directory structure long before
mpirun reads a hostfile. This is why we retain the
value from gethostname instead of allowing it to be
overwritten by the name in whatever allocation we
are given. Using the name in hostfile would require
that I either find some way to remember any prior
name, or that I tear down and rebuild the session
directory tree - neither seems attractive nor
simple (e.g., what happens when the user provides
multiple entries in the hostfile for the node, each
with a different IP address based on another
interface in that node? Sounds crazy, but we have
already seen it done - which one do I use?).
2. We don't actually store the hostfile info
anywhere - we just use it and forget it. For us to
add an XML attribute containing any hostfile-
related info would therefore require us to re-read
the hostfile. I could have it do that -only- in the
case of "XML output required", but it seems rather
ugly.
An alternative might be for you to simply do a
"gethostbyname" lookup of the IP address or
hostname to see if it matches instead of just doing
a strcmp. This is what we have to do internally as
we frequently have problems with FQDN vs. non-FQDN
vs. IP addresses etc. If the local OS hasn't cached
the IP address for the node in question it can take
a little time to DNS resolve it, but otherwise
works fine.
I can point you to the code in OPAL that we use - I
would think something similar would be easy to
implement in your code and would readily solve the
problem.
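Something along these lines (a minimal sketch using the standard
getaddrinfo() rather than the OPAL helpers; same_host is a made-up
name, and it only checks IPv4 for brevity):

#include <stdbool.h>
#include <string.h>
#include <netdb.h>
#include <netinet/in.h>
#include <sys/socket.h>

/* two names "match" if any of their resolved addresses are equal */
static bool same_host(const char *a, const char *b)
{
    struct addrinfo hints, *ra = NULL, *rb = NULL;
    bool match = false;

    memset(&hints, 0, sizeof(hints));
    hints.ai_family = AF_INET;        /* IPv4 only, to keep it short */
    hints.ai_socktype = SOCK_STREAM;

    if (getaddrinfo(a, NULL, &hints, &ra) != 0)
        return false;
    if (getaddrinfo(b, NULL, &hints, &rb) != 0) {
        freeaddrinfo(ra);
        return false;
    }
    for (struct addrinfo *pa = ra; pa && !match; pa = pa->ai_next)
        for (struct addrinfo *pb = rb; pb && !match; pb = pb->ai_next)
            if (((struct sockaddr_in *)pa->ai_addr)->sin_addr.s_addr ==
                ((struct sockaddr_in *)pb->ai_addr)->sin_addr.s_addr)
                match = true;
    freeaddrinfo(ra);
    freeaddrinfo(rb);
    return match;
}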
Ralph
On Sep 19, 2008, at 7:18 AM, Greg Watson wrote:
Ralph,
The problem we're seeing is just with the head
node. If I specify a particular IP address for the
head node in the hostfile, it gets changed to the
FQDN when displayed in the map. This is a problem
for us as we need to be able to match the two, and
since we're not necessarily running on the head
node, we can't always do the same resolution
you're doing.
Would it be possible to use the same address that
is specified in the hostfile, or alternatively
provide an XML attribute that contains this
information?
Thanks,
Greg
On Sep 11, 2008, at 9:06 AM, Ralph Castain wrote:
Not in that regard, depending upon what you mean
by "recently". The only changes I am aware of wrt
nodes consisted of some changes to the order in
which we use the nodes when specified by hostfile
or -host, and a little #if protectionism needed
by Brian for the Cray port.
Are you seeing this for every node? Reason I ask:
I can't offhand think of anything in the code
base that would replace a host name with the FQDN
because we don't get that info for remote nodes.
The only exception is the head node (where mpirun
sits) - in that lone case, we default to the name
returned to us by gethostname(). We do that
because the head node is frequently accessible on
a more global basis than the compute nodes -
thus, the FQDN is required to ensure that there
is no address confusion on the network.
If the user refers to compute nodes in a hostfile
or -host (or in an allocation from a resource
manager) by non-FQDN, we just assume they know
what they are doing and the name will correctly
resolve to a unique address.
On Sep 10, 2008, at 9:45 AM, Greg Watson wrote:
Hi,
Has the behavior of the -display-map option changed
recently in the 1.3 branch? We're now seeing the host
name as a fully resolved domain name rather than the
entry that was specified in the hostfile. Is there any
particular reason for this? If so, would it be
possible to add the hostfile entry to the output
since we need to be able to match the two?
Thanks,
Greg
_______________________________________________
devel mailing list
de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel