Re: [OMPI devel] -display-map

Greg Watson Thu, 15 Jan 2009 13:03:31 -0500

Ralph,

I think the second form would be ideal and would simplify things greatly.


Greg

On Jan 15, 2009, at 10:53 AM, Ralph Castain wrote:

Here is what I was able to do - note that the resolve messages are associated with the specific hostname, not the overall map:
<map>
        <host name="graywolf54.lanl.gov" slots="1" max_slots="0">
                <noderesolve name="graywolf54.lanl.gov" resolved="localhost"/>
                <process rank="0"/>
                <process rank="1"/>
                <process rank="2"/>
        </host>
</map>
Will that work for you? If you like, I can remove the name= field from the noderesolve element since the info is specific to the host element that contains it. In other words, I can make it look like this:
<map>
        <host name="graywolf54.lanl.gov" slots="1" max_slots="0">
                <noderesolve resolved="localhost"/>
                <process rank="0"/>
                <process rank="1"/>
                <process rank="2"/>
        </host>
</map>

if that would help.
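
For readers consuming this output, here is a minimal sketch of how a client might read the second form, where noderesolve is nested inside its host element. It uses Python's standard xml.etree.ElementTree; the helper name parse_map and the shape of the returned dictionary are assumptions made for illustration, not part of Open MPI or of the Eclipse tooling.

```python
import xml.etree.ElementTree as ET

# Sample copied from the second form shown above.
MAP_XML = """\
<map>
        <host name="graywolf54.lanl.gov" slots="1" max_slots="0">
                <noderesolve resolved="localhost"/>
                <process rank="0"/>
                <process rank="1"/>
                <process rank="2"/>
        </host>
</map>
"""


def parse_map(text):
    """Hypothetical helper: map each host name to its resolved names and ranks."""
    result = {}
    for host in ET.fromstring(text).findall("host"):
        result[host.get("name")] = {
            # The resolved name needs no name= attribute: the enclosing
            # <host> element already identifies which node it belongs to.
            "resolved": [n.get("resolved") for n in host.findall("noderesolve")],
            "ranks": [int(p.get("rank")) for p in host.findall("process")],
        }
    return result


if __name__ == "__main__":
    print(parse_map(MAP_XML))
```

Because each noderesolve sits inside its host, the client never has to match names across separate top-level elements.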

Ralph


On Jan 14, 2009, at 7:57 AM, Ralph Castain wrote:
We -may- be able to do a more formal XML output at some point. The problem will be the natural interleaving of stdout/err from the various procs due to the async behavior of MPI. Mpirun receives fragmented output in the forwarding system, limited by the buffer sizes and the amount of data we can read at any one "bite" from the pipes connecting us to the procs. So even though the user -thinks- they output a single large line of stuff, it may show up at mpirun as a series of fragments. Hence, it gets tricky to know how to put appropriate XML brackets around it.
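
To illustrate the fragmentation issue described above, here is a rough sketch of one way a forwarder could buffer fragments per stream and rank and only emit a tagged element once a full line has arrived. This is not how mpirun is implemented; the class name LineAssembler and the element and attribute names in the output are assumptions chosen for the example.

```python
from xml.sax.saxutils import escape


class LineAssembler:
    """Hypothetical buffer: collect fragments per (stream, rank), emit whole lines."""

    def __init__(self):
        self._pending = {}

    def feed(self, stream, rank, fragment):
        """Add one fragment; return the XML-wrapped lines it completed."""
        key = (stream, rank)
        data = self._pending.get(key, "") + fragment
        # Everything before the last newline is complete; the tail is kept
        # until a later fragment finishes the line.
        *complete, rest = data.split("\n")
        self._pending[key] = rest
        return [
            '<%s rank="%d">%s</%s>' % (stream, rank, escape(line), stream)
            for line in complete
        ]


if __name__ == "__main__":
    a = LineAssembler()
    print(a.feed("stdout", 0, "hello wor"))   # nothing complete yet
    print(a.feed("stdout", 1, "other\n"))     # rank 1 interleaves
    print(a.feed("stdout", 0, "ld\n"))        # rank 0 line now complete
```

The sketch sidesteps the hard cases (output with no trailing newline, very long lines, binary data), which is where the real difficulty lies.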
Given this input about when you actually want resolved name info, I can at least do something about that area. Won't be in 1.3.0, but should make 1.3.1.
As for XML-tagged stdout/err: the OMPI community asked me not to turn that feature "on" for 1.3.0 as they felt it hasn't been adequately tested yet. The code is present, but cannot be activated in 1.3.0. However, I believe it is activated on the trunk when you do --xml --tagged-output, so perhaps some testing will help us debug and validate it adequately for 1.3.1?
Thanks
Ralph


On Jan 14, 2009, at 7:02 AM, Greg Watson wrote:
Ralph,
The only time we use the resolved names is when we get a map, so we consider them part of the map output.
If quasi-XML is all that will ever be possible with 1.3, then you may as well leave as-is and we will attempt to clean it up in Eclipse. It would be nice if a future version of ompi could output correct XML (including stdout) as this would vastly simplify the parsing we need to do.
Regards,

Greg

On Jan 13, 2009, at 3:30 PM, Ralph Castain wrote:
Hmmm...well, I can't do either for 1.3.0 as it is departing this afternoon.
The first option would be very hard to do. I would have to expose the display-map option across the code base and check it prior to printing anything about resolving node names. I guess I should ask: do you only want noderesolve statements when we are displaying the map? Right now, I will output them regardless.
The second option could be done. I could check if any "display" option has been specified, and output the <ompi> root at that time (likewise for the end). Anything we output in-between would be encapsulated between the two, but that would include any user output to stdout and/or stderr - which for 1.3.0 is not in xml.
Any thoughts?

Ralph
PS. Guess I should clarify that I was not striving for true XML interaction here, but rather a quasi-XML format that would help you to filter the output. I have no problem trying to get to something more formally correct, but it could be tricky in some places to achieve it due to the inherent async nature of the beast.
On Jan 13, 2009, at 12:17 PM, Greg Watson wrote:
Ralph,
The XML is looking better now, but there is still one problem. To be valid, there needs to be only one root element, but currently you don't have any (or many). So rather than:
<noderesolve name="node0" resolved="Jarrah.local"/>
<noderesolve name="node1" resolved="Jarrah.local"/>
<map>
        <host name="Jarrah.local" slots="8" max_slots="0">
                <process rank="0"/>
                <process rank="1"/>
                <process rank="2"/>
                <process rank="3"/>
                <process rank="4"/>
        </host>
</map>

the XML should be:

<map>
        <noderesolve name="node0" resolved="Jarrah.local"/>
        <noderesolve name="node1" resolved="Jarrah.local"/>
        <host name="Jarrah.local" slots="8" max_slots="0">
                <process rank="0"/>
                <process rank="1"/>
                <process rank="2"/>
                <process rank="3"/>
                <process rank="4"/>
        </host>
</map>

or:

<ompi>
        <noderesolve name="node0" resolved="Jarrah.local"/>
        <noderesolve name="node1" resolved="Jarrah.local"/>
        <map>
                <host name="Jarrah.local" slots="8" max_slots="0">
                        <process rank="0"/>
                        <process rank="1"/>
                        <process rank="2"/>
                        <process rank="3"/>
                        <process rank="4"/>
                </host>
        </map>
</ompi>

Would either of these be possible?
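
As a quick check of the single-root point above, the following sketch feeds the current output (abbreviated to one process) to Python's standard XML parser, then wraps it in an enclosing element as in the second suggestion. The helper names is_well_formed and wrap_in_root are made up for this example.

```python
import xml.etree.ElementTree as ET

# Current output form, abbreviated: three top-level elements, no single root.
NO_ROOT = """\
<noderesolve name="node0" resolved="Jarrah.local"/>
<noderesolve name="node1" resolved="Jarrah.local"/>
<map>
        <host name="Jarrah.local" slots="8" max_slots="0">
                <process rank="0"/>
        </host>
</map>
"""


def is_well_formed(text):
    """Return True if the text parses as a single XML document."""
    try:
        ET.fromstring(text)
        return True
    except ET.ParseError:
        return False


def wrap_in_root(text, root="ompi"):
    """Hypothetical client-side fix: enclose the output in one root element."""
    return "<%s>\n%s</%s>\n" % (root, text, root)


if __name__ == "__main__":
    print(is_well_formed(NO_ROOT))                # False
    print(is_well_formed(wrap_in_root(NO_ROOT)))  # True
```

Wrapping on the client side only works if nothing else (such as untagged program output) is mixed into the stream.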

Thanks,

Greg

On Dec 8, 2008, at 2:18 PM, Greg Watson wrote:
Ok thanks. I'll test from trunk in future.

Greg

On Dec 8, 2008, at 2:05 PM, Ralph Castain wrote:
Working its way around the CMR process now.
Might be easier in the future if we could test/debug this in the trunk, though. Otherwise, the CMR procedure will fall behind and a fix might miss a release window.
Anyway, hopefully this one will make the 1.3.0 release cutoff.

Thanks
Ralph

On Dec 8, 2008, at 9:56 AM, Greg Watson wrote:
Hi Ralph,
This is now in 1.3rc2, thanks. However there are a couple of problems. Here is what I see:
[Jarrah.watson.ibm.com:58957] <noderesolve name="node0" resolved="Jarrah.watson.ibm.com">
For some reason each line is prefixed with "[...]", any idea why this is? Also the end tag should be "/>" not ">".
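
Until the prefix is removed at the source, a client could strip it before parsing. The sketch below assumes the prefix always has the form "[hostname:pid] " as in the line quoted above; that pattern is inferred from this single sample, and strip_prefix is a hypothetical helper.

```python
import re

# Assumed prefix shape, based on the one sample line: "[host:pid] ".
PREFIX = re.compile(r"^\[[^\]]+:\d+\]\s*")


def strip_prefix(line):
    """Remove a leading "[host:pid] " tag if present; otherwise return line as-is."""
    return PREFIX.sub("", line, count=1)


if __name__ == "__main__":
    sample = ('[Jarrah.watson.ibm.com:58957] '
              '<noderesolve name="node0" resolved="Jarrah.watson.ibm.com">')
    print(strip_prefix(sample))
```

This does nothing about the unterminated tag, which still has to end in "/>" before an XML parser will accept it.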
Thanks,

Greg

On Nov 24, 2008, at 3:06 PM, Greg Watson wrote:
Great, thanks. I'll take a look once it comes over to 1.3.

Cheers,

Greg

On Nov 24, 2008, at 2:59 PM, Ralph Castain wrote:
Yo Greg
This is in the trunk as of r20032. I'll bring it over to 1.3 in a few days.
I implemented it as another MCA param "orte_show_resolved_nodenames" so you can actually get the info as you execute the job, if you want. The xml tag is "noderesolve" - let me know if you need any changes.
Ralph


On Oct 22, 2008, at 11:55 AM, Greg Watson wrote:
Ralph,
I guess the issue for us is that we will have to run two commands to get the information we need. One to get the configuration information, such as version and MCA parameters, and one to get the host information, whereas it would seem more logical that this should all be available via some kind of "configuration discovery" command. I understand the issue with supplying the hostfile though, so maybe this just points at the need for us to separate configuration information from the host information. In any case, we'll work with what you think is best.
Greg

On Oct 20, 2008, at 4:49 PM, Ralph Castain wrote:
Hmmm...just to be sure we are all clear on this. The reason we proposed to use mpirun is that "hostfile" has no meaning outside of mpirun. That's why ompi_info can't do anything in this regard.
We have no idea what hostfile the user may specify until we actually get the mpirun cmd line. They may have specified a default-hostfile, but they could also specify hostfiles for the individual app_contexts. These may or may not include the node upon which mpirun is executing.
So the only way to provide you with a separate command to get a hostfile<->nodename mapping would require you to provide us with the default-hostfile and/or hostfile cmd line options just as if you were issuing the mpirun cmd. We just wouldn't launch - but it would be the exact equivalent of doing "mpirun --do-not-launch".
Am I missing something? If so, please do correct me - I would be happy to provide a tool if that would make it easier. Just not sure what that tool would do.
Thanks
Ralph


On Oct 19, 2008, at 1:59 PM, Greg Watson wrote:
Ralph,
It seems a little strange to be using mpirun for this, but barring providing a separate command, or using ompi_info, I think this would solve our problem.
Thanks,

Greg

On Oct 17, 2008, at 10:46 AM, Ralph Castain wrote:
Sorry for delay - had to ponder this one for awhile.
Jeff and I agree that adding something to ompi_info would not be a good idea. Ompi_info has no knowledge or understanding of hostfiles, and adding that capability to it would be a major distortion of its intended use.
However, we think we can offer an alternative that might better solve the problem. Remember, we now treat hostfiles in a very different manner than before - see the wiki page for a complete description, or "man orte_hosts".
So the problem is that, to provide you with what you want, we need to "dump" the information from whatever default-hostfile was provided, and, if no default-hostfile was provided, then the information from each hostfile that was provided with an app_context.
The best way we could think of to do this is to add another mpirun cmd line option --dump-hostfiles that would output the line-by-line name from the hostfile plus the name we resolved it to. Of course, --xml would cause it to be in xml format.
Would that meet your needs?

Ralph


On Oct 15, 2008, at 3:12 PM, Greg Watson wrote:
Hi Ralph,
We've been discussing this back and forth a bit internally and don't really see an easy solution. Our problem is that Eclipse is not running on the head node, so gethostbyname will not necessarily resolve to the same address. For example, the hostfile might refer to the head node by an internal network address that is not visible to the outside world. Since gethostname also looks in /etc/hosts, it may resolve locally but not on a remote system. The only thing I can think of would be, rather than us reading the hostfile directly as we do now, to provide an option to ompi_info that would dump the hostfile using the same rules that you apply when you're using the hostfile. Would that be feasible?
Greg

On Sep 22, 2008, at 4:25 PM, Ralph Castain wrote:
Sorry for delay - was on vacation and am now trying to work my way back to the surface.
I'm not sure I can fix this one for two reasons:
1. In general, OMPI doesn't really care what name is used for the node. However, the problem is that it needs to be consistent. In this case, ORTE has already used the name returned by gethostname to create its session directory structure long before mpirun reads a hostfile. This is why we retain the value from gethostname instead of allowing it to be overwritten by the name in whatever allocation we are given. Using the name in hostfile would require that I either find some way to remember any prior name, or that I tear down and rebuild the session directory tree - neither seems attractive nor simple (e.g., what happens when the user provides multiple entries in the hostfile for the node, each with a different IP address based on another interface in that node? Sounds crazy, but we have already seen it done - which one do I use?).
2. We don't actually store the hostfile info anywhere - we just use it and forget it. For us to add an XML attribute containing any hostfile-related info would therefore require us to re-read the hostfile. I could have it do that -only- in the case of "XML output required", but it seems rather ugly.
An alternative might be for you to simply do a "gethostbyname" lookup of the IP address or hostname to see if it matches instead of just doing a strcmp. This is what we have to do internally as we frequently have problems with FQDN vs. non-FQDN vs. IP addresses etc. If the local OS hasn't cached the IP address for the node in question it can take a little time to DNS resolve it, but otherwise works fine.
I can point you to the code in OPAL that we use - I would think something similar would be easy to implement in your code and would readily solve the problem.
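
As a rough illustration of the lookup-instead-of-strcmp idea above (not the OPAL code, which is not shown in this thread), the sketch below compares two node names by the address they resolve to. The function name same_host and its resolve parameter are assumptions; the resolver defaults to Python's socket.gethostbyname and is injectable so the comparison can be tried without a network.

```python
import socket


def same_host(a, b, resolve=socket.gethostbyname):
    """Hypothetical helper: True if a and b name the same host.

    Falls back to address comparison when the strings differ, so an FQDN,
    a short name, and a dotted IP address can all match each other.
    """
    if a == b:
        return True
    try:
        return resolve(a) == resolve(b)
    except OSError:
        # Lookup failed (socket.gaierror is an OSError): treat as no match.
        return False


if __name__ == "__main__":
    # Made-up lookup table standing in for DNS or /etc/hosts.
    table = {"head": "10.0.0.1", "head.example.com": "10.0.0.1", "node1": "10.0.0.2"}
    print(same_host("head", "head.example.com", resolve=table.__getitem__))
    print(same_host("head", "node1", resolve=table.__getitem__))
```

Note that the result depends on where the lookup runs: a name that resolves on the head node may resolve differently, or not at all, on another machine.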
Ralph

On Sep 19, 2008, at 7:18 AM, Greg Watson wrote:
Ralph,
The problem we're seeing is just with the head node. If I specify a particular IP address for the head node in the hostfile, it gets changed to the FQDN when displayed in the map. This is a problem for us as we need to be able to match the two, and since we're not necessarily running on the head node, we can't always do the same resolution you're doing.
Would it be possible to use the same address that is specified in the hostfile, or alternatively provide an XML attribute that contains this information?
Thanks,

Greg

On Sep 11, 2008, at 9:06 AM, Ralph Castain wrote:
Not in that regard, depending upon what you mean by "recently". The only changes I am aware of wrt nodes consisted of some changes to the order in which we use the nodes when specified by hostfile or -host, and a little #if protectionism needed by Brian for the Cray port.
Are you seeing this for every node? Reason I ask: I can't offhand think of anything in the code base that would replace a host name with the FQDN because we don't get that info for remote nodes. The only exception is the head node (where mpirun sits) - in that lone case, we default to the name returned to us by gethostname(). We do that because the head node is frequently accessible on a more global basis than the compute nodes - thus, the FQDN is required to ensure that there is no address confusion on the network.
If the user refers to compute nodes in a hostfile or -host (or in an allocation from a resource manager) by non-FQDN, we just assume they know what they are doing and the name will correctly resolve to a unique address.
On Sep 10, 2008, at 9:45 AM, Greg Watson wrote:
Hi,
Has there been a change in the behavior of the -display-map option recently in the 1.3 branch? We're now seeing the host name as a fully resolved DN rather than the entry that was specified in the hostfile. Is there any particular reason for this? If so, would it be possible to add the hostfile entry to the output since we need to be able to match the two?
Thanks,

Greg
_______________________________________________
devel mailing list
de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel
