Re: [OMPI devel] -display-map

Greg Watson Wed, 14 Jan 2009 09:02:44 -0500

Ralph,

The only time we use the resolved names is when we get a map, so we consider them part of the map output.

If quasi-XML is all that will ever be possible with 1.3, then you may as well leave as-is and we will attempt to clean it up in Eclipse. It would be nice if a future version of ompi could output correct XML (including stdout) as this would vastly simplify the parsing we need to do.


Regards,

Greg

On Jan 13, 2009, at 3:30 PM, Ralph Castain wrote:

Hmmm...well, I can't do either for 1.3.0 as it is departing this afternoon.
The first option would be very hard to do. I would have to expose the display-map option across the code base and check it prior to printing anything about resolving node names. I guess I should ask: do you only want noderesolve statements when we are displaying the map? Right now, I will output them regardless.
The second option could be done. I could check if any "display" option has been specified, and output the <ompi> root at that time (likewise for the end). Anything we output in-between would be encapsulated between the two, but that would include any user output to stdout and/or stderr - which for 1.3.0 is not in xml.
Any thoughts?

Ralph
PS. Guess I should clarify that I was not striving for true XML interaction here, but rather a quasi-XML format that would help you to filter the output. I have no problem trying to get to something more formally correct, but it could be tricky in some places to achieve it due to the inherent async nature of the beast.
On Jan 13, 2009, at 12:17 PM, Greg Watson wrote:
Ralph,
The XML is looking better now, but there is still one problem. To be valid, there needs to be only one root element, but currently you don't have any (or many). So rather than:
<noderesolve name="node0" resolved="Jarrah.local"/>
<noderesolve name="node1" resolved="Jarrah.local"/>
<map>
        <host name="Jarrah.local" slots="8" max_slots="0">
                <process rank="0"/>
                <process rank="1"/>
                <process rank="2"/>
                <process rank="3"/>
                <process rank="4"/>
        </host>
</map>

the XML should be:

<map>
        <noderesolve name="node0" resolved="Jarrah.local"/>
        <noderesolve name="node1" resolved="Jarrah.local"/>
        <host name="Jarrah.local" slots="8" max_slots="0">
                <process rank="0"/>
                <process rank="1"/>
                <process rank="2"/>
                <process rank="3"/>
                <process rank="4"/>
        </host>
</map>

or:

<ompi>
        <noderesolve name="node0" resolved="Jarrah.local"/>
        <noderesolve name="node1" resolved="Jarrah.local"/>
        <map>
                <host name="Jarrah.local" slots="8" max_slots="0">
                        <process rank="0"/>
                        <process rank="1"/>
                        <process rank="2"/>
                        <process rank="3"/>
                        <process rank="4"/>
                </host>
        </map>
</ompi>

Would either of these be possible?

Thanks,

Greg
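
The single-root requirement described above can be checked with any standard XML parser. The following is a minimal sketch in Python, not the Eclipse parsing code: the helper names (is_well_formed, wrap_in_root, resolved_names) are hypothetical, and the sample text is abbreviated from the output quoted in the message. It shows that the multi-root output is rejected, while the same fragments wrapped in an <ompi> element parse cleanly.

```python
import xml.etree.ElementTree as ET

# Abbreviated copy of the multi-root output quoted in the message above.
FRAGMENTS = """\
<noderesolve name="node0" resolved="Jarrah.local"/>
<noderesolve name="node1" resolved="Jarrah.local"/>
<map>
    <host name="Jarrah.local" slots="8" max_slots="0">
        <process rank="0"/>
        <process rank="1"/>
    </host>
</map>
"""


def is_well_formed(text):
    """Return True if text parses as a single XML document."""
    try:
        ET.fromstring(text)
        return True
    except ET.ParseError:
        return False


def wrap_in_root(text, root="ompi"):
    """Hypothetical client-side workaround: add a synthetic root element."""
    return "<{0}>\n{1}</{0}>\n".format(root, text)


def resolved_names(text):
    """Map each hostfile name to the name it was resolved to."""
    doc = ET.fromstring(text)
    return {e.get("name"): e.get("resolved") for e in doc.iter("noderesolve")}


if __name__ == "__main__":
    print(is_well_formed(FRAGMENTS))
    print(is_well_formed(wrap_in_root(FRAGMENTS)))
    print(resolved_names(wrap_in_root(FRAGMENTS)))
```

Wrapping on the client side only works if nothing else (such as user stdout) is interleaved with the tags, which is the limitation raised in the reply above.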

On Dec 8, 2008, at 2:18 PM, Greg Watson wrote:
Ok thanks. I'll test from trunk in future.

Greg

On Dec 8, 2008, at 2:05 PM, Ralph Castain wrote:
Working its way around the CMR process now.
Might be easier in the future if we could test/debug this in the trunk, though. Otherwise, the CMR procedure will fall behind and a fix might miss a release window.
Anyway, hopefully this one will make the 1.3.0 release cutoff.

Thanks
Ralph

On Dec 8, 2008, at 9:56 AM, Greg Watson wrote:
Hi Ralph,
This is now in 1.3rc2, thanks. However there are a couple of problems. Here is what I see:
[Jarrah.watson.ibm.com:58957] <noderesolve name="node0" resolved="Jarrah.watson.ibm.com">
For some reason each line is prefixed with "[...]", any idea why this is? Also the end tag should be "/>" not ">".
Thanks,

Greg
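
Until both problems are fixed at the source, a consumer could tolerate them when reading the output. The sketch below is only an illustration of that idea, assuming the prefix always has the "[host:pid]" shape shown in the message; the function names and regular expressions are hypothetical and are not part of Open MPI or Eclipse.

```python
import re

# "[host:pid] " prefix as seen in the 1.3rc2 output quoted above (assumed shape).
PREFIX = re.compile(r"^\[[^\]\s]+:\d+\]\s*")

# Accept the tag whether it ends in ">" or "/>".
NODERESOLVE = re.compile(
    r'<noderesolve\s+name="([^"]*)"\s*resolved="([^"]*)"\s*/?>'
)


def parse_noderesolve(line):
    """Return (name, resolved) for a noderesolve line, or None otherwise."""
    body = PREFIX.sub("", line.strip())
    match = NODERESOLVE.fullmatch(body)
    return (match.group(1), match.group(2)) if match else None


def normalize(line):
    """Rewrite a noderesolve line as a self-closing tag; pass other lines through."""
    parsed = parse_noderesolve(line)
    if parsed is None:
        return line
    name, resolved = parsed
    return '<noderesolve name="{}" resolved="{}"/>'.format(name, resolved)


if __name__ == "__main__":
    sample = ('[Jarrah.watson.ibm.com:58957] '
              '<noderesolve name="node0" resolved="Jarrah.watson.ibm.com">')
    print(normalize(sample))
```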

On Nov 24, 2008, at 3:06 PM, Greg Watson wrote:
Great, thanks. I'll take a look once it comes over to 1.3.

Cheers,

Greg

On Nov 24, 2008, at 2:59 PM, Ralph Castain wrote:
Yo Greg
This is in the trunk as of r20032. I'll bring it over to 1.3 in a few days.
I implemented it as another MCA param "orte_show_resolved_nodenames" so you can actually get the info as you execute the job, if you want. The xml tag is "noderesolve" - let me know if you need any changes.
Ralph


On Oct 22, 2008, at 11:55 AM, Greg Watson wrote:
Ralph,
I guess the issue for us is that we will have to run two commands to get the information we need. One to get the configuration information, such as version and MCA parameters, and one to get the host information, whereas it would seem more logical that this should all be available via some kind of "configuration discovery" command. I understand the issue with supplying the hostfile though, so maybe this just points at the need for us to separate configuration information from the host information. In any case, we'll work with what you think is best.
Greg

On Oct 20, 2008, at 4:49 PM, Ralph Castain wrote:
Hmmm...just to be sure we are all clear on this. The reason we proposed to use mpirun is that "hostfile" has no meaning outside of mpirun. That's why ompi_info can't do anything in this regard.
We have no idea what hostfile the user may specify until we actually get the mpirun cmd line. They may have specified a default-hostfile, but they could also specify hostfiles for the individual app_contexts. These may or may not include the node upon which mpirun is executing.
So the only way to provide you with a separate command to get a hostfile<->nodename mapping would require you to provide us with the default-hostfile and/or hostfile cmd line options just as if you were issuing the mpirun cmd. We just wouldn't launch - but it would be the exact equivalent of doing "mpirun --do-not-launch".
Am I missing something? If so, please do correct me - I would be happy to provide a tool if that would make it easier. Just not sure what that tool would do.
Thanks
Ralph


On Oct 19, 2008, at 1:59 PM, Greg Watson wrote:
Ralph,
It seems a little strange to be using mpirun for this, but barring providing a separate command, or using ompi_info, I think this would solve our problem.
Thanks,

Greg

On Oct 17, 2008, at 10:46 AM, Ralph Castain wrote:
Sorry for delay - had to ponder this one for awhile.
Jeff and I agree that adding something to ompi_info would not be a good idea. Ompi_info has no knowledge or understanding of hostfiles, and adding that capability to it would be a major distortion of its intended use.
However, we think we can offer an alternative that might better solve the problem. Remember, we now treat hostfiles in a very different manner than before - see the wiki page for a complete description, or "man orte_hosts".
So the problem is that, to provide you with what you want, we need to "dump" the information from whatever default-hostfile was provided, and, if no default-hostfile was provided, then the information from each hostfile that was provided with an app_context.
The best way we could think of to do this is to add another mpirun cmd line option --dump-hostfiles that would output the line-by-line name from the hostfile plus the name we resolved it to. Of course, --xml would cause it to be in xml format.
Would that meet your needs?

Ralph


On Oct 15, 2008, at 3:12 PM, Greg Watson wrote:
Hi Ralph,
We've been discussing this back and forth a bit internally and don't really see an easy solution. Our problem is that Eclipse is not running on the head node, so gethostbyname will not necessarily resolve to the same address. For example, the hostfile might refer to the head node by an internal network address that is not visible to the outside world. Since gethostname also looks in /etc/hosts, it may resolve locally but not on a remote system. The only thing I can think of would be, rather than us reading the hostfile directly as we do now, to provide an option to ompi_info that would dump the hostfile using the same rules that you apply when you're using the hostfile. Would that be feasible?
Greg

On Sep 22, 2008, at 4:25 PM, Ralph Castain wrote:
Sorry for delay - was on vacation and am now trying to work my way back to the surface.
I'm not sure I can fix this one for two reasons:
1. In general, OMPI doesn't really care what name is used for the node. However, the problem is that it needs to be consistent. In this case, ORTE has already used the name returned by gethostname to create its session directory structure long before mpirun reads a hostfile. This is why we retain the value from gethostname instead of allowing it to be overwritten by the name in whatever allocation we are given. Using the name in hostfile would require that I either find some way to remember any prior name, or that I tear down and rebuild the session directory tree - neither seems attractive nor simple (e.g., what happens when the user provides multiple entries in the hostfile for the node, each with a different IP address based on another interface in that node? Sounds crazy, but we have already seen it done - which one do I use?).
2. We don't actually store the hostfile info anywhere - we just use it and forget it. For us to add an XML attribute containing any hostfile-related info would therefore require us to re-read the hostfile. I could have it do that -only- in the case of "XML output required", but it seems rather ugly.
An alternative might be for you to simply do a "gethostbyname" lookup of the IP address or hostname to see if it matches instead of just doing a strcmp. This is what we have to do internally as we frequently have problems with FQDN vs. non-FQDN vs. IP addresses etc. If the local OS hasn't cached the IP address for the node in question it can take a little time to DNS resolve it, but otherwise works fine.
I can point you to the code in OPAL that we use - I would think something similar would be easy to implement in your code and would readily solve the problem.
Ralph
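
The lookup-instead-of-strcmp idea suggested above can be sketched as follows. This is a minimal Python illustration, not the OPAL code referred to in the message. It uses socket.getaddrinfo rather than gethostbyname so that IPv6 addresses are also covered, and the function names and the injectable resolver parameter are hypothetical, added only so the logic can be exercised without a real network.

```python
import socket


def resolve_addresses(name, resolver=socket.getaddrinfo):
    """Return the set of IP address strings that name resolves to.

    An unresolvable name yields an empty set rather than an exception.
    """
    try:
        infos = resolver(name, None)
    except socket.gaierror:
        return set()
    # Each entry is (family, type, proto, canonname, sockaddr);
    # sockaddr[0] is the address string for both IPv4 and IPv6.
    return {info[4][0] for info in infos}


def same_host(a, b, resolver=socket.getaddrinfo):
    """Compare two host identifiers (short name, FQDN, or IP address).

    Identical strings match immediately; otherwise the two are treated as
    the same host if they share at least one resolved address.
    """
    if a == b:
        return True
    return bool(resolve_addresses(a, resolver) & resolve_addresses(b, resolver))


if __name__ == "__main__":
    # Result depends on the local resolver configuration.
    print(same_host("localhost", "127.0.0.1"))
```

The answer depends on the resolver of the machine running the comparison, so a name that matches on the head node may not match when the same check is run elsewhere.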

On Sep 19, 2008, at 7:18 AM, Greg Watson wrote:
Ralph,
The problem we're seeing is just with the head node. If I specify a particular IP address for the head node in the hostfile, it gets changed to the FQDN when displayed in the map. This is a problem for us as we need to be able to match the two, and since we're not necessarily running on the head node, we can't always do the same resolution you're doing.
Would it be possible to use the same address that is specified in the hostfile, or alternatively provide an XML attribute that contains this information?
Thanks,

Greg

On Sep 11, 2008, at 9:06 AM, Ralph Castain wrote:
Not in that regard, depending upon what you mean by "recently". The only changes I am aware of wrt nodes consisted of some changes to the order in which we use the nodes when specified by hostfile or -host, and a little #if protectionism needed by Brian for the Cray port.
Are you seeing this for every node? Reason I ask: I can't offhand think of anything in the code base that would replace a host name with the FQDN because we don't get that info for remote nodes. The only exception is the head node (where mpirun sits) - in that lone case, we default to the name returned to us by gethostname(). We do that because the head node is frequently accessible on a more global basis than the compute nodes - thus, the FQDN is required to ensure that there is no address confusion on the network.
If the user refers to compute nodes in a hostfile or -host (or in an allocation from a resource manager) by non-FQDN, we just assume they know what they are doing and the name will correctly resolve to a unique address.
On Sep 10, 2008, at 9:45 AM, Greg Watson wrote:
Hi,
Has the behavior of the -display-map option changed recently in the 1.3 branch? We're now seeing the host name as a fully resolved DN rather than the entry that was specified in the hostfile. Is there any particular reason for this? If so, would it be possible to add the hostfile entry to the output since we need to be able to match the two?
Thanks,

Greg
_______________________________________________
devel mailing list
de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel
