Re: [OMPI devel] -display-map

Ralph Castain Tue, 13 Jan 2009 15:30:33 -0500

Hmmm...well, I can't do either for 1.3.0 as it is departing thisafternoon.

The first option would be very hard to do. I would have to expose thedisplay-map option across the code base and check it prior to printinganything about resolving node names. I guess I should ask: do you onlywant noderesolve statements when we are displaying the map? Right now,I will output them regardless.

The second option could be done. I could check if any "display" optionhas been specified, and output the <ompi> root at that time (likewisefor the end). Anything we output in-between would be encapsulatedbetween the two, but that would include any user output to stdout and/or stderr - which for 1.3.0 is not in xml.


Any thoughts?

Ralph

PS. Guess I should clarify that I was not striving for true XMLinteraction here, but rather a quasi-XML format that would help you tofilter the output. I have no problem trying to get to something moreformally correct, but it could be tricky in some places to achieve itdue to the inherent async nature of the beast.



On Jan 13, 2009, at 12:17 PM, Greg Watson wrote:

Ralph,
The XML is looking better now, but there is still one problem. To bevalid, there needs to be only one root element, but currently youdon't have any (or many). So rather than:
<noderesolve name="node0" resolved="Jarrah.local"/>
<noderesolve name="node1" resolved="Jarrah.local"/>
<map>
        <host name="Jarrah.local" slots="8" max_slots="0">
                <process rank="0"/>
                <process rank="1"/>
                <process rank="2"/>
                <process rank="3"/>
                <process rank="4"/>
        </host>
</map>

the XML should be:

<map>
        <noderesolve name="node0" resolved="Jarrah.local"/>
        <noderesolve name="node1" resolved="Jarrah.local"/>
        <host name="Jarrah.local" slots="8" max_slots="0">
                <process rank="0"/>
                <process rank="1"/>
                <process rank="2"/>
                <process rank="3"/>
                <process rank="4"/>
        </host>
</map>

or:

<ompi>
        <noderesolve name="node0" resolved="Jarrah.local"/>
        <noderesolve name="node1" resolved="Jarrah.local"/>
        <map>
                <host name="Jarrah.local" slots="8" max_slots="0">
                        <process rank="0"/>
                        <process rank="1"/>
                        <process rank="2"/>
                        <process rank="3"/>
                        <process rank="4"/>
                </host>
        </map>
</ompi>

Would either of these be possible?

Thanks,

Greg

On Dec 8, 2008, at 2:18 PM, Greg Watson wrote:
Ok thanks. I'll test from trunk in future.

Greg

On Dec 8, 2008, at 2:05 PM, Ralph Castain wrote:
Working its way around the CMR process now.
Might be easier in the future if we could test/debug this in thetrunk, though. Otherwise, the CMR procedure will fall behind and afix might miss a release window.
Anyway, hopefully this one will make the 1.3.0 release cutoff.

Thanks
Ralph

On Dec 8, 2008, at 9:56 AM, Greg Watson wrote:
Hi Ralph,
This is now in 1.3rc2, thanks. However there are a couple ofproblems. Here is what I see:
[Jarrah.watson.ibm.com:58957] <noderesolve name="node0"resolved="Jarrah.watson.ibm.com">
For some reason each line is prefixed with "[...]", any idea whythis is? Also the end tag should be "/>" not ">".
Thanks,

Greg

On Nov 24, 2008, at 3:06 PM, Greg Watson wrote:
Great, thanks. I'll take a look once it comes over to 1.3.

Cheers,

Greg

On Nov 24, 2008, at 2:59 PM, Ralph Castain wrote:
Yo Greg
This is in the trunk as of r20032. I'll bring it over to 1.3 ina few days.
I implemented it as another MCA param"orte_show_resolved_nodenames" so you can actually get the infoas you execute the job, if you want. The xml tag is"noderesolve" - let me know if you need any changes.
Ralph


On Oct 22, 2008, at 11:55 AM, Greg Watson wrote:
Ralph,
I guess the issue for us is that we will have to run twocommands to get the information we need. One to get theconfiguration information, such as version and MCA parameters,and one to get the host information, whereas it would seemmore logical that this should all be available via some kindof "configuration discovery" command. I understand the issuewith supplying the hostfile though, so maybe this just pointsat the need for us to separate configuration information fromthe host information. In any case, we'll work with what youthink is best.
Greg

On Oct 20, 2008, at 4:49 PM, Ralph Castain wrote:
Hmmm...just to be sure we are all clear on this. The reasonwe proposed to use mpirun is that "hostfile" has no meaningoutside of mpirun. That's why ompi_info can't do anything inthis regard.
We have no idea what hostfile the user may specify until weactually get the mpirun cmd line. They may have specified adefault-hostfile, but they could also specify hostfiles forthe individual app_contexts. These may or may not include thenode upon which mpirun is executing.
So the only way to provide you with a separate command to geta hostfile<->nodename mapping would require you to provide uswith the default-hostifle and/or hostfile cmd line optionsjust as if you were issuing the mpirun cmd. We just wouldn'tlaunch - but it would be the exact equivalent of doing"mpirun --do-not-launch".
Am I missing something? If so, please do correct me - I wouldbe happy to provide a tool if that would make it easier. Justnot sure what that tool would do.
Thanks
Ralph


On Oct 19, 2008, at 1:59 PM, Greg Watson wrote:
Ralph,
It seems a little strange to be using mpirun for this, butbarring providing a separate command, or using ompi_info, Ithink this would solve our problem.
Thanks,

Greg

On Oct 17, 2008, at 10:46 AM, Ralph Castain wrote:
Sorry for delay - had to ponder this one for awhile.
Jeff and I agree that adding something to ompi_info wouldnot be a good idea. Ompi_info has no knowledge orunderstanding of hostfiles, and adding that capability toit would be a major distortion of its intended use.
However, we think we can offer an alternative that mightbetter solve the problem. Remember, we now treat hostfilesin a very different manner than before - see the wiki pagefor a complete description, or "man orte_hosts".
So the problem is that, to provide you with what you want,we need to "dump" the information from whatever default-hostfile was provided, and, if no default-hostfile wasprovided, then the information from each hostfile that wasprovided with an app_context.
The best way we could think of to do this is to add anothermpirun cmd line option --dump-hostfiles that would outputthe line-by-line name from the hostfile plus the name weresolved it to. Of course, --xml would cause it to be inxml format.
Would that meet your needs?

Ralph


On Oct 15, 2008, at 3:12 PM, Greg Watson wrote:
Hi Ralph,
We've been discussing this back and forth a bit internallyand don't really see an easy solution. Our problem is thatEclipse is not running on the head node, so gethostbynamewill not necessarily resolve to the same address. Forexample, the hostfile might refer to the head node by aninternal network address that is not visible to theoutside world. Since gethostname also looks in /etc/hosts,it may resolve locally but not on a remote system. Theonly think I can think of would be, rather than us readingthe hostfile directly as we do now, to provide an optionto ompi_info that would dump the hostfile using the samerules that you apply when you're using the hostfile. Wouldthat be feasible?
Greg

On Sep 22, 2008, at 4:25 PM, Ralph Castain wrote:
Sorry for delay - was on vacation and am now trying towork my way back to the surface.
I'm not sure I can fix this one for two reasons:
1. In general, OMPI doesn't really care what name is usedfor the node. However, the problem is that it needs to beconsistent. In this case, ORTE has already used the namereturned by gethostname to create its session directorystructure long before mpirun reads a hostfile. This iswhy we retain the value from gethostname instead ofallowing it to be overwritten by the name in whateverallocation we are given. Using the name in hostfile wouldrequire that I either find some way to remember any priorname, or that I tear down and rebuild the sessiondirectory tree - neither seems attractive nor simple(e.g., what happens when the user provides multipleentries in the hostfile for the node, each with adifferent IP address based on another interface in thatnode? Sounds crazy, but we have already seen it done -which one do I use?).
2. We don't actually store the hostfile info anywhere -we just use it and forget it. For us to add an XMLattribute containing any hostfile-related info wouldtherefore require us to re-read the hostfile. I couldhave it do that -only- in the case of "XML outputrequired", but it seems rather ugly.
An alternative might be for you to simply do a"gethostbyname" lookup of the IP address or hostname tosee if it matches instead of just doing a strcmp. This iswhat we have to do internally as we frequently haveproblems with FQDN vs. non-FQDN vs. IP addresses etc. Ifthe local OS hasn't cached the IP address for the node inquestion it can take a little time to DNS resolve it, butotherwise works fine.
I can point you to the code in OPAL that we use - I wouldthink something similar would be easy to implement inyour code and would readily solve the problem.
Ralph

On Sep 19, 2008, at 7:18 AM, Greg Watson wrote:
Ralph,
The problem we're seeing is just with the head node. IfI specify a particular IP address for the head node inthe hostfile, it gets changed to the FQDN when displayedin the map. This is a problem for us as we need to beable to match the two, and since we're not necessarilyrunning on the head node, we can't always do the sameresolution you're doing.
Would it be possible to use the same address that isspecified in the hostfile, or alternatively provide anXML attribute that contains this information?
Thanks,

Greg

On Sep 11, 2008, at 9:06 AM, Ralph Castain wrote:
Not in that regard, depending upon what you mean by"recently". The only changes I am aware of wrt nodesconsisted of some changes to the order in which we usethe nodes when specified by hostfile or -host, and alittle #if protectionism needed by Brian for the Crayport.
Are you seeing this for every node? Reason I ask: Ican't offhand think of anything in the code base thatwould replace a host name with the FQDN because wedon't get that info for remote nodes. The onlyexception is the head node (where mpirun sits) - inthat lone case, we default to the name returned to usby gethostname(). We do that because the head node isfrequently accessible on a more global basis than thecompute nodes - thus, the FQDN is required to ensurethat there is no address confusion on the network.
If the user refers to compute nodes in a hostfile or -host (or in an allocation from a resource manager) bynon-FQDN, we just assume they know what they are doingand the name will correctly resolve to a unique address.
On Sep 10, 2008, at 9:45 AM, Greg Watson wrote:
Hi,
Has there been a change in the behavior of the -display-map option has changed recently in the 1.3branch. We're now seeing the host name as a fullyresolved DN rather than the entry that was specifiedin the hostfile. Is there any particular reason forthis? If so, would it be possible to add the hostfileentry to the output since we need to be able to matchthe two?
Thanks,

Greg
_______________________________________________
devel mailing list
[email protected]
http://www.open-mpi.org/mailman/listinfo.cgi/devel
_______________________________________________
devel mailing list
[email protected]
http://www.open-mpi.org/mailman/listinfo.cgi/devel
_______________________________________________
devel mailing list
[email protected]
http://www.open-mpi.org/mailman/listinfo.cgi/devel
_______________________________________________
devel mailing list
[email protected]
http://www.open-mpi.org/mailman/listinfo.cgi/devel
_______________________________________________
devel mailing list
[email protected]
http://www.open-mpi.org/mailman/listinfo.cgi/devel
_______________________________________________
devel mailing list
[email protected]
http://www.open-mpi.org/mailman/listinfo.cgi/devel
_______________________________________________
devel mailing list
[email protected]
http://www.open-mpi.org/mailman/listinfo.cgi/devel
_______________________________________________
devel mailing list
[email protected]
http://www.open-mpi.org/mailman/listinfo.cgi/devel
_______________________________________________
devel mailing list
[email protected]
http://www.open-mpi.org/mailman/listinfo.cgi/devel
_______________________________________________
devel mailing list
[email protected]
http://www.open-mpi.org/mailman/listinfo.cgi/devel
_______________________________________________
devel mailing list
[email protected]
http://www.open-mpi.org/mailman/listinfo.cgi/devel
_______________________________________________
devel mailing list
[email protected]
http://www.open-mpi.org/mailman/listinfo.cgi/devel
_______________________________________________
devel mailing list
[email protected]
http://www.open-mpi.org/mailman/listinfo.cgi/devel
_______________________________________________
devel mailing list
[email protected]
http://www.open-mpi.org/mailman/listinfo.cgi/devel
_______________________________________________
devel mailing list
[email protected]
http://www.open-mpi.org/mailman/listinfo.cgi/devel

Re: [OMPI devel] -display-map

Reply via email to