Re: [OMPI devel] opal_output_verbose usage guidelines

Don Kerr Mon, 9 Jul 2007 09:58:51 -0400


Jeff Squyres wrote:

On Jul 6, 2007, at 5:20 PM, Don Kerr wrote:
Are there any guidelines about the use of opal_output_verbose?
Not so much.
       - Are there hidden meanings for a given verbose level? e.g. 0
reserved for PML, or 50-100 for BTL and so on
Nope. The output was designed to use the values with >= kinds ofchecking; i.e., the higher the verbose value the user gives, the moreoutput they see. I.e., the values are not used in a "bit flag" sense(i.e., each bit enables/disables a specific set of output).
       - Maybe the base component output_id is ok to use in situation
XYZ but a component specific output_id should be used in situationABC?
Or should never be used for component specific output?
I've typically used the base component output_id whenever possible.I usually started off having an output ID for a specific component,but usually that was for debugging (and therefore having oodles andoodles of output). By the time I was done, I usually had only a fewoutput statements and therefore used the base ID.
I guess my suggestion would be: if you're going to have a LOT ofoutput, then make it a component-specific ID. If it's a "reasonable"amount, then just use the base ID. Definitions of those terms aresubjective and intentionally fuzzy. :-)
Why I ask. I want to report a warning to the user when "--enable-debug"is not configured. I also do not want the error to show up all thetime,
only when for example --mca btl_base_debug is set to some value. I am
thinking I will just use opal_output_verbose but wanted to see iftherewere any guidelines about its use? Or if I should be thinking aboutsome
other option all together.
You want a warning to show when:

1. the udapl btl is used
2. --enable-debug was not configured
3. the user specifies btl_*_verbose (or btl_*_debug) >= some_value
Is that right? If so, is the intent to warn that somen checks arenot being performed that one would otherwise assume are beingperformed (because of #3)?

#1 and #2 is just to convey the environment I expect the user to berunning in, not the error case. Interpretation of #3 is a little askew.uDAPL gets its HCA information from /etc/dat.conf. This file has anentry for each HCA, even those that are potentially not "UP". Also itappears the OFED stack includes by default an entry for "OpenIB-bond"which I have not figured out what it is yet. In anycase uDAPL hastrouble distinguishing if an HCA is down intentionally or if is downbecause something is wrong. So the uDAPL BTL attempts to open all of theentries in this file. And the issues becomes how much information totoss back to the user. If a node has two IB interfaces but only one isup, do they want see a warning message about one of the interfaces beingdown when they already know this by looking at "ifconfig"? I think not.But this could be valueable information if there is a real problem.

Since its just one message at this point I think I will go with the baseoutput_id and if I need more I will look to create a component specificid. Thanks Jeff.

I expect to pursue this in order to find a better way to distinguishbetween an interface that is up or down but I don't have a solution atthe moment.


-DON

Re: [OMPI devel] opal_output_verbose usage guidelines

Reply via email to