Right -- the symbol isn't declared in orterun.  It's in libopen-rte.so.

My changes ensure that the .o file that MPIR_Breakpoint is defined in will be 
pulled in by the linker to be in the mpirun process.


On Dec 15, 2011, at 3:30 PM, Nathan Hjelm wrote:

> Your changes don't break anything but they also don't cause MPIR_Breakpoint 
> to appear in orterun:
> ct-login1:/scratch2/hjelmn hjelmn$ nm `type -p orterun` | grep MPIR
> 000000000060b0e0 B MPIR_attach_fifo
> 000000000060b2e0 B MPIR_being_debugged
> 000000000060b7b0 B MPIR_debug_state
> 000000000060ada0 B MPIR_executable_path
> 000000000060b840 B MPIR_force_to_main
> 000000000060b7b4 B MPIR_forward_comm
> 000000000060b8c8 B MPIR_forward_output
> 000000000060b9fc B MPIR_i_am_starter
> 000000000060afe0 B MPIR_partial_attach_ok
> 000000000060b8c0 B MPIR_proctable
> 000000000060b9f8 B MPIR_proctable_size
> 000000000060b300 B MPIR_server_arguments
> 
> -Nathan
> 
> On Thu, 15 Dec 2011, Jeff Squyres wrote:
> 
>> Ok, here's what I did:
>> 
>> https://svn.open-mpi.org/trac/ompi/changeset/25660
>> --> pulls in symbols like MPIR_Breakpoint via a different dummy function
>> 
>> https://svn.open-mpi.org/trac/ompi/changeset/25661
>> --> Fixes the ORTE_DECLSPEC typos that George found
>> 
>> LANL: Can you verify that this (still) works for you with totalview and stat?
>> 
>> 
>> 
>> On Dec 15, 2011, at 1:35 PM, Jeff Squyres wrote:
>> 
>>> On Dec 15, 2011, at 10:28 AM, Ralph Castain wrote:
>>> 
>>>>> I have had the chance now to test it with totalview and stat 1.1.0. Looks 
>>>>> good. I pushed the fix to the trunk and it will need to be CMRed to 1.5.
>>> 
>>> Ralph and I just talked about this on the phone some more -- I don't think 
>>> https://svn.open-mpi.org/trac/ompi/changeset/25654 was the right fix.
>>> 
>>> We still need to ensure that all the symbols in 
>>> orte/debugger/base/debugger_base_fns.o are pulled in from shared libraries 
>>> at run-time.  Totalview may well have gotten confused if we used the actual 
>>> MPIR_Breakpoint symbol to pull it in (i.e., it actually broke right there, 
>>> rather than when we actually invoked MPIR_Breakpoint, later).
>>> 
>>> A better way might be to use another dummy function to pull in all the 
>>> symbols from that .o file.  I.e., instead of doing foo=MPIR_Breakpoint, 
>>> call some other dummy function that lives in debugger_base_fns.c.  It's 
>>> only purpose in life would be to ensure that all the symbols -- including 
>>> MPIR_Breakpoint -- are pulled in at run time.
>>> 
>>> I'll go do that on the trunk right now...
>>> 
>>> --
>>> Jeff Squyres
>>> jsquy...@cisco.com
>>> For corporate legal information go to:
>>> http://www.cisco.com/web/about/doing_business/legal/cri/
>>> 
>>> 
>>> _______________________________________________
>>> devel mailing list
>>> de...@open-mpi.org
>>> http://www.open-mpi.org/mailman/listinfo.cgi/devel
>> 
>> 
>> -- 
>> Jeff Squyres
>> jsquy...@cisco.com
>> For corporate legal information go to:
>> http://www.cisco.com/web/about/doing_business/legal/cri/
>> 
>> 
>> _______________________________________________
>> devel mailing list
>> de...@open-mpi.org
>> http://www.open-mpi.org/mailman/listinfo.cgi/devel
>> 
> _______________________________________________
> devel mailing list
> de...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/devel


-- 
Jeff Squyres
jsquy...@cisco.com
For corporate legal information go to:
http://www.cisco.com/web/about/doing_business/legal/cri/


Reply via email to