[Bug 260027] Re: libopenmpi segfaults when electric fence is enabled
** Changed in: openmpi (Ubuntu)
       Status: Fix Committed => Fix Released

--
libopenmpi segfaults when electric fence is enabled
https://bugs.launchpad.net/bugs/260027
You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu.

--
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 260027] Re: libopenmpi segfaults when electric fence is enabled
Discussed and fix committed upstream (see http://www.open-mpi.org/community/lists/devel/2008/08/4607.php and follow-ups).

** Changed in: openmpi (Ubuntu)
       Status: New => Fix Committed
[Bug 260027] Re: libopenmpi segfaults when electric fence is enabled
I fixed this by building the intrepid package.

    sudo apt-get build-dep openmpi
    dget -x http://archive.ubuntu.com/ubuntu/pool/universe/o/openmpi/openmpi_1.2.7~rc2-1ubuntu2.dsc
    cd openmpi-1.2.7~rc2/
    debuild --no-tgz-check -us -uc -i -I
    sudo debi
[Bug 260027] Re: libopenmpi segfaults when electric fence is enabled
Sorry, this isn't fixed, disregard my last comment.
[Bug 260027] Re: libopenmpi segfaults when electric fence is enabled
Expanding the OBJ_CONSTRUCT_INTERNAL macro with its definition in opal/class/opal_object.h, one finds that the faulting statement is

    ((opal_object_t *) (item))->obj_class = (flist->fl_elem_class);

I modified the openmpi source to print out the argument to malloc, the returned pointer, and the address of the above variable. Here is a modified source snippet of opal_free_list_grow, annotated with the output of the debugging printouts:

    fprintf(stderr, "mpidebug: allocating %d\n", (num_elements * flist->fl_elem_size) + sizeof(opal_list_item_t) + CACHE_LINE_SIZE);
    alloc_ptr = (unsigned char *)malloc(1 * ((num_elements * flist->fl_elem_size) + sizeof(opal_list_item_t) + CACHE_LINE_SIZE));
    fprintf(stderr, "mpidebug: allocated at memory address %p\n", alloc_ptr);

mpidebug: allocating 216
mpidebug: allocated at memory address 0xb62bdf28

    for(i=0; i<num_elements; i++) {
        opal_free_list_item_t* item = (opal_free_list_item_t*)ptr;
        if (NULL != flist->fl_elem_class) {
            do {
                if (0 == (flist->fl_elem_class)->cls_initialized) {
                    opal_class_initialize((flist->fl_elem_class));
                }
                fprintf(stderr, "mpidebug: accessing address %p\n", &((opal_object_t *) (item))->obj_class);
                ((opal_object_t *) (item))->obj_class = (flist->fl_elem_class);
                fprintf(stderr, "mpidebug: accessing address %p\n", &((opal_object_t *) (item))->obj_reference_count);
                ((opal_object_t *) (item))->obj_reference_count = 1;
                opal_obj_run_constructors((opal_object_t *) (item));
            } while (0);
        }
        opal_list_append(&(flist->super), &(item->super));
        ptr += flist->fl_elem_size;
    }

mpidebug: accessing address 0xb62be000

As can be seen, the statement

    ((opal_object_t *) (item))->obj_class = (flist->fl_elem_class);

accesses memory at alloc_ptr + 216, but the block at alloc_ptr was allocated with a size of only 216 bytes, so the write lands just past the end of the allocation.
[Bug 260027] Re: libopenmpi segfaults when electric fence is enabled
After struggling a bit to build libopenmpi1 with debugging symbols (even with libopenmpi-dbg installed, libopen-pal does not have debugging symbols installed), I managed to get a more useful backtrace:

0xb5cdd334 in opal_free_list_grow (flist=0xb2b46a50, num_elements=1) at class/opal_free_list.c:113
113         OBJ_CONSTRUCT_INTERNAL(item, flist->fl_elem_class);
(gdb) bt
#0  0xb5cdd334 in opal_free_list_grow (flist=0xb2b46a50, num_elements=1) at class/opal_free_list.c:113
#1  0xb5cdd479 in opal_free_list_init (flist=0xb2b46a50, elem_size=56, elem_class=0xb2b46e20, num_elements_to_alloc=73, max_elements_to_alloc=-1, num_elements_per_alloc=1) at class/opal_free_list.c:78
#2  0xb2b381aa in ompi_osc_pt2pt_component_init (enable_progress_threads=false, enable_mpi_threads=false) at osc_pt2pt_component.c:173
#3  0xb792b67c in ompi_osc_base_find_available (enable_progress_threads=false, enable_mpi_threads=false) at base/osc_base_open.c:84
#4  0xb78e6abe in ompi_mpi_init (argc=5, argv=0xbfd61f84, requested=0, provided=0xbfd61e78) at runtime/ompi_mpi_init.c:411
#5  0xb7911a87 in PMPI_Init (argc=0xbfd61f00, argv=0xbfd61f04) at pinit.c:71
#6  0x0811ca6c in MPI::Init ()
#7  0x08118b8a in main ()
[Bug 260027] Re: libopenmpi segfaults when electric fence is enabled
** Attachment added: "A crashing program, when electric-fence is enabled"
   http://launchpadlibrarian.net/16977965/test.cpp