I think this is fixed by Nathan's PR:

    https://github.com/open-mpi/ompi/pull/653

It's waiting for George's review -- George?


> On Jun 24, 2015, at 7:14 AM, Howard Pritchard <hpprit...@gmail.com> wrote:
> 
> Hi Folks,
> 
> I'm seeing some MTT failures from last night in the form of run failures
> and/or timeouts. Is anyone else seeing these?
> 
> On the IBM dataplex system I'm seeing these kinds of assertion failures in ob1:
> 
> c_ring: pml_ob1_component.c:308: mca_pml_ob1_component_fini: Assertion `((0xdeafbeedULL << 32) + 0xdeafbeedULL) == ((opal_object_t *) (mca_pml_ob1_recvreq))->obj_magic_id' failed.
> [c1476:07379] *** Process received signal ***
> [c1476:07379] Signal: Aborted (6)
> [c1476:07379] Signal code:  (-6)
> c_ring: pml_ob1_component.c:308: mca_pml_ob1_component_fini: Assertion `((0xdeafbeedULL << 32) + 0xdeafbeedULL) == ((opal_object_t *) (mca_pml_ob1_recvreq))->obj_magic_id' failed.
> [c1475:18949] *** Process received signal ***
> [c1475:18949] Signal: Aborted (6)
> c_ring: pml_ob1_component.c:308: mca_pml_ob1_component_fini: Assertion `((0xdeafbeedULL << 32) + 0xdeafbeedULL) == ((opal_object_t *) (mca_pml_ob1_recvreq))->obj_magic_id' failed.
> c_ring: pml_ob1_component.c:308: mca_pml_ob1_component_fini: Assertion `((0xdeafbeedULL << 32) + 0xdeafbeedULL) == ((opal_object_t *) (mca_pml_ob1_recvreq))->obj_magic_id' failed.
> [c1476:07375] *** Process received signal ***
> c_ring: pml_ob1_component.c:308: mca_pml_ob1_component_fini: Assertion `((0xdeafbeedULL << 32) + 0xdeafbeedULL) == ((opal_object_t *) (mca_pml_ob1_recvreq))->obj_magic_id' failed.
> [c1475:18951] *** Process received signal ***
> [c1475:18951] Signal: Aborted (6)
> [c1477:19137] Signal: Aborted (6)
> [c1477:19137] Signal code:  (-6)
> [c1476:07375] Signal: Aborted (6)
> c_ring: pml_ob1_component.c:308: mca_pml_ob1_component_fini: Assertion `((0xdeafbeedULL << 32) + 0xdeafbeedULL) == ((opal_object_t *) (mca_pml_ob1_recvreq))->obj_magic_id' failed.
> 
> ---------- Forwarded message ----------
> From: Howard Pritchard <h...@c1479.nersc.gov>
> Date: 2015-06-24 7:04 GMT-06:00
> Subject: MTT test has completed, status: failed
> To: hpprit...@gmail.com
> 
> 
> Subject: MTT test has completed, status: failed
> hostname: c1479
> uname: Linux c1479 2.6.32-358.18.1.el6.nersc.x86_64 #1 SMP Wed Aug 28 02:17:42 PDT 2013 x86_64 x86_64 x86_64 GNU/Linux
> who am i:
> 
> +----------+---------+-------------------+----------+------+------+----------+------+--------------------------------------------------------------------------+
> | Phase    | Section | MPI Version       | Duration | Pass | Fail | Time out | Skip | Detailed report                                                          |
> +----------+---------+-------------------+----------+------+------+----------+------+--------------------------------------------------------------------------+
> | Test Run | trivial | dev-1936-gdb3c59b | 00:39    | 3    | 3    |          |      | Test_Run-trivial-ompi-nightly-master-dev-1936-gdb3c59b-pgi_warnings.html |
> +----------+---------+-------------------+----------+------+------+----------+------+--------------------------------------------------------------------------+
> 
> 
>     Total Tests:    6
>     Total Failures: 3
>     Total Passed:   3
>     Total Duration: 39 secs. (00:39)
> 
> 
> 
> Test Scratch Directory is /global/homes/h/hpp/mtt_carver_tmp
> 
> _______________________________________________
> devel mailing list
> de...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> Link to this post: http://www.open-mpi.org/community/lists/devel/2015/06/17525.php


-- 
Jeff Squyres
jsquy...@cisco.com
For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/