Hey, It seems to be a problem with the MPI library.
ibv_channel_manager.c is the channel manager for Infiniband verbs. MPIDI_CH3_PKT_EAGER_SEND is expected as all message sizes are below the eager threshold. HYD_pmcd_pmip_control_cmd_cb has something to do with hydra I guess. It really seems to be errors in MPICH2, like this one: (./pm/pmiserv/pmip_cb.c:906): assert (!closed) failed and this one: [1100] Abort: [] Got completion with error 12, vendor code=0x81, dest >From the documentation: 7.2.6 Got completion with error code 12 http://mvapich.cse.ohio-state.edu/support/mvapich_user_guide.html#x1-540007.2.6 "The error code 12 indicates that the InfiniBand HCA has given up after attempting to send the packet after several tries. This can be caused by either loose or faulty cables. Please check the InfiniBand connectivity of your cluster." This is a hardware problem. You should send an email to the support. Sébastien On 01/10/12 10:08 PM, Louis Letourneau wrote: > (./pm/pmiserv/pmip_cb.c:906): assert (!closed) failed ------------------------------------------------------------------------------ Don't let slow site performance ruin your business. Deploy New Relic APM Deploy New Relic app performance management and know exactly what is happening inside your Ruby, Python, PHP, Java, and .NET app Try New Relic at no cost today and get our sweet Data Nerd shirt too! http://p.sf.net/sfu/newrelic-dev2dev _______________________________________________ Denovoassembler-users mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/denovoassembler-users
