Ummm...the "ml" stands for Mellanox. This is a component you folks
contributed at some time. IIRC, the hcoll and/or bcol are meant to replace
it, but you folks would know best what to do with it.



On Tue, Mar 4, 2014 at 12:12 AM, Elena Elkina <elena.elk...@itseez.com>wrote:

> Hi,
>
> Recently I often meet hangs and seg faults with different command lines
> and there are "ml" functions in the stack trace.
> When I just turn "ml" off by do -mca coll ^ml, problems disappear.
> For example,
> oshrun -np 4 --map-by node --display-map  ./ring_oshmem
> fails with seg fault while
> oshrun -np 4 --map-by node --display-map -mca coll ^ml ./ring_oshmem
> passes.
>
> The "ml" priority is low (27), but it could have issues during comm_query
> (it does all initialization staff there).
>
> "Ml" is unreliable component. So It may be reasonable do not to build this
> component by default to avoid such problems.
>
> What do you think?
>
> Best regards,
> Elena
>
> _______________________________________________
> devel mailing list
> de...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> Searchable archives:
> http://www.open-mpi.org/community/lists/devel/2014/03/date.php
>

Reply via email to