Le 03/09/2010 15:38, George Bosilca a écrit :
> Jeff,
>
> I think you will have to revert this patch as the btl_bandwidth __IS__ 
> supposed to be in Mbs and not MBs. We usually talk about networks in Mbs 
> (there is a pattern in Ethernet 1G/10G, Myricom 10G). In addition the 
> original design of the multi-rail was based on this assumption, and the 
> multi-rail handling code deal with these values (at that level I don't think 
> it really matters, but at least it needs consistent values from all BTLs).
>
> However, going over the existing BTLs I can see that some BTLs do not 
> correctly set this value:
>
> BTL     Bandwidth        Auto-detect     Status
> Elan    2000                NO           Correct
>   

2000 looks strange to me. Last time I played with Elan4, bandwidth was
900MB/s or so.

> GM      250                 NO           Doubtful
> MX      2000/10000          YES (Mbs)    Correct (before the patch)
> OFUD    800                 NO           Doubtful
> OpenIB  2000/4000/8000      YES (Mbs)    Correct (multiplied by the 
> active_width)
>   

I found the problem when using both MX and OpenIB at the same time, so
they can't be both wrong or both correct. IB was reporting 800, not
2000/4000/8000. Maybe because auto-detect didn't work and the default is
wrong:
btl_openib_mca.c:527:    mca_btl_openib_module.super.btl_bandwidth = 800;

Brice

Reply via email to