Hi Jon,

This is probably result of LU-6735, where special tuning parameters were added for truscale IB adapters. The problem is these parameters are incompatible with mixed (truescale+mellanox) IB networks.

Lustre 2.8 includes script "ko2iblnd-probe" that will probe IB adapter interface and apply tuning parameters dependent on IB adapter model.

One solution is to use o2iblnd defaults. This can be accomplished by commenting out line that calls the script in ko2ilnd modprobe config ie) file /etc/modprobe.d/ko2iblnd.conf remove line:

install ko2iblnd /usr/sbin/ko2iblnd-probe

If you want to pursue tuning options, you may wish to look at LUDOC-267 where I listed the (few) presentations about truescale and lustre. Also LU-3222 is a good reference for mellanox tuning parameters.


regards,
chris hunter
chunt...@gmail.com


Hi,

I have brought up a test system using

2.8.0-3.10.0_327.3.1.el7.x86_64_g96792ba

I can mount the system over tcp, but when I try to do so over infiniband
i get errors of the type:

Can't accept conn from 10.0.51.1@o2ib, queue depth too large: 128 (<=8
wanted)

Can't accept conn from 10.0.51.1@o2ib (version 12): max_frags 32
incompatible without FMR pool (256 wanted)

After searching I suspected it had something to do with the fact that we
have mellanox (mlx4_ib) on the server and qlogic on the client (ib_qib).

Also found a possible solution, by putting

options ko2iblnd peer_credits=124 concurrent_sends=62 map_on_demand=256

However, there are a bunch of options to ko2iblnd, and to me it is not
obvious which values to chose. Is there a specific strategy one should
follow?

Regards,

/jon

_______________________________________________
lustre-discuss mailing list
lustre-discuss@lists.lustre.org
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org

Reply via email to