Hi~
My software environment is Lustre 2.15.8. The client uses ConnectX‑5 network adapters, and the server uses E810 network adapters. The switches are configured for RoCE, with the priority set to 3 only. Under normal conditions, all service traffic goes through queue 3, as shown in the figure below. However, after a port flap (down/up event) on a client network interface, this interface will use both queue 0 and queue 3, while other interfaces remain normal, as shown in the figure below. At the same time, a strange issue occurs on the Lustre client: it incorrectly adds the flapping network interface to the peer list. Additionally, running lustre_rmmod hangs because there are still busy services in use. Is this a Lustre‑related issue? If so, how should I resolve it? Please provide me with some troubleshooting ideas. so,How should I locate the problem? Thanks!
_______________________________________________ lustre-discuss mailing list [email protected] http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org
