Hi Jon,

As it happens, I've been looking at the same thing. I hadn't spotted LU-18002 (thanks), but unfortunately it isn't enough to accommodate the move to dkms on rhel.

I don't know how far you've got since Monday, but there now seems a need for an explicit check of /usr/src/ofa_kernel (as it's no longer owned by a package) and the "find" for rdma_cm.h needs the -L flag to make sense of the new maze of twisty passages.

I think that a new jira ticket needs to be opened...

Cheers,

Mark


On Mon, 19 Jan 2026, Jon Marshall via lustre-discuss wrote:

[EXTERNAL EMAIL]
Hi,

I'm in the process of rebuilding lustre on Rocky 8.10 and have noticed that 
NVIDIA have been messing around with their packages again, now rebranding 
everything under the doca label. For LTS purposes we're sticking with 2.15.8 
for lustre, and I'm trying to get this to build with NVIDIA DOCA 3.2.1 LTS.

The trouble is, it seems they have rename the package mlnx-ofa_kernel-devel to 
mlnx-ofa_kernel-dkms. Looking at the DKMS configure script, it is searching for:
                       O2IBPKG="mlnx-ofed-kernel-dkms"
                       O2IBPKG+="|mlnx-ofed-kernel-modules"
                       O2IBPKG+="|mlnx-ofa_kernel-devel"
                       O2IBPKG+="|compat-rdma-devel"
                       O2IBPKG+="|kernel-ib-devel"
                       O2IBPKG+="|ofa_kernel-devel"

And hence it can't find the package (underscore instead of hyphen), which 
causes the build to fail.

Digging around the JIRA, I found 
this<https://jira.whamcloud.com/browse/LU-18002?jql=text%20~%20dkms%20ORDER%20BY%20created%20DESC>
 issue, but it looks to only have been fixed in 2.16, which we've sort of ruled out at this 
stage. Looking at the actual 
patch<https://review.whamcloud.com/c/fs/lustre-release/+/55625/4/lnet/autoconf/lustre-lnet.m4>,
 it seems pretty minor and I was wondering if this could be back ported to 2.15 as well.

I can work around by building things myself, but I was hoping to be able to yum 
install the packages direct from the whamcloud repos, as this greatly 
simplifies my rollout.

Cheers
Jon


Jon Marshall

High Performance Computing Specialist



IT and Scientific Computing Team



Cancer Research UK Cambridge Institute

Li Ka Shing Centre | Robinson Way | Cambridge | CB2 0RE

Web<http://www.cruk.cam.ac.uk/> | Facebook<http://www.facebook.com/cancerresearchuk> 
| Twitter<http://twitter.com/CR_UK>



[Description: CRI Logo]<http://www.cruk.cam.ac.uk/>


_______________________________________________
lustre-discuss mailing list
[email protected]
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org

Reply via email to