[lustre-discuss] DKMS build broken with NVIDIA doca packages

Mark Dixon mark.c.dixon at durham.ac.uk
Wed Jan 21 04:23:56 PST 2026


Hi Jon,

As it happens, I've been looking at the same thing. I hadn't spotted 
LU-18002 (thanks), but unfortunately it isn't enough to accommodate the 
move to dkms on rhel.

I don't know how far you've got since Monday, but there now seems a need 
for an explicit check of /usr/src/ofa_kernel (as it's no longer owned by a 
package) and the "find" for rdma_cm.h needs the -L flag to make sense of 
the new maze of twisty passages.

I think that a new jira ticket needs to be opened...

Cheers,

Mark


On Mon, 19 Jan 2026, Jon Marshall via lustre-discuss wrote:

> [EXTERNAL EMAIL]
> Hi,
>
> I'm in the process of rebuilding lustre on Rocky 8.10 and have noticed that NVIDIA have been messing around with their packages again, now rebranding everything under the doca label. For LTS purposes we're sticking with 2.15.8 for lustre, and I'm trying to get this to build with NVIDIA DOCA 3.2.1 LTS.
>
> The trouble is, it seems they have rename the package mlnx-ofa_kernel-devel to mlnx-ofa_kernel-dkms. Looking at the DKMS configure script, it is searching for:
>                        O2IBPKG="mlnx-ofed-kernel-dkms"
>                        O2IBPKG+="|mlnx-ofed-kernel-modules"
>                        O2IBPKG+="|mlnx-ofa_kernel-devel"
>                        O2IBPKG+="|compat-rdma-devel"
>                        O2IBPKG+="|kernel-ib-devel"
>                        O2IBPKG+="|ofa_kernel-devel"
>
> And hence it can't find the package (underscore instead of hyphen), which causes the build to fail.
>
> Digging around the JIRA, I found this<https://jira.whamcloud.com/browse/LU-18002?jql=text%20~%20dkms%20ORDER%20BY%20created%20DESC> issue, but it looks to only have been fixed in 2.16, which we've sort of ruled out at this stage. Looking at the actual patch<https://review.whamcloud.com/c/fs/lustre-release/+/55625/4/lnet/autoconf/lustre-lnet.m4>, it seems pretty minor and I was wondering if this could be back ported to 2.15 as well.
>
> I can work around by building things myself, but I was hoping to be able to yum install the packages direct from the whamcloud repos, as this greatly simplifies my rollout.
>
> Cheers
> Jon
>
>
> Jon Marshall
>
> High Performance Computing Specialist
>
>
>
> IT and Scientific Computing Team
>
>
>
> Cancer Research UK Cambridge Institute
>
> Li Ka Shing Centre | Robinson Way | Cambridge | CB2 0RE
>
> Web<http://www.cruk.cam.ac.uk/> | Facebook<http://www.facebook.com/cancerresearchuk> | Twitter<http://twitter.com/CR_UK>
>
>
>
> [Description: CRI Logo]<http://www.cruk.cam.ac.uk/>
>
>


More information about the lustre-discuss mailing list