[lustre-discuss] Installing lustre 2.15.6 server on rhel-8.10 fails
Carlos Adean
carlosadean at linea.org.br
Wed Apr 23 18:06:53 PDT 2025
Hello Martin,
Thank you for the hint.
I tried rebuilding using the suggested parameter, but the warnings persist.
Additionally, the system still fails to boot using the lustre kernel.
We noticed that the initramfs image for the Lustre kernel does not include
the megaraid_sas module, which the system needs to drive the Dell PERC H330
controller. This may be the cause of the boot failure.
[root@mds2 ~]# lsinitrd /boot/initramfs-4.18.0-553.27.1.el8_lustre.x86_64.img | grep megaraid_sas
[root@mds2 ~]#
However, the module is present in the image of the stock kernel installed via dnf.
[root@mds2 ~]# lsinitrd /boot/initramfs-4.18.0-553.27.1.el8_10.x86_64.img | grep megaraid_sas
-rw-r--r-- 1 root root 72560 Jan 15 2024 usr/lib/modules/4.18.0-553.27.1.el8_10.x86_64/kernel/drivers/scsi/megaraid/megaraid_sas.ko.xz
[root@mds2 ~]#
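
The two checks above can be wrapped in small helpers. This is only a sketch: the function names `initramfs_has_module` and `module_on_disk` are made up for illustration, and it has not been established in this thread that the Lustre kernel packages actually ship megaraid_sas.

```shell
#!/bin/sh
# Hypothetical helpers (names are illustrative, not part of any tool).

# Succeeds if lsinitrd-style output on stdin lists the given module.
# Matches .ko as well as compressed variants such as .ko.xz.
initramfs_has_module() {
    grep -Eq "/$1\.ko(\.[a-z0-9]+)?\$"
}

# Succeeds if the module file exists on disk for a kernel version.
# $1 = modules root (normally /lib/modules), $2 = kernel version, $3 = module
module_on_disk() {
    find "$1/$2" -name "$3.ko*" -print 2>/dev/null | grep -q .
}

# Possible usage, assuming the driver really is installed for that kernel:
#   kver=4.18.0-553.27.1.el8_lustre.x86_64
#   if ! lsinitrd "/boot/initramfs-$kver.img" | initramfs_has_module megaraid_sas &&
#      module_on_disk /lib/modules "$kver" megaraid_sas; then
#       dracut --force --add-drivers megaraid_sas "/boot/initramfs-$kver.img" "$kver"
#   fi
```

dracut can only add a driver that exists under /lib/modules/<kver>, so if `module_on_disk` fails, the module is probably missing from the installed kernel packages rather than just from the initramfs.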
I'm still here struggling to install it.
---
*Carlos Adean*
www.linea.org.br
On Wed, Apr 23, 2025 at 09:22, Audet, Martin <
Martin.Audet at cnrc-nrc.gc.ca> wrote:
> Hello,
>
>
> I think I had a similar problem a long time ago, and it was solved by
> adding the "--kmp" option to the "mlnx_add_kernel_support.sh" script when
> compiling the MOFED RPMs. Without this option, the MOFED RPM compilation
> completes without problems, and so does the Lustre RPM compilation, but
> later, when installing the Lustre RPMs, we get a bunch of problems related
> to symbols.
>
>
> Here is how I compile the MOFED RPMs (using the root account):
>
>
> # mount_dir is the temporary mount directory
> # ofed_iso is the MOFED .iso file
> #
> mkdir -p -- "$mount_dir"
> mount -o ro,loop "$ofed_iso" "$mount_dir"
> "$mount_dir"/mlnx_add_kernel_support.sh -y --make-tgz --kmp -k "$(uname -r)" -m "$mount_dir"
> #
> # The compiled RPMs are now under /tmp
> # ex: /tmp/MLNX_OFED_LINUX-24.10-2.1.8.0-rhel8.10.x86_64-ext.tgz
>
>
> It seems that the pre-compiled RPMs distributed by Mellanox/NVIDIA are
> always generated using the --kmp option, but when using
> mlnx_add_kernel_support.sh, this option must be specified explicitly. In
> addition, it seems that with the newer DOCA OFED, the script equivalent to
> mlnx_add_kernel_support.sh always adds the --kmp option on RHEL and
> similar distributions.
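
One way to sanity-check a rebuilt bundle is to look for KMP-style packages in it. The sketch below assumes that KMP builds follow the usual RHEL `kmod-` package naming, which this thread does not confirm for MOFED specifically. The helper name `listing_has_kmod_rpms` is made up for illustration.

```shell
#!/bin/sh
# Hypothetical helper: succeeds if a file listing on stdin (for example
# from `tar -tzf`) contains at least one RPM whose name starts with "kmod-".
# Assumption: KMP-style packages use the usual RHEL "kmod-" prefix.
listing_has_kmod_rpms() {
    grep -Eq '(^|/)kmod-[^/]+\.rpm$'
}

# Possible usage with the example archive name from above:
#   tar -tzf /tmp/MLNX_OFED_LINUX-24.10-2.1.8.0-rhel8.10.x86_64-ext.tgz |
#       listing_has_kmod_rpms && echo "KMP-style RPMs found"
```

If no `kmod-` RPMs are listed, the bundle was possibly built without --kmp, but check the actual package names before drawing a conclusion.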
>
>
> I hope it helps,
>
>
> Martin
> ------------------------------
> *From:* lustre-discuss <lustre-discuss-bounces at lists.lustre.org> on
> behalf of Carlos Adean via lustre-discuss <lustre-discuss at lists.lustre.org>
> *Sent:* April 22, 2025 11:09 PM
> *To:* lustre-discuss at lists.lustre.org
> *Cc:* Eloir Troyack
> *Subject:* EXT: [lustre-discuss] Installing lustre 2.15.6 server on
> rhel-8.10 fails
>
> Hello all,
>
> My current version of RHEL 8 is Rocky Linux 8.10, running the kernel
> 4.18.0-553.27.1.el8_10. I also have the OFED drivers version 24.10-2.1.8.0
> installed for the InfiniBand interface (I tried without OFED before).
>
> The installation of "kmod-lustre-2.15.6-1.el8" and
> "kmod-lustre-osd-ldiskfs-2.15.6-1" always shows the warning messages
> below.
>
> # dnf --nogpgcheck --enablerepo=lustre-server install kmod-lustre kmod-lustre-osd-ldiskfs lustre-osd-ldiskfs-mount lustre lustre-resource-agents
> [...]
> depmod: WARNING: /lib/modules/4.18.0-553.27.1.el8_lustre.x86_64/extra/lustre/net/ko2iblnd.ko needs unknown symbol __ib_alloc_pd
> depmod: WARNING: /lib/modules/4.18.0-553.27.1.el8_lustre.x86_64/extra/lustre/net/ko2iblnd.ko needs unknown symbol rdma_resolve_addr
> depmod: WARNING: /lib/modules/4.18.0-553.27.1.el8_lustre.x86_64/extra/lustre/net/ko2iblnd.ko needs unknown symbol ib_dereg_mr_user
> depmod: WARNING: /lib/modules/4.18.0-553.27.1.el8_lustre.x86_64/extra/lustre/net/ko2iblnd.ko needs unknown symbol rdma_reject
> depmod: WARNING: /lib/modules/4.18.0-553.27.1.el8_lustre.x86_64/extra/lustre/net/ko2iblnd.ko needs unknown symbol rdma_disconnect
> depmod: WARNING: /lib/modules/4.18.0-553.27.1.el8_lustre.x86_64/extra/lustre/net/ko2iblnd.ko needs unknown symbol __rdma_create_kernel_id
> depmod: WARNING: /lib/modules/4.18.0-553.27.1.el8_lustre.x86_64/extra/lustre/net/ko2iblnd.ko needs unknown symbol ib_register_event_handler
> depmod: WARNING: /lib/modules/4.18.0-553.27.1.el8_lustre.x86_64/extra/lustre/net/ko2iblnd.ko needs unknown symbol rdma_resolve_route
> depmod: WARNING: /lib/modules/4.18.0-553.27.1.el8_lustre.x86_64/extra/lustre/net/ko2iblnd.ko needs unknown symbol ib_unregister_event_handler
> depmod: WARNING: /lib/modules/4.18.0-553.27.1.el8_lustre.x86_64/extra/lustre/net/ko2iblnd.ko needs unknown symbol rdma_bind_addr
> depmod: WARNING: /lib/modules/4.18.0-553.27.1.el8_lustre.x86_64/extra/lustre/net/ko2iblnd.ko needs unknown symbol rdma_create_qp
> depmod: WARNING: /lib/modules/4.18.0-553.27.1.el8_lustre.x86_64/extra/lustre/net/ko2iblnd.ko needs unknown symbol ib_map_mr_sg
> depmod: WARNING: /lib/modules/4.18.0-553.27.1.el8_lustre.x86_64/extra/lustre/net/ko2iblnd.ko needs unknown symbol ib_query_port
> depmod: WARNING: /lib/modules/4.18.0-553.27.1.el8_lustre.x86_64/extra/lustre/net/ko2iblnd.ko needs unknown symbol rdma_notify
> depmod: WARNING: /lib/modules/4.18.0-553.27.1.el8_lustre.x86_64/extra/lustre/net/ko2iblnd.ko needs unknown symbol rdma_listen
> depmod: WARNING: /lib/modules/4.18.0-553.27.1.el8_lustre.x86_64/extra/lustre/net/ko2iblnd.ko needs unknown symbol rdma_destroy_qp
> depmod: WARNING: /lib/modules/4.18.0-553.27.1.el8_lustre.x86_64/extra/lustre/net/ko2iblnd.ko needs unknown symbol __ib_create_cq
> depmod: WARNING: /lib/modules/4.18.0-553.27.1.el8_lustre.x86_64/extra/lustre/net/ko2iblnd.ko needs unknown symbol ib_alloc_mr
> depmod: WARNING: /lib/modules/4.18.0-553.27.1.el8_lustre.x86_64/extra/lustre/net/ko2iblnd.ko needs unknown symbol rdma_connect_locked
> depmod: WARNING: /lib/modules/4.18.0-553.27.1.el8_lustre.x86_64/extra/lustre/net/ko2iblnd.ko needs unknown symbol rdma_set_reuseaddr
> depmod: WARNING: /lib/modules/4.18.0-553.27.1.el8_lustre.x86_64/extra/lustre/net/ko2iblnd.ko needs unknown symbol ib_destroy_cq_user
> depmod: WARNING: /lib/modules/4.18.0-553.27.1.el8_lustre.x86_64/extra/lustre/net/ko2iblnd.ko needs unknown symbol ib_modify_qp
> depmod: WARNING: /lib/modules/4.18.0-553.27.1.el8_lustre.x86_64/extra/lustre/net/ko2iblnd.ko needs unknown symbol ib_dma_virt_map_sg
> depmod: WARNING: /lib/modules/4.18.0-553.27.1.el8_lustre.x86_64/extra/lustre/net/ko2iblnd.ko needs unknown symbol rdma_destroy_id
> depmod: WARNING: /lib/modules/4.18.0-553.27.1.el8_lustre.x86_64/extra/lustre/net/ko2iblnd.ko needs unknown symbol rdma_accept
> depmod: WARNING: /lib/modules/4.18.0-553.27.1.el8_lustre.x86_64/extra/lustre/net/ko2iblnd.ko needs unknown symbol ib_dealloc_pd_user
> [...]
> Installed:
> kernel-core-4.18.0-553.27.1.el8_lustre.x86_64
> kmod-lustre-2.15.6-1.el8.x86_64
> kmod-lustre-osd-ldiskfs-2.15.6-1.el8.x86_64
> lustre-2.15.6-1.el8.x86_64
> lustre-osd-ldiskfs-mount-2.15.6-1.el8.x86_64
> lustre-resource-agents-2.15.6-1.el8.x86_64
>
> Completed!
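
The depmod warnings above can be condensed into a list of missing symbols and checked against the modules.symbols file that depmod writes for each kernel. This is only a sketch: the function names `depmod_unknown_symbols` and `symbol_provider` are made up for illustration, and it assumes each warning arrives on a single line, as depmod prints it.

```shell
#!/bin/sh
# Hypothetical helpers (names are illustrative, not part of kmod).

# Print unique "module symbol" pairs from depmod warnings read on stdin.
# Expected line shape: depmod: WARNING: <path> needs unknown symbol <sym>
depmod_unknown_symbols() {
    awk '/needs unknown symbol/ {
        n = split($3, parts, "/")   # $3 is the module path
        print parts[n], $NF         # module file name, then symbol
    }' | sort -u
}

# Print the module that exports a symbol according to a modules.symbols
# file (lines look like "alias symbol:<name> <module>"); fails if none does.
# $1 = path to modules.symbols, $2 = symbol name
symbol_provider() {
    awk -v want="symbol:$2" '
        $1 == "alias" && $2 == want { print $3; found = 1 }
        END { exit !found }' "$1"
}

# Possible usage:
#   kver=4.18.0-553.27.1.el8_lustre.x86_64
#   depmod -a "$kver" 2>&1 | depmod_unknown_symbols
#   symbol_provider "/lib/modules/$kver/modules.symbols" rdma_accept
```

If `symbol_provider` finds no module for these symbols under the Lustre kernel, the RDMA stack that ko2iblnd.ko was built against is probably not installed for that kernel.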
>
>
> After rebooting, the server drops into an emergency shell because it can't
> find the LVM devices. This issue only occurs with the Lustre kernel; the
> other installed kernels boot normally.
>
>
> Any hints on how to proceed?
>
>
> ---
> *Carlos Adean*
> www.linea.org.br
>