[lustre-discuss] 2.15 install failure

Thomas Roth t.roth at gsi.de
Fri Aug 4 12:44:17 PDT 2023


Hi all,

returning to my Lustre installations, the curious failures continue...

- Download of 2.15.3 for el8.8 from Whamcloud
- Installation of a server with rocky 8.8 (I mean, why not,  while it still exists...)
- Want an ldiskfs server, so
 > dnf install lustre lustre-osd-ldiskfs-mount lustre-ldiskfs-dkms
  -->  Fails because the full ext4 source is not present.

I wonder whether I got the workaround from this mailing list, but it should really be in some official documentation or better not necessary at all:

- Rocky 8.8 installs with kernel 4.18.0-477.15.1, so download 'kernel-4.18.0-477.15.1.el8_8.src.rpm'
 > rpm -i ./kernel-4.18.0-477.15.1.el8_8.src.rpm
 > tar xJf rpmbuild/SOURCES/linux-4.18.0-477.15.1.el8_8.tar.xz
 > cp -a linux-4.18.0-477.15.1.el8_8/fs/ext4/* /usr/src/kernels/4.18.0-477.15.1.el8_8.x86_64/fs/ext4/


Of course, at this stage, 'lustre-ldiskfs-dkms' is already installed, so
 > dnf reinstall lustre-ldiskfs-dkms

This plainly prints out that dkms is successfully installing / compiling all the modules, then prints
 >  Running scriptlet: lustre-ldiskfs-dkms-2.15.3-1.el8.noarch 
                   > 2/2
 > Deprecated feature: REMAKE_INITRD (/var/lib/dkms/lustre-ldiskfs/2.15.3/source/dkms.conf)
 > Deprecated feature: REMAKE_INITRD (/var/lib/dkms/lustre-ldiskfs/2.15.3/source/dkms.conf)
 > Module lustre-ldiskfs-2.15.3 for kernel 4.18.0-477.15.1.el8_8.x86_64 (x86_64).
 > Before uninstall, this module version was ACTIVE on this kernel.
 > Removing any linked weak-modules

and the uninstalls all the modules !!!!

Even the /var/lib/dkms/lustre-ldiskfs gets removed, so this machine is clean and pristine, just that dnf/rpm believe that lustre-ldiskfs-dkms is 
already installed. ;-)

(These messages printed between creation and destruction, they do not really indicate any kind of trouble, do they?)


Well. we all know we are dealing with computers and not with deterministic machines, so
 > dnf remove lustre lustre-ldiskfs-dkms lustre-osd-ldiskfs-mount
and
 > dnf install lustre-ldiskfs-dkms

(Drum roll...) Lustre modules get compiled, installed _and_ _not_ removed.


('modprobe lustre' works, 'dnf install lustre lustre-osd-ldiskfs-mount' does not create new havoc)

I'm flabbergasted and really have no idea how I misconfigured a simple, minimal el8.8 installation into this kind of behavior.


Cheers
Thomas


--------------------------
Thomas Roth           IT-HPC-Linux
Location: SB3 2.291   Phone: 1453



More information about the lustre-discuss mailing list