[lustre-discuss] Issue updating lustre from 2.10.6 to 2.10.7

Kurt Strosahl strosahl at jlab.org
Fri Apr 12 11:41:43 PDT 2019


Thanks, that was exactly what I needed to dig out the bad modules!



________________________________
From: Jeff Johnson <jeff.johnson at aeoncomputing.com>
Sent: Friday, April 12, 2019 12:00 PM
To: Kurt Strosahl
Cc: lustre-discuss at lists.lustre.org
Subject: Re: [lustre-discuss] Issue updating lustre from 2.10.6 to 2.10.7

Kurt,

I see you're using dkms. Take a look at `dkms status` and ensure that there are no lingering installs from the previous version. Sometimes the older version doesn't get uninstalled from /lib/modules/`uname -r`/ and the dkms install process for the new version doesn't overwrite them.

--Jeff


On Fri, Apr 12, 2019 at 8:34 AM Kurt Strosahl <strosahl at jlab.org<mailto:strosahl at jlab.org>> wrote:

Good Morning,


   I've encountered an issue updating lustre from 2.10.6 to 2.10.7 on my metadata system.


I installed the updated RPMs using whamcloud's yum repository, but when I run modprobe -v lustre I get the following errors:

insmod /lib/modules/3.10.0-957.1.3.el7_lustre.x86_64/extra/libcfs.ko.xz
insmod /lib/modules/3.10.0-957.1.3.el7_lustre.x86_64/extra/lnet.ko.xz networks=o2ib0(bond0)
insmod /lib/modules/3.10.0-957.1.3.el7_lustre.x86_64/extra/obdclass.ko.xz
insmod /lib/modules/3.10.0-957.1.3.el7_lustre.x86_64/extra/ptlrpc.ko.xz
insmod /lib/modules/3.10.0-957.1.3.el7_lustre.x86_64/extra/fld.ko.xz
modprobe: ERROR: could not insert 'lustre': Invalid argument

dmesg shows the following:

[ 377.981515] fld: disagrees about version of symbol class_export_put
[ 377.981518] fld: Unknown symbol class_export_put (err -22)
[ 377.981529] fld: disagrees about version of symbol req_capsule_server_pack
[ 377.981531] fld: Unknown symbol req_capsule_server_pack (err -22)
[ 377.981537] fld: disagrees about version of symbol req_capsule_set_size
[ 377.981538] fld: Unknown symbol req_capsule_set_size (err -22)
[ 377.981542] fld: disagrees about version of symbol req_capsule_client_get
[ 377.981543] fld: Unknown symbol req_capsule_client_get (err -22)
[ 377.981555] fld: disagrees about version of symbol lu_env_init
[ 377.981556] fld: Unknown symbol lu_env_init (err -22)
[ 377.981562] fld: disagrees about version of symbol ptlrpc_queue_wait
[ 377.981563] fld: Unknown symbol ptlrpc_queue_wait (err -22)
[ 377.981571] fld: disagrees about version of symbol lu_context_key_get
[ 377.981573] fld: Unknown symbol lu_context_key_get (err -22)
[ 377.981604] fld: disagrees about version of symbol lu_env_fini
[ 377.981606] fld: Unknown symbol lu_env_fini (err -22)
[ 377.981611] fld: disagrees about version of symbol lu_context_key_degister
[ 377.981612] fld: Unknown symbol lu_context_key_degister (err -22)
[ 377.981623] fld: disagrees about version of symbol class_exp2cliimp
[ 377.981625] fld: Unknown symbol class_exp2cliimp (err -22)
[ 377.981631] fld: disagrees about version of symbol req_capsule_set
[ 377.981632] fld: Unknown symbol req_capsule_set (err -22)


I've tried uninstalling and reinstalling the RPMs but this is always the state I end up in.


I'm using zfs for the back end, and those modules work after the upgrade


Here are the RPMs currently installed.

perf-3.10.0-957.1.3.el7_lustre.x86_64
lustre-resource-agents-2.10.7-1.el7.x86_64
lustre-osd-zfs-mount-2.10.7-1.el7.x86_64
kernel-headers-3.10.0-957.1.3.el7_lustre.x86_64
lustre-2.10.7-1.el7.x86_64
kernel-3.10.0-957.1.3.el7_lustre.x86_64
perf-debuginfo-3.10.0-957.el7_lustre.x86_64
lustre-zfs-dkms-2.10.7-1.el7.noarch
kernel-debuginfo-common-x86_64-3.10.0-957.el7_lustre.x86_64
kernel-devel-3.10.0-957.1.3.el7_lustre.x86_64
bpftool-3.10.0-957.el7_lustre.x86_64



Thank you,

Kurt J. Strosahl
System Administrator: Lustre, HPC
Scientific Computing Group, Thomas Jefferson National Accelerator Facility

_______________________________________________
lustre-discuss mailing list
lustre-discuss at lists.lustre.org<mailto:lustre-discuss at lists.lustre.org>
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org<https://gcc01.safelinks.protection.outlook.com/?url=http%3A%2F%2Flists.lustre.org%2Flistinfo.cgi%2Flustre-discuss-lustre.org&data=02%7C01%7Cstrosahl%40jlab.org%7Ca0a639bae29b43dcc64f08d6bf5ff962%7Cb4d7ee1f4fb34f0690372b5b522042ab%7C1%7C1%7C636906816276221845&sdata=1hd1cgdHo7g%2BBbX39Wak9xjIeto0%2FB4Vj5rZHJIUO4s%3D&reserved=0>


--
------------------------------
Jeff Johnson
Co-Founder
Aeon Computing

jeff.johnson at aeoncomputing.com<mailto:jeff.johnson at aeoncomputing.com>
www.aeoncomputing.com<https://gcc01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.aeoncomputing.com&data=02%7C01%7Cstrosahl%40jlab.org%7Ca0a639bae29b43dcc64f08d6bf5ff962%7Cb4d7ee1f4fb34f0690372b5b522042ab%7C1%7C1%7C636906816276231849&sdata=BwElgBPP9ehr98Qfm23I0mu3OhJfrrW9DC8QjGEEXnI%3D&reserved=0>
t: 858-412-3810 x1001   f: 858-412-3845
m: 619-204-9061

4170 Morena Boulevard, Suite C - San Diego, CA 92117

High-Performance Computing / Lustre Filesystems / Scale-out Storage
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.lustre.org/pipermail/lustre-discuss-lustre.org/attachments/20190412/b1922eb5/attachment.html>


More information about the lustre-discuss mailing list