[lustre-discuss] No port 988?

Jan Andersen jan at comind.io
Tue Sep 26 09:26:35 PDT 2023


I have come a bit further with this problem - it seems the lnet module 
can't load:

[root at rocky8 lustre-release]# depmod lnet
depmod: ERROR: Bad version passed lnet

I deleted the VMs and reinstalled Rocky 8.8, then built lustre 2.15.3 
and installed it, everything without any error messages. I haven't been 
able to find any indication of what this message means through google, 
but I assume it would mean that the kernel source doesn't match the 
running kernel? But how well must they match? This is my running kernel:

[root at rocky8 lustre]# uname -r
4.18.0-477.10.1.el8_8.x86_64

And this is the kernel source:

[root at rocky8 lustre]# ll /usr/src/kernels
total 4
drwxr-xr-x. 23 root root 4096 Sep 26 12:34 4.18.0-477.27.1.el8_8.x86_64/

IOW, they diverge just after '477.' - is that the problem?

/jan

Hi,

I've built and installed lustre on two VirtualBoxes running Rocky 8.8 
and formatted one as the MGS/MDS and the other as OSS, following a 
presentation from Oak Ridge National Laboratory: "Creating a Lustre Test 
System from Source with Virtual Machines" (sorry, no link; it was a 
while ago I downloaded them).

I can mount the filesystems on the MDS, but when I try from the OSS, it 
just times out - from dmesg:

[root at oss1 log]# dmesg | grep -i lustre
[  564.028680] Lustre: Lustre: Build Version: 2.15.58_42_ga54a206
[  625.567672] LustreError: 15f-b: lustre-OST0000: cannot register this 
server with the MGS: rc = -110. Is the MGS running?
[  625.567767] LustreError: 
1789:0:(tgt_mount.c:2216:server_fill_super()) Unable to start targets: -110
[  625.567851] LustreError: 1789:0:(tgt_mount.c:1752:server_put_super()) 
no obd lustre-OST0000
[  625.567894] LustreError: 
1789:0:(tgt_mount.c:132:server_deregister_mount()) lustre-OST0000 not 
registered
[  625.588244] Lustre: server umount lustre-OST0000 complete
[  625.588251] LustreError: 
1789:0:(tgt_mount.c:2365:lustre_tgt_fill_super()) Unable to mount  (-110)

Both 'nmap' and 'netstat -nap' show that there is nothing listening on 
port 988:

[root at mds ~]# netstat -nap | grep -i listen
tcp        0      0 0.0.0.0:111             0.0.0.0:* LISTEN      1/systemd
tcp        0      0 0.0.0.0:22              0.0.0.0:* LISTEN      806/sshd
tcp6       0      0 :::111                  :::* LISTEN      1/systemd
tcp6       0      0 :::22                   :::* LISTEN      806/sshd

What should be listening on 988?

/jan



More information about the lustre-discuss mailing list