[lustre-discuss] No port 988?

Andreas Dilger adilger at whamcloud.com
Tue Sep 26 22:54:00 PDT 2023


On Sep 26, 2023, at 06:12, Jan Andersen <jan at comind.io> wrote:

Hi,

I've built and installed Lustre on two VirtualBox VMs running Rocky 8.8 and formatted one as the MGS/MDS and the other as the OSS, following a presentation from Oak Ridge National Laboratory: "Creating a Lustre Test System from Source with Virtual Machines" (sorry, no link; I downloaded it a while ago).

There are a number of such resources linked from the https://wiki.lustre.org/ front page.

I can mount the filesystems on the MDS, but when I try to mount on the OSS, it just times out. From dmesg:

[root@oss1 log]# dmesg | grep -i lustre
[  564.028680] Lustre: Lustre: Build Version: 2.15.58_42_ga54a206
[  625.567672] LustreError: 15f-b: lustre-OST0000: cannot register this server with the MGS: rc = -110. Is the MGS running?
[  625.567767] LustreError: 1789:0:(tgt_mount.c:2216:server_fill_super()) Unable to start targets: -110
[  625.567851] LustreError: 1789:0:(tgt_mount.c:1752:server_put_super()) no obd lustre-OST0000
[  625.567894] LustreError: 1789:0:(tgt_mount.c:132:server_deregister_mount()) lustre-OST0000 not registered
[  625.588244] Lustre: server umount lustre-OST0000 complete
[  625.588251] LustreError: 1789:0:(tgt_mount.c:2365:lustre_tgt_fill_super()) Unable to mount  (-110)

Both 'nmap' and 'netstat -nap' show that there is nothing listening on port 988:

[root@mds ~]# netstat -nap | grep -i listen
tcp        0      0 0.0.0.0:111             0.0.0.0:* LISTEN      1/systemd
tcp        0      0 0.0.0.0:22              0.0.0.0:* LISTEN      806/sshd
tcp6       0      0 :::111                  :::* LISTEN      1/systemd
tcp6       0      0 :::22                   :::* LISTEN      806/sshd

What should be listening on 988?

The MGS should be listening on port 988, running on the "mgsnode" that was specified at format time for the OSTs and MDTs.
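
As a rough way to confirm this from the OSS side, something like the following could be used to test whether TCP port 988 on the MGS node accepts connections. This is only a sketch: `check_tcp_port` is a hypothetical helper (not a Lustre tool), the hostname `mds` is assumed from the shell prompt above, and it relies on bash's `/dev/tcp` redirection and the coreutils `timeout` command being available.

```shell
#!/usr/bin/env bash
# Sketch only: check_tcp_port is a hypothetical helper, not part of Lustre.
# Returns 0 if a TCP connection to host:port succeeds within the time limit,
# non-zero if the connection is refused or times out.
check_tcp_port() {
    local host="$1" port="$2" wait_secs="${3:-3}"
    # bash treats /dev/tcp/HOST/PORT as a TCP connection; timeout bounds the wait
    timeout "$wait_secs" bash -c "exec 3<>/dev/tcp/${host}/${port}" 2>/dev/null
}

# Example, run from the OSS ("mds" is the assumed hostname of the MGS node):
#   check_tcp_port mds 988 && echo "port 988 reachable" || echo "port 988 not reachable"
```

A failure here would be consistent with the netstat output above (nothing listening on 988), but it could also point to a firewall between the two VMs, so it only narrows the problem down.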

It is possible to have the MGS and MDS share the same storage device for simple configurations, but in production they are usually running on separate devices so they can be started/stopped independently, even if they are running on the same server.

Cheers, Andreas
--
Andreas Dilger
Lustre Principal Architect
Whamcloud






