[lustre-discuss] Question About Mellanox-RDMA On Lustre

Brad Merchant bmerchant at cambridgecomputer.com
Tue Jun 6 17:29:05 PDT 2023


It looks like your mkfs commands are not quite complete. The "failnode"
option is deprecated and specifying only one will make that target only use
that IP address. So it's timing out because it doesn't have an LNET address
available to contact the MGS. It would likely work if you tried to mount
from the other server.

Assuming some topology here so it might not be completely accurate, but I
think the mkfs commands you want are:


mkfs.lustre --fsname=ltfs1 --mgs --mdt --index=0
--servicenode=192.168.19.14 at o2ib1 --servicenode=192.168.19.15 at o2ib1
 --reformat --mkfsoptions "-E stride=32"
/dev/disk/by-id/scsi-3600b3420371420b645dde4066c0000a8

mkfs.lustre --fsname=ltfs1  --mdt --index=1 --mgsnode=192.168.19.14 at o2ib1
--mgsnode=192.168.19.15 at o2ib1 --servicenode=192.168.19.14 at o2ib1
--servicenode=192.168.19.15 at o2ib1  --reformat  --mkfsoptions "-E stride=32"
/dev/disk/by-id/scsi-3600b3420371420b645dde5093e0000a9

mkfs.lustre --fsname=ltfs1  --mdt --index=2 --mgsnode=192.168.19.14 at o2ib1
--mgsnode=192.168.19.15 at o2ib1 --servicenode=192.168.19.15 at o2ib1
--servicenode=192.168.19.14 at o2ib1  --reformat  --mkfsoptions "-E stride=32"
/dev/disk/by-id/scsi-3600b3420371420b645dde619060000aa

mkfs.lustre --fsname=ltfs1  --mdt --index=3 --mgsnode=192.168.19.14 at o2ib1
--mgsnode=192.168.19.15 at o2ib1 --servicenode=192.168.19.15 at o2ib1
--servicenode=192.168.19.14 at o2ib1  --reformat  --mkfsoptions "-E stride=32"
/dev/disk/by-id/scsi-3600b3420371420b645dde7367f0000ab
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.lustre.org/pipermail/lustre-discuss-lustre.org/attachments/20230606/41fa6c5e/attachment.htm>


More information about the lustre-discuss mailing list