<!DOCTYPE html>
<html>
<head>
<meta http-equiv="content-type" content="text/html; charset=UTF-8">
</head>
<body>
Hi,<br>
<br>
After updating to Lustre 2.15.4 I've had trouble mounting over TCP.
Mounting over InfiniBand works fine, but over TCP the mount just hangs,
with no errors on the client or the servers.<br>
<br>
The OS is Rocky 9.2 on the client and CentOS 7.9 on the servers, which run 2.12.9.<br>
<br>
Rocky 9.2 + 2.15.3 works, but both Rocky 9.2 and 9.3 with 2.15.4
hang.<br>
<br>
Anyone having the same issue?<br>
<br>
A few notes about our system:<br>
<br>
- It's ZFS based.<br>
- It was created back in 2015. The MGS and MDTs have survived since
then (zfs send/receive), while new OSTs have been added over time and
old ones have been taken out.<br>
- There are 2 filesystems on an MDS pair, with one MDT on each MDS. Both
filesystems have the hanging problem.<br>
- Dual network stack with InfiniBand and TCP. For historical reasons
we are using tcp1 and not the default tcp0. No routers.<br>
<br>
I'll dive into getting more debugging info out. Any pointers on how
to do this efficiently would be much appreciated.<br>
<br>
Cheers,<br>
Hans Henrik<br>
<br>
<br>
</body>
</html>