[lustre-discuss] frequent Connection lost, Connection restored to mdt

David Cohen cdavid at physics.technion.ac.il
Sun Dec 22 08:06:38 PST 2019


Hi,
We are running 2.10.5 on the servers and 2.10.8 on the clients.
Every few minutes, we see:

On client side:

Dec 22 15:26:34 gftp kernel: Lustre:
439834:0:(client.c:2116:ptlrpc_expire_one_request()) @@@ Request sent has
timed out for slow reply: [sent 1577021187/real 1577021187]
 req at ffff88160be9c6c0 x1653620348981536/t0(0)
o36->lustre-MDT0000-mdc-ffff8817d9776c00 at 10.0.0.1@tcp:12/10 lens 608/4768 e
0 to 1 dl 1577021194 ref 2 fl Rpc:X/0/ffffffff rc 0/-1
Dec 22 15:26:34 gftp kernel: Lustre:
439834:0:(client.c:2116:ptlrpc_expire_one_request()) Skipped 3 previous
similar messages
Dec 22 15:26:34 gftp kernel: Lustre: lustre-MDT0000-mdc-ffff8817d9776c00:
Connection to lustre-MDT0000 (at 10.0.0.1 at tcp) was lost; in progress
operations using this service will wait for recovery to complete
Dec 22 15:26:34 gftp kernel: Lustre: Skipped 3 previous similar messages
Dec 22 15:26:34 gftp kernel: Lustre: lustre-MDT0000-mdc-ffff8817d9776c00:
Connection restored to 10.0.0.1 at tcp (at 192.114.101.153 at tcp)
Dec 22 15:26:34 gftp kernel: Lustre: Skipped 3 previous similar messages

On server side:

Dec 22 15:26:34 oss03 kernel: Lustre: lustre-MDT0000: Client
38d6eef1-e146-be41-bab9-409b272d0d4f (at 10.0.0.10 at tcp) reconnecting
Dec 22 15:26:34 oss03 kernel: Lustre: lustre-MDT0000: Connection restored
to ec2cdfce-353f-583a-c970-fde3f5d5189c (at 10.0.0.10 at tcp)
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.lustre.org/pipermail/lustre-discuss-lustre.org/attachments/20191222/433bb5ad/attachment.html>


More information about the lustre-discuss mailing list