[Lustre-discuss] no handle for file close and ptlrpc_import_delay_req
Wojciech Turek
wjt27 at cam.ac.uk
Thu Nov 1 05:35:11 PDT 2007
Dear All,
I am seeing the following errors on the MDS:
Nov 1 12:14:13 mds01 kernel: LustreError: 17076:0:(mds_open.c:
1474:mds_close()) Skipped 139 previous similar messages
Nov 1 12:14:27 mds01 kernel: LustreError: 17088:0:(mds_open.c:
1474:mds_close()) @@@ no handle for file close ino 26997837: cookie
0x47a2a9d95b67cfb6 req at 0000010094367c00 x32950/t0 o35->451db1a1-8c58-
f825-eaab-a1dd24586e93 at NET_0x200000a8f092e_UUID:-1 lens 296/560 ref 0
fl Interpret:/0/0 rc 0/0
Nov 1 12:14:32 mds01 kernel: LustreError: 17089:0:(mds_open.c:
1474:mds_close()) @@@ no handle for file close ino 28697676: cookie
0x47a2a9d964ee6f0e req at 000001009181d400 x28502/t0 o35->e838fcbc-4b8c-
f448-a5d2-5e472e474229 at NET_0x200000a8f060b_UUID:-1 lens 296/560 ref 0
fl Interpret:/0/0 rc 0/0
Nov 1 12:14:32 mds01 kernel: LustreError: 17089:0:(mds_open.c:
1474:mds_close()) Skipped 4 previous similar messages
Nov 1 12:14:47 mds01 kernel: LustreError: 17061:0:(mds_open.c:
1474:mds_close()) @@@ no handle for file close ino 28697676: cookie
0x47a2a9d964ee7ade req at 00000100cd26ec00 x113640/t0 o35->d774ea81-
c2ee-01a7-0d28-87054e92c858 at NET_0x200000a8f0510_UUID:-1 lens 296/560
ref 0 fl Interpret:/0/0 rc 0/0
Nov 1 12:14:47 mds01 kernel: LustreError: 17061:0:(mds_open.c:
1474:mds_close()) Skipped 2 previous similar messages
Nov 1 12:15:47 mds01 kernel: Lustre: ddn-home-MDT0000: haven't heard
from client 9e6c2d9a-1649-3c61-0fda-b5052af0e09f (at 10.143.5.11 at tcp)
in 227 seconds. I think it's dead, and I am evicting it.
Nov 1 12:15:47 mds01 kernel: Lustre: Skipped 33 previous similar
messages
Nov 1 12:15:55 mds01 kernel: LustreError: 17076:0:
(mds_open.c:1474:mds_close()) @@@ no handle for file close ino
25301776: cookie 0x47a2a9d95b4c6238 req at 00000100cd217c00 x211014/t0
o35->31eec1e1-1f7d-a43b-ed8c-9841a694da28 at NET_0x200000a8f0421_UUID:-1
lens 296/560 ref 0 fl Interpret:/0/0 rc 0/0
Nov 1 12:15:55 mds01 kernel: LustreError: 17076:0:(mds_open.c:
1474:mds_close()) Skipped 24 previous similar messages
Nov 1 12:15:55 mds01 kernel: LustreError: 17076:0:(ldlm_lib.c:
1437:target_send_reply_msg()) @@@ processing error (-116)
req at 00000100cd217c00 x211014/t0 o35->31eec1e1-1f7d-a43b-
ed8c-9841a694da28 at NET_0x200000a8f0421_UUID:-1 lens 296/560 ref 0 fl
Interpret:/0/0 rc -116/0
Nov 1 12:15:55 mds01 kernel: LustreError: 17076:0:(ldlm_lib.c:
1437:target_send_reply_msg()) Skipped 39 previous similar messages
Nov 1 12:17:00 mds01 kernel: LustreError: 16649:0:
(mds_open.c:1474:mds_close()) @@@ no handle for file close ino
28968880: cookie 0x47a2a9d95bcda4b2 req at 00000100c26b4800 x58156/t0
o35->1ed2c692-aea1-c9b8-ee90-6e8d4269cda8 at NET_0x200000a8f0503_UUID:-1
lens 296/560 ref 0 fl Interpret:/0/0 rc 0/0
Nov 1 12:17:00 mds01 kernel: LustreError: 16649:0:(mds_open.c:
1474:mds_close()) Skipped 2 previous similar messages
Nov 1 12:18:00 mds01 kernel: LustreError: 17066:0:(mds_open.c:
1474:mds_close()) @@@ no handle for file close ino 28968880: cookie
0x47a2a9d95bcd8dd6 req at 00000100ccf76a00 x42691/t0 o35->0a5afd52-
cffe-0bff-9c69-e6027f201f5f at NET_0x200000a8f0424_UUID:-1 lens 296/560
ref 0 fl Interpret:/0/0 rc 0/0
Nov 1 12:18:00 mds01 kernel: LustreError: 17066:0:(mds_open.c:
1474:mds_close()) Skipped 1 previous similar message
and the OSSs are showing the following errors:
Nov 1 12:21:35 storage10.beowulf.cluster kernel: LustreError:
23337:0:(client.c:519:ptlrpc_import_delay_req()) @@@ IMP_INVALID
req at 00000100b0211000 x434527/t0 o101->MGS at MGC10.143.245.201@tcp_0:26
lens 232/240 ref 1 fl Rpc:/0/0 rc 0/0
Nov 1 12:21:35 storage10.beowulf.cluster kernel: LustreError:
23337:0:(client.c:519:ptlrpc_import_delay_req()) Skipped 119 previous
similar messages
Nov 1 12:21:59 storage08.beowulf.cluster kernel: LustreError:
22609:0:(client.c:519:ptlrpc_import_delay_req()) @@@ IMP_INVALID
req at 00000100af90a800 x755145/t0 o101->MGS at MGC10.143.245.201@tcp_0:26
lens 232/240 ref 1 fl Rpc:/0/0 rc 0/0
Nov 1 12:21:59 storage08.beowulf.cluster kernel: LustreError:
22609:0:(client.c:519:ptlrpc_import_delay_req()) Skipped 119 previous
similar messages
Nov 1 12:23:31 storage09.beowulf.cluster kernel: LustreError:
23045:0:(client.c:519:ptlrpc_import_delay_req()) @@@ IMP_INVALID
req at 00000100c31fc600 x511984/t0 o101->MGS at MGC10.143.245.201@tcp_0:26
lens 232/240 ref 1 fl Rpc:/0/0 rc 0/0
Nov 1 12:23:31 storage09.beowulf.cluster kernel: LustreError:
23045:0:(client.c:519:ptlrpc_import_delay_req()) Skipped 119 previous
similar messages
Nov 1 12:24:32 storage07.beowulf.cluster kernel: LustreError:
22220:0:(client.c:519:ptlrpc_import_delay_req()) @@@ IMP_INVALID
req at 0000010119e39400 x1064767/t0 o101->MGS at MGC10.143.245.201@tcp_0:26
lens 232/240 ref 1 fl Rpc:/0/0 rc 0/0
Nov 1 12:24:32 storage07.beowulf.cluster kernel: LustreError:
22220:0:(client.c:519:ptlrpc_import_delay_req()) Skipped 119 previous
similar messages
Does anybody have an idea what the reason for these errors could be?
My system consists of 4 OSSs, 24 OSTs, 1 MDS, and 585 clients.
The Lustre version is 1.6.3.
The kernel version on the whole cluster is 2.6.9-55.0.9.EL_lustre.1.6.3smp.
Thanks for your help!
Mr Wojciech Turek
Assistant System Manager
University of Cambridge
High Performance Computing service
email: wjt27 at cam.ac.uk
tel. +441223763517