[Lustre-discuss] OST disconnect messages on OSS
Alex Lee
alee at datadirectnet.com
Wed Aug 13 06:39:01 PDT 2008
I have a system that's been spitting out OST disconnect messages under
heavy load. I'm guessing the OST eventually reconnects.
I want to say this happens when the OSS is extremely overloaded, but I
have also seen it happen under light load. Only the OSS seems to log
any error messages; I don't see anything on the client side.
Should I be concerned? Or does this typically happen at other sites too?
-Alex
Log excerpt from one of the OSSes:
Aug 13 17:26:48 lustre-oss-0-1 kernel: LustreError: 137-5: UUID 'lfs-OST0004_UUID' is not available for connect (no target)
Aug 13 17:26:48 lustre-oss-0-1 kernel: LustreError: 11094:0:(ldlm_lib.c:1536:target_send_reply_msg()) @@@ processing error (-19) req@ffff8101f4570600 x54/t0 o8-><?>@<?>:0/0 lens 240/0 e 0 to 0 dl 1218616308 ref 1 fl Interpret:/0/0 rc -19/0
Aug 13 17:26:48 lustre-oss-0-1 kernel: LustreError: 11094:0:(ldlm_lib.c:1536:target_send_reply_msg()) Skipped 3 previous similar messages
Aug 13 17:26:48 lustre-oss-0-1 kernel: LustreError: Skipped 3 previous similar messages
Aug 13 17:48:56 lustre-oss-0-1 kernel: LustreError: 137-5: UUID 'lfs-OST0004_UUID' is not available for connect (no target)
Aug 13 17:48:56 lustre-oss-0-1 kernel: LustreError: 10984:0:(ldlm_lib.c:1536:target_send_reply_msg()) @@@ processing error (-19) req@ffff81010fc86600 x50/t0 o8-><?>@<?>:0/0 lens 240/0 e 0 to 0 dl 1218617636 ref 1 fl Interpret:/0/0 rc -19/0
Aug 13 17:48:56 lustre-oss-0-1 kernel: LustreError: 10984:0:(ldlm_lib.c:1536:target_send_reply_msg()) Skipped 1 previous similar message
Aug 13 17:48:56 lustre-oss-0-1 kernel: LustreError: Skipped 1 previous similar message
Aug 13 18:47:39 lustre-oss-0-1 kernel: LustreError: 137-5: UUID 'lfs-OST0005_UUID' is not available for connect (no target)
Aug 13 18:47:39 lustre-oss-0-1 kernel: LustreError: 11070:0:(ldlm_lib.c:1536:target_send_reply_msg()) @@@ processing error (-19) req@ffff81022861b400 x49/t0 o8-><?>@<?>:0/0 lens 240/0 e 0 to 0 dl 1218621159 ref 1 fl Interpret:/0/0 rc -19/0
Aug 13 18:47:39 lustre-oss-0-1 kernel: LustreError: 11070:0:(ldlm_lib.c:1536:target_send_reply_msg()) Skipped 1 previous similar message
Different OSS:
Aug 12 20:13:49 lustre-oss-6-0 kernel: LustreError: 137-5: UUID 'lfs-OST0050_UUID' is not available for connect (no target)
Aug 12 20:13:49 lustre-oss-6-0 kernel: LustreError: 13527:0:(ldlm_lib.c:1536:target_send_reply_msg()) @@@ processing error (-19) req@ffff8103d3b79a00 x124/t0 o8-><?>@<?>:0/0 lens 240/0 e 0 to 0 dl 1218539929 ref 1 fl Interpret:/0/0 rc -19/0
Aug 12 20:13:49 lustre-oss-6-0 kernel: LustreError: 13527:0:(ldlm_lib.c:1536:target_send_reply_msg()) Skipped 1 previous similar message
Aug 12 20:13:49 lustre-oss-6-0 kernel: LustreError: Skipped 1 previous similar message
Aug 12 20:13:55 lustre-oss-6-0 kernel: LustreError: 137-5: UUID 'lfs-OST004f_UUID' is not available for connect (no target)
Aug 12 20:13:55 lustre-oss-6-0 kernel: LustreError: 13521:0:(ldlm_lib.c:1536:target_send_reply_msg()) @@@ processing error (-19) req@ffff8103d3e92a00 x125/t0 o8-><?>@<?>:0/0 lens 240/0 e 0 to 0 dl 1218539935 ref 1 fl Interpret:/0/0 rc -19/0
Aug 12 20:13:55 lustre-oss-6-0 kernel: LustreError: 13521:0:(ldlm_lib.c:1536:target_send_reply_msg()) Skipped 1 previous similar message
Aug 12 20:13:55 lustre-oss-6-0 kernel: LustreError: Skipped 1 previous similar message
Aug 12 20:13:58 lustre-oss-6-0 kernel: LustreError: 137-5: UUID 'lfs-OST004f_UUID' is not available for connect (no target)
Aug 12 20:13:58 lustre-oss-6-0 kernel: LustreError: 28121:0:(ldlm_lib.c:1536:target_send_reply_msg()) @@@ processing error (-19) req@ffff8103d3983c00 x125/t0 o8-><?>@<?>:0/0 lens 240/0 e 0 to 0 dl 1218539938 ref 1 fl Interpret:/0/0 rc -19/0
Aug 12 20:13:58 lustre-oss-6-0 kernel: LustreError: 28121:0:(ldlm_lib.c:1536:target_send_reply_msg()) Skipped 5 previous similar messages
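
To see which targets the refused connects name, and how often, the 137-5
lines can be tallied per server and OST UUID. Below is a minimal sketch in
Python, not a Lustre tool: the function name tally_no_target and the regular
expression are illustrative assumptions, and it expects each syslog entry on
a single line (as in the raw log file, not the wrapped copy in a mail). The
-19 in the request lines is errno 19, which is ENODEV on Linux; the sketch
looks that name up rather than hard-coding it.

```python
import errno
import re
from collections import Counter

# Hypothetical pattern for the "137-5 ... not available for connect" line,
# written against the log excerpts in this post.
NO_TARGET = re.compile(
    r"^(?P<stamp>\w{3}\s+\d+\s+[\d:]+)\s+(?P<host>\S+)\s+kernel:\s+"
    r"LustreError:\s+137-5:\s+UUID\s+'(?P<uuid>[^']+)'\s+"
    r"is not available for connect"
)


def tally_no_target(lines):
    """Count refused-connect messages per (server, OST UUID) pair."""
    counts = Counter()
    for line in lines:
        match = NO_TARGET.search(line)
        if match:
            counts[(match.group("host"), match.group("uuid"))] += 1
    return counts


# A few entries taken from the excerpt above, one entry per line.
sample = [
    "Aug 13 17:26:48 lustre-oss-0-1 kernel: LustreError: 137-5: UUID "
    "'lfs-OST0004_UUID' is not available for connect (no target)",
    "Aug 13 17:26:48 lustre-oss-0-1 kernel: LustreError: Skipped 3 "
    "previous similar messages",
    "Aug 13 17:48:56 lustre-oss-0-1 kernel: LustreError: 137-5: UUID "
    "'lfs-OST0004_UUID' is not available for connect (no target)",
    "Aug 13 18:47:39 lustre-oss-0-1 kernel: LustreError: 137-5: UUID "
    "'lfs-OST0005_UUID' is not available for connect (no target)",
]

if __name__ == "__main__":
    for (host, uuid), count in sorted(tally_no_target(sample).items()):
        print(f"{host} {uuid} {count}")
    # Symbolic name for the error code seen in "rc -19/0".
    print("rc -19 is", errno.errorcode[19])
```

Note that the "Skipped N previous similar messages" lines mean the kernel
rate-limited its output, so a tally like this is a lower bound on the
number of refused connects.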